Ashvara
Blog/Strategy
Strategy

How to show up in AI search: GEO in 2026

AI Overviews now hit nearly half of all searches. Here's how Generative Engine Optimization actually works — backed by the research — and why it's SEO's next chapter, not its replacement.

S
Sahil Jain
Strategy · Ashvara
Jun 26, 2026
6 min read
GEO / AI search

Generative Engine Optimization (GEO) is the practice of getting your content cited inside AI-generated answers — on ChatGPT, Google's AI Overviews, Gemini, Perplexity, and Claude. If classic SEO is about ranking on the page, GEO is about being in the answer. And in 2026 it's no longer optional: Google's AI Overviews now appear on roughly 48% of searches — up about 58% year over year — and most of those answers resolve the user's question without a single click on a blue link.

The good news, which we'll back with evidence below: GEO isn't a mysterious new discipline. It's a measurable extension of SEO, and the tactics that work have actually been studied.

The discovery channel fractured

A year ago "AI search" basically meant ChatGPT. Not anymore:

  • ChatGPT's share of generative-AI traffic fell from 86.7% to 64.5% in twelve months.
  • Gemini climbed from 5.7% to 21.5% in the same period.
  • AI Overviews keep expanding into more query types, and Perplexity, Claude, and Copilot all send real traffic.

So your customers now discover businesses across several AI surfaces — and every one of them collapses ten blue links into one synthesized answer with a handful of cited sources. The entire game is becoming one of those sources.

How the engines actually choose sources

This is where most GEO advice hand-waves. Mechanically, AI answer engines don't "rank" pages the way Google's ten links do — they retrieve and ground. When you ask a question, the engine runs live searches, pulls candidate passages from pages it can access and trust, and synthesizes an answer that quotes or paraphrases a few of them. ChatGPT leans on a web index and live browsing; AI Overviews draw on Google's own index; Perplexity retrieves and cites in real time.

Two practical consequences fall out of this:

  • It's passage-level, not page-level. The engine lifts a chunk that cleanly answers the question — so a single well-structured section can get you cited even if the whole page doesn't "rank."
  • Trust and identity matter. Models favor sources they can recognize and that the wider web vouches for. An unknown domain with no mentions is a weak citation candidate.

What the research actually shows

Here's the part that turns GEO from folklore into engineering. A Princeton/Georgia Tech-led study, GEO: Generative Engine Optimization (presented at KDD 2024), tested nine content tactics across 10,000 real queries and measured how each changed a source's visibility in AI answers. The standouts:

  • Adding expert quotations: +41% visibility
  • Adding relevant statistics: +32%
  • Citing authoritative sources: +30%
  • And separately: 44% of all AI citations come from the first third of the text.

The single most reliable way to get cited by an AI is to write content that is itself well-cited, specific, and front-loaded with the answer.

Notice that this entire article is built that way on purpose — a direct definition up top, real numbers, named sources. That's not a coincidence; it's the method.

The thesis: GEO and SEO are converging

GEO is not replacing SEO. They are becoming the same discipline.

AI engines ground their answers in the live web, which means the fundamentals that make Google rank you — crawlable, fast, server-rendered pages; a credible brand; links and mentions — are the same signals that make an AI willing to cite you. The worst reaction to GEO is to abandon SEO basics and chase a "hack." There is no hack. The pages getting cited by AI are overwhelmingly the pages that already did SEO properly; GEO just changes how you write and structure on top of that foundation.

The practical playbook (mapped to the evidence)

  1. Front-load a direct, quotable answer. Since ~44% of citations come from the first third, lead each page with a clean, self-contained answer, then support it. Burying it in paragraph nine gets you skipped.
  2. Add statistics, quotations, and citations. This is the highest-leverage move the research found (+30–41%). Cite credible sources by name; include real numbers; quote experts.
  3. Structure for extraction. Semantic headings, short paragraphs, lists, comparison tables, and an FAQ give engines clean, liftable chunks. Add JSON-LD (Organization, Article, FAQPage, Product) so machines know what each page is.
  4. Be a recognizable entity. Consistent brand name, an Organization schema with sameAs links to your real profiles, and mentions across the web teach models "this company exists and is trusted."
  5. Earn citations and mentions — backlinks still matter, and so do plain brand mentions on sites the engines already read (reviews, roundups, forums).
  6. Be machine-readable and fast. Server-rendered HTML (not content trapped behind heavy JavaScript), a clean sitemap, and an llms.txt that points AI crawlers at your best pages.

Measure it — or you're guessing

AI traffic is famously under-counted. GA4 added a native "AI Assistant" channel in 2026, but it still misses an estimated 35–70% of AI sessions, because the AI app or in-app browser strips the referrer and the visit lands in GA4 as "direct." The fix is a custom channel group with a regex covering chatgpt.com|perplexity.ai|gemini.google.com|claude.ai|copilot.microsoft.com (and review it quarterly as new engines appear). It's worth the effort: AI referral traffic has been measured converting at roughly 4.4× the rate of traditional organic — these are people who arrived already half-sold by the AI's recommendation. Beyond traffic, track the new "rank check": does an AI name your brand when asked about your space?

What doesn't work

Gaming this backfires faster than old-school SEO. Keyword-stuffed, robot-targeted pages are exactly what these models are trained to ignore; content hidden behind client-side JavaScript may never be retrieved; and fabricated "stats" get you dropped the moment they don't check out. The durable strategy is almost old-fashioned: be the clearest, most credible, most genuinely useful answer to a real question — and make it trivially easy for both people and machines to read.

How Ashvara helps

This is exactly how we build. Every site we ship is server-rendered and fast, carries full structured data and an Organization entity with sameAs, publishes an llms.txt, and is written to answer real questions clearly — so it's eligible to be cited by AI, not just ranked by Google. We built our own site that way, and we bake it into client work from the first commit instead of bolting it on later.

If you want a site that shows up in the AI answers your customers are already reading — and keeps ranking the classic way too — tell us what you're building, or see how we approach web development.


Source on the tactic data: “GEO: Generative Engine Optimization,” Aggarwal et al., KDD 2024 (arxiv.org/abs/2311.09735).

S
Sahil Jain

Founder at Ashvara. Writes about local-first data, native iOS, and shipping software that stays shipped.

Building something? Let's talk.