What Is AI Citability? The Five-Pillar Framework
AI citability is the probability that an LLM cites your site when someone asks a question in your topic area. It is measurable, it behaves differently from classical SEO, and it is built from five distinct structural pillars. Most sites score 0% at baseline not because their content is bad but because their structure is illegible to the retrieval pipeline LLMs use. This is the framework for understanding how citations happen, how to measure your site's current citability, and how to fix the specific pillar blocking you.
The Problem: AI Answers Without You
Search behavior is splitting into two pipelines. Users asking a question still type it into Google, but a growing share ask ChatGPT, Perplexity, or Claude instead. When those systems answer, they cite sources. Those cited sites get the trust. The uncited sites get nothing: no link, no mention, no future referral. The search result pages that drove traffic for 20 years are being replaced by synthesized answers with a small number of named sources, and the sources are not chosen the way Google picks the top 10.
You can rank first for a term on Google and score 0% when an AI system answers the same question. This is not a Google bug. AI answer engines use a different selection process, weight different signals, and extract passages rather than listing pages. A site that was built for humans clicking blue links will not automatically transfer into AI answer surfaces. The structural changes required are small, but they are specific, and without them your visibility erodes as search volume shifts to AI engines.
This is the core problem citability.dev was built to measure and fix. The free scan audits the 10 infrastructure signals that correlate with AI citability. The full audit measures your actual citation rate across four AI engines and maps the gaps. The framework below is what those audits are built on.
Defining AI Citability
AI citability is the probability, expressed as a rate, that an AI answer engine names your site as a source when responding to a relevant topic query in your domain. It is measured as citations divided by queries tested, per platform. A site with a 30% citability rate on Perplexity appears as a cited source in roughly 3 of every 10 relevant queries on that platform. A rate of 0% means AI systems are answering questions in your niche without ever crediting your content.
Citability and search rank are related but distinct. The table below clarifies how the two pipelines differ:
| Dimension | Google SERP | LLM Answer Engine |
|---|---|---|
| Consumer | Human skimming 10 results | Model extracting 2-4 source passages |
| Output | List of links | Synthesized answer with inline citations |
| Selection unit | Whole page | Specific passage or sentence |
| Primary signals | Backlinks, E-E-A-T, keyword match | Retrievability, chunk quality, entity recognition |
| Index source | Google | Bing (ChatGPT), Google (Gemini/AI Overviews), proprietary (Perplexity) |
| Freshness tolerance | Flexible | Strict, especially for time-sensitive queries |
| Failure mode | Page not in top 10 | Page not quoted or referenced |
The practical consequence: optimizing only for Google leaves half the visibility pipeline unaddressed. Optimizing only for citability without a baseline of classical SEO leaves you invisible to the AI systems that ride Google's index (Gemini and AI Overviews, the successors to Bard and SGE). Both pipelines need attention, and the signals only partially overlap.
The Five Pillars of AI Citability
Every citable page satisfies five structural conditions. Missing any one pillar caps total citability at roughly 30% regardless of how strong the others are. The pillars are not ranked by importance because the weakest one becomes the bottleneck. You improve citability by identifying which pillar is blocking your site and fixing that one first.
1. Retrievability
An LLM cannot cite a page its crawlers cannot reach. This is the most common self-inflicted failure because it is invisible from the site owner's perspective. The page loads fine in a browser, appears in Google, and feels "live," but the AI crawler hits a 403 or sees an empty HTML body and drops the page from consideration.
Retrievability has three components:

- robots.txt must not block GPTBot (OpenAI), ClaudeBot and anthropic-ai (Anthropic), PerplexityBot (Perplexity), Google-Extended (Gemini training and retrieval), or CCBot (Common Crawl, which feeds many training pipelines). Some sites added AI crawler blocks during the 2023 "no AI training on my content" wave and never removed them. If you want AI citations, those blocks must come off.
- Your content must render in HTML without requiring JavaScript execution, because most LLM crawlers do not run JS. Single-page apps that ship blank HTML and hydrate client-side are effectively invisible.
- Your sitemap must be submitted to Bing in addition to Google, because ChatGPT Search and Copilot rely on Bing's index more than most SEO professionals realize.
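If you are unsure whether your robots.txt blocks any of these crawlers, Python's standard library can check it for you. A minimal sketch (the user-agent list mirrors the crawlers named above; feed it the contents of your own robots.txt):

```python
from urllib.robotparser import RobotFileParser

# The AI crawler user agents discussed above.
AI_CRAWLERS = [
    "GPTBot", "ClaudeBot", "anthropic-ai",
    "PerplexityBot", "Google-Extended", "CCBot",
]

def blocked_crawlers(robots_txt: str, url: str = "https://example.com/") -> list[str]:
    """Return the AI crawlers that this robots.txt blocks for the given URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [ua for ua in AI_CRAWLERS if not parser.can_fetch(ua, url)]

sample = """
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""
print(blocked_crawlers(sample))  # ['GPTBot']
```

Run it against your live file (fetch https://yourdomain.com/robots.txt first) and an empty result means no AI crawler is blocked at the robots.txt layer.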
The fix is usually a one-line change in robots.txt plus a server-side rendering audit, which is exactly what the citability.dev free scan checks first.
2. Answer-First Structure
LLMs retrieve passages, and rerankers favor passages where the direct answer appears early. A 2000-word article that buries the answer under 800 words of context will lose retrieval to a 500-word page that states the answer in the first paragraph. This reverses the "engagement first" writing pattern many content teams were trained on.
The structural rule is: every page answers one question per heading, and the paragraph directly after each heading is the direct answer. Context, nuance, and expansion come after the answer, not before. The heading "What is structured data?" is answered in the first sentence under it, not the fifth. FAQPage schema markup amplifies this because models recognize the question-answer pairing as a retrievable unit.
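FAQPage markup is simple enough to generate programmatically. A sketch using the schema.org Question and Answer properties; the example question is illustrative, and the emitted JSON-LD goes inside a script tag of type application/ld+json:

```python
import json

def faq_jsonld(pairs: list[tuple[str, str]]) -> str:
    """Build FAQPage JSON-LD from (question, answer) pairs."""
    data = {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in pairs
        ],
    }
    return json.dumps(data, indent=2)

snippet = faq_jsonld([
    ("What is structured data?",
     "Structured data is machine-readable markup that labels page content."),
])
# Embed `snippet` in the page head as <script type="application/ld+json">...</script>
```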
This is also the pillar that trips up authoritative content on older sites. The writing is expert-level but the structure predates AI retrieval. Rewriting the first paragraph of every core page to lead with the answer, without changing the underlying content, measurably lifts citability within weeks of re-crawl.
3. Chunkability
LLM retrieval operates on passage-sized chunks, typically 256 to 1024 tokens. A 3000-word page is not retrieved as one unit. It is chunked, each chunk scored, and the top-scoring chunks surface for the model to synthesize. Long paragraphs without clear structural breaks produce chunks that contain partial arguments and partial context, which rerank poorly.
Chunkable content has three properties. Paragraphs are 3 to 8 sentences. Headings appear frequently enough that no single section runs past 500 words. Tables, bullet lists, and numbered steps replace prose walls when the content is naturally enumerable. These are writing conventions, not infrastructure, which is why chunkability gaps persist even on sites with strong technical SEO.
The companion metric is "passage answerability": if you extract any 400-token window from your page, does it answer a meaningful question on its own? If the answer requires context from 600 tokens away, your chunks are load-bearing on each other and retrieval will miss most of them.
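You can approximate this audit yourself. A rough sketch that splits a page into overlapping windows, using word count as a stand-in for model tokens (real retrieval systems chunk on the embedding model's tokenizer, and window sizes vary by engine):

```python
def chunk_by_tokens(text: str, window: int = 400, overlap: int = 50) -> list[str]:
    """Split text into overlapping word windows.

    Word count is a rough proxy for tokens; production retrieval
    chunks with the embedding model's own tokenizer.
    """
    words = text.split()
    step = window - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + window]))
        if start + window >= len(words):
            break
    return chunks

doc = ("word " * 900).strip()          # a 900-word stand-in page
chunks = chunk_by_tokens(doc)          # three overlapping windows
```

Read each chunk in isolation and ask whether it answers a question on its own; the ones that do not are the chunks retrieval will skip.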
4. Entity Authority
LLMs do not cite unknown entities. Even a page with perfect retrievability, answer-first structure, and flawless chunking will score low if the brand behind the page is not an entity the model recognizes from its training data.
Entity authority comes from three sources. The model's training corpus, which includes Wikipedia, Common Crawl, and curated datasets: a brand mentioned 500 times in the training set is a recognized entity; a brand mentioned zero times is not. The Knowledge Graph, which Google's Gemini uses directly and which Perplexity draws on through named-entity linking: a Wikidata entry, a Knowledge Panel, and consistent Organization schema across your site all feed this. And retrieval-time authority signals: how many authoritative domains in your niche link to you, whether authoritative sources cite your methodology, and how your brand compares against entity-competitors in the same topic cluster.
This is the slowest pillar to move because training data updates lag by months and Knowledge Graph recognition requires qualifying citations from recognized sources. It is also the highest-ceiling pillar because a strongly recognized entity with mediocre content often outranks a weakly recognized entity with excellent content in AI answers. See Why AI Recommends Competitors, Not You for the mechanism.
5. Paraphrase Coverage
Users ask the same question in dozens of ways, and LLMs generate multiple query paraphrases internally during retrieval, a process called query fan-out. Your page needs to satisfy a sufficient fraction of those paraphrases to be retrieved regardless of how the user phrased the original question.
Paraphrase coverage means a single page answers "how do I measure AI citability," "what is AI citation rate," "how often does ChatGPT cite my website," and "how to track AI search visibility" all together. The surface forms are different, the underlying information need is the same, and the page must cover the semantic space broadly enough that any reasonable paraphrase retrieves it. This is partly a keyword research exercise (what phrasings exist) and partly a content organization exercise (cover them inside one page rather than spawning five thin pages).
The inverse failure is over-fragmentation: five pages each targeting one paraphrase, none of them strong enough individually to be retrieved, and cannibalizing each other on classical SEO. Consolidation typically helps both pipelines.
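A crude way to sanity-check coverage before consolidating is a lexical containment test. This toy counts a paraphrase as covered when all of its content words appear on the page; real retrieval matches on embeddings, so treat the score as a floor, not a verdict:

```python
def paraphrase_coverage(page_text: str, paraphrases: list[str]) -> float:
    """Fraction of paraphrases whose content words all appear on the page.

    Toy lexical check only; real engines match on embeddings.
    """
    STOP = {"how", "do", "i", "is", "what", "to", "a", "the", "my"}
    page_words = set(page_text.lower().split())
    covered = 0
    for query in paraphrases:
        terms = {w for w in query.lower().split() if w not in STOP}
        if terms and terms <= page_words:
            covered += 1
    return covered / len(paraphrases)

page = "we measure ai citability and track the ai citation rate per engine"
queries = [
    "how do I measure AI citability",
    "what is AI citation rate",
    "how to track AI search visibility",
]
score = paraphrase_coverage(page, queries)  # 2 of 3 paraphrases covered
```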
How Citation Happens: The LLM Answer Pipeline
Understanding citation is easier once you see the pipeline. An AI answer engine processes a query in five stages:
- Query understanding. The raw user question is rephrased into retrieval queries. The model may generate 3 to 10 paraphrases, expanding the surface area of the original query. This is why paraphrase coverage matters.
- Retrieval. Each retrieval query is sent against the model's index. For ChatGPT and Copilot, that index is Bing. For Gemini and AI Overviews, it is Google's index plus the Knowledge Graph. For Perplexity, it is a proprietary web index built on top of Common Crawl and custom crawls. Top-N candidate passages are returned per query.
- Passage reranking. Candidate passages from all retrieval queries are pooled, deduplicated, and reranked on relevance, authority, recency, and passage quality. The top 5 to 10 passages survive to synthesis.
- Synthesis. The model is given the surviving passages as grounded context and generates an answer. This is where the actual citation selection happens: the model writes the answer, sentence by sentence, and attaches citations to the passages it drew from.
- Citation rendering. The final answer is returned with inline numbered citations, source cards, or sidebar footnotes depending on the platform.
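The five stages can be sketched end to end. Everything in this toy is a stand-in (the fan-out rules, the word-overlap scoring, the top-N cutoffs); production engines use learned retrievers and rerankers, but the data flow is the same:

```python
def fan_out(query: str) -> list[str]:
    # Stage 1: query understanding. Real engines generate paraphrases
    # with the model; these trivial rewrites are placeholders.
    return [query, query.replace("how do I", "how to"), f"what is {query}"]

def retrieve(q: str, passages: list[str], n: int = 3) -> list[str]:
    # Stage 2: retrieval against an index (Bing, Google, or proprietary).
    # Toy scoring: word overlap between query and passage.
    q_words = set(q.lower().split())
    return sorted(passages,
                  key=lambda p: len(q_words & set(p.lower().split())),
                  reverse=True)[:n]

def rerank(candidates: list[str], query: str, n: int = 2) -> list[str]:
    # Stage 3: pool, dedupe (first-seen order), rerank, keep top n.
    pooled = list(dict.fromkeys(candidates))
    q_words = set(query.lower().split())
    pooled.sort(key=lambda p: len(q_words & set(p.lower().split())),
                reverse=True)
    return pooled[:n]

def answer(query: str, passages: list[str]) -> tuple[str, list[str]]:
    # Stages 4-5: synthesis and citation rendering, faked here by
    # returning the surviving passages as numbered sources.
    candidates = [p for q in fan_out(query) for p in retrieve(q, passages)]
    sources = rerank(candidates, query)
    cited = " ".join(f"[{i + 1}]" for i in range(len(sources)))
    return f"Answer grounded in {cited}", sources

passages = [
    "AI citability is the probability an engine cites you",
    "Bananas are yellow",
    "Measure citability with a 20-query protocol",
]
text, sources = answer("measure AI citability", passages)
# The off-topic passage never survives reranking.
```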
Your site's citability is a function of how well your pages perform at stages 2, 3, and 4. Retrievability determines whether you enter the candidate pool. Answer-first structure and chunkability determine whether you survive reranking. Entity authority and content quality determine whether the model cites you rather than just using your passage for context.
Measuring Your Citability
Measurement has two parts. Citation rate is the outcome you track. Infrastructure score is the leading indicator that predicts the outcome before the crawlers re-index.
For citation rate, use the 20-query protocol: pick 20 topic queries your audience asks, run each on ChatGPT Search, Perplexity, and Claude, and count citations per platform. This is the same process documented step-by-step in How to Measure Your AI Citation Rate. Twenty queries is the minimum for meaningful signal because fewer produces a confidence interval so wide the result is not actionable. At n=20 with 3 citations measured (a 15% rate), your true rate is likely between 5% and 36%. At n=50 with the same 15% measured rate, the interval narrows to roughly 7% to 28%.
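The intervals quoted above follow from a Wilson score interval, a standard choice for small-sample proportions. A small helper to compute your own:

```python
from math import sqrt

def wilson_interval(cited: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """95% Wilson score interval for a measured citation rate."""
    p = cited / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    margin = (z * sqrt(p * (1 - p) / n + z**2 / (4 * n**2))) / denom
    return center - margin, center + margin

lo, hi = wilson_interval(3, 20)  # ~ (0.05, 0.36)
```

Plug in your own counts after each measurement run; when the interval is too wide to act on, add queries rather than re-running the same 20.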
For infrastructure, use the citability.dev free scan against your domain. It checks 10 signals across the five pillars: robots.txt permissions for AI crawlers, sitemap validity and Bing submission, rendering mode, per-page structured data coverage, heading hierarchy and answer-first layout, freshness signals, and entity recognition via Knowledge Graph presence. The output is a 0-100 score with per-check pass/fail results and specific remediation for each gap.
The combined view is diagnostic. A high infrastructure score with a low citation rate points to content problems inside the five pillars (usually answer-first or entity authority). A low infrastructure score with a low citation rate points to fixable structure: robots, schema, rendering. Fix infrastructure first because it gates everything else.
Common Failure Patterns
Five failure patterns account for the majority of low-citability sites.
The SPA wall. React, Vue, or Svelte apps that render client-side ship empty HTML. Most LLM crawlers do not execute JavaScript, so your entire site is effectively invisible. Fix: server-side render or pre-render core content pages. Next.js, Nuxt, SvelteKit, and Astro all handle this by default if you opt into SSR or SSG.
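A rough way to test a page for the SPA wall is to check how much visible text survives in the raw HTML, since that is all a non-JS crawler sees. The 200-character threshold below is an arbitrary heuristic, not any crawler's actual rule:

```python
from html.parser import HTMLParser

class VisibleText(HTMLParser):
    """Collects the text a non-JS crawler would see in raw HTML."""
    SKIP = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self.depth = 0       # nesting inside skipped tags
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth == 0 and data.strip():
            self.chunks.append(data.strip())

def looks_like_spa_wall(html: str, min_chars: int = 200) -> bool:
    """True when the raw HTML carries too little visible text."""
    parser = VisibleText()
    parser.feed(html)
    return len(" ".join(parser.chunks)) < min_chars

empty_shell = '<html><body><div id="root"></div><script src="/app.js"></script></body></html>'
# looks_like_spa_wall(empty_shell) -> True: nothing for a non-JS crawler to read
```

Fetch your pages with a plain HTTP client (no browser) and run them through this check; a True on a core content page means AI crawlers see a blank wall.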
The silent crawler block. robots.txt blocks GPTBot or ClaudeBot from a policy written in 2023 that was never revisited. The site owner does not realize it because traffic looks normal. Fix: audit robots.txt against the LLM crawler inventory and remove obsolete blocks. The What AI Crawlers See breakdown lists every crawler UA string you need to check.
The buried answer. Pages that are technically answer-first on narrow queries but bury the answer on broader ones. "What is citability" is answered in the first paragraph; "how do I improve citability" is answered in paragraph eight. Retrieval favors the former and drops the latter. Fix: rewrite top-level summaries for every core page so each heading's first paragraph is self-contained.
The wall of prose. Content that reads well to a human but chunks poorly. A 2000-word article with five headings and no tables or lists. Retrieval chunks it into 400-token windows, each window partial, none individually retrievable. Fix: add headings every 300-500 words, convert enumerable content to lists or tables, and add an FAQ section with schema at the bottom.
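A quick audit for this pattern: flag any section whose body runs past the 500-word guideline. The sketch below assumes markdown with ATX-style hash headings; adapt the pattern to whatever your CMS exports:

```python
import re

def oversized_sections(markdown: str, max_words: int = 500) -> list[tuple[str, int]]:
    """Return (heading, word_count) for sections exceeding max_words.

    Assumes ATX-style '#' headings; adjust the regex for other formats.
    """
    parts = re.split(r"^(#{1,6} .+)$", markdown, flags=re.M)
    # Captured headings land at odd indices; parts[0] is any preamble.
    flagged = []
    for i in range(1, len(parts), 2):
        heading = parts[i].lstrip("# ").strip()
        words = len(parts[i + 1].split())
        if words > max_words:
            flagged.append((heading, words))
    return flagged

sample = "# Short\n" + "word " * 100 + "\n# Long\n" + "word " * 600
# oversized_sections(sample) flags only the 600-word section
```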
The entity ghost. A site with good content, good infrastructure, and no Wikipedia entry, no Wikidata, no Knowledge Graph presence, and no mentions in authoritative peer domains. LLMs treat the site as an unknown entity and cite recognized competitors even when the unknown site has better content. Fix: build entity authority deliberately through schema markup, Wikidata submission, and citations from recognized domains. Slower to move, bigger ceiling when it lands.
The Citability Maturity Model
Sites progress through five levels:
- L0: Invisible. 0% citability. Crawlers blocked or content unrenderable. Most personal sites and newer SaaS start here.
- L1: Reachable. 1-10% citability. Crawlers can reach the site but content structure is not optimized for passage retrieval.
- L2: Structured. 10-30% citability. Five pillars addressed at baseline; citations happen on some topic queries but not consistently.
- L3: Authoritative. 30-60% citability. Strong on most pillars plus growing entity recognition. Consistent citation across at least two AI platforms.
- L4: Canonical. 60%+ citability. The site is a first-choice source in its topic area. Ahrefs and Semrush are at L4 for SEO topics. Most niches have zero or one L4 sites, which is why picking the right niche and building for L4 is the highest-leverage strategy for emerging brands.
The path from L0 to L2 is mechanical and takes 30 to 60 days once the fixes land and crawlers re-index. L2 to L3 requires content investment plus entity authority work and takes a quarter. L3 to L4 requires sustained topical depth over 12+ months and is where most sites plateau.
Further Reading
The companion posts in this series go deeper on specific pillars:
- How to Measure Your AI Citation Rate (Step-by-Step): the full 20-query protocol with baseline data from six domains
- Benchmarking AI Visibility Across 6 Sites: what citation rates actually look like in the wild
- What AI Crawlers See (and What They Miss): the retrievability pillar in depth
- Schema Markup AI Actually Uses: the structured data subset that feeds AI retrieval
- Why AI Recommends Competitors, Not You: the entity authority pillar and how to close the gap
Citability is measurable, buildable, and worth tracking quarterly. Run the free scan to baseline your current state across the five pillars, then return to this framework for the next move.