← All articles

How to Measure Your Domain's AI Citation Rate: Tools + Manual Method (2026)

Compare 3 ways to measure your domain's AI citation rate: free 2-minute scan, manual 20-query test, or automated API testing. Real baseline from 7 domains — one site at 15% citation rate outperformed Ahrefs (DA 92) at 5%.

Chudi Nnorukam||7 min read

Your AI citation rate is the percentage of topic queries where AI search engines name your site as a source. You can measure it manually in under an hour: pick 20 queries in your niche, run them in ChatGPT Search and Perplexity, count how many return your domain, divide by 20. That number is your baseline. Most sites have never checked. The conceptual foundation is in What Is AI Citability, which defines citability as a measurable property and lays out the five pillars that drive it. This post is the procedural counterpart: how to actually measure your rate. When we ran this process across seven well-known domains as part of building citability.dev, the results were clarifying.

What Does Real Baseline Data Look Like?#

Before building the AVR (AI Visibility Readiness) framework, we audited seven domains and separated two metrics that are easy to conflate. Visibility is when an AI engine mentions your brand or topic in its answer, with or without a link. Citation is the higher bar: the engine links to a URL on your domain as a source. Here is what the baseline showed:

DomainDomain AuthorityAI VisibilityAI CitationNotes
ahrefs.com92100%5%Foundation-ready: always mentioned, rarely the cited source
citability.devunder 1044%15%Foundation-strong: 3x Ahrefs citation despite far lower authority
chudi.dev2825%0%Foundation-strong infrastructure, no topic citations at baseline
semrush.com91PartialPartialFoundation-ready; strong schema; only partially scored
reddit.com97UntestedUntestedFailed basic infrastructure checks; cited via training data, not structure
medium.com95UntestedUntestedFailed basic infrastructure checks
x.com96UntestedUntestedFailed basic infrastructure checks

Source: /avr-benchmark.json (measured 2026-04-07, re-measured quarterly). Visibility = mentioned in an AI answer; citation = linked as the source.

The 0% citation for chudi.dev at baseline is not an edge case. It is the starting point for most personal sites: the infrastructure audit found no per-page structured data, no FAQPage or HowTo schema, and content that answered questions but not in an AI-readable format.

The most important pattern is the gap between the two columns. Ahrefs (Domain Authority 92) is mentioned in essentially every relevant answer, a 100% visibility rate, yet it is the cited source only 5% of the time. Meanwhile citability.dev, with Domain Authority under 10, reached a 15% citation rate, three times Ahrefs. Domain authority predicts visibility; it does not predict citation. Citation is a structure problem, won with answer-first content and clean schema rather than backlinks, and that is the gap this guide helps you measure.

Tools That Measure Domain Citation Rate#

Three approaches exist, with different tradeoffs:

ToolWhat It MeasuresCostManual Work
citability.dev scan10 infrastructure signals that predict citation likelihood + crawl accessFree (basic), paid (full AVR)2 minutes
Manual API testingActual citation rate: run queries in ChatGPT, Perplexity, and Claude, then count citationsFree~1 hour for 20 queries
Browser automationAutomated citation rate measurement via Playwright or the OpenAI/Anthropic API with web searchFree (self-hosted)Setup: 4–8 hours

The infrastructure scan tells you why you are not cited (structural problems). The manual method tells you whether you are cited (your actual rate). Use both: scan first to fix blocking issues, then measure to establish a baseline.

Step 1: Define Your Topic Query Set#

Do not test with branded queries ("chudi nnorukam consulting" or "citability.dev review"). Those test whether AI knows you exist, not whether AI cites you as an authority on your topics.

Instead, write down the 10-20 questions your target readers are asking that your content should answer. For a site like citability.dev, that means queries like:

  • "how to improve AI citability"
  • "why is my site not cited by ChatGPT"
  • "what is answer engine optimization"
  • "how do AI search engines choose sources"
  • "how to get cited by Perplexity"

Group them into 3-5 clusters by intent. Topic clusters matter because citation rate can vary dramatically by intent type. Informational queries ("what is X") often show different citation patterns than procedural queries ("how to X") or comparative queries ("X vs Y").

Step 2: Run the Manual Query Test#

Open ChatGPT with web search enabled (the globe icon). Run each query. For each response, look for cited sources in the footnotes or sidebar. Check whether your domain appears. Log it as yes or no.

Repeat for Perplexity (which shows sources prominently as cards) and Claude.ai (check for cited URLs in responses).

The logging format does not need to be complex:

Query: "how to measure AI citability"
ChatGPT: no citation
Perplexity: no citation
Claude: no citation
 
Query: "what is structured data for SEO"
ChatGPT: schema.org cited, google.com cited
Perplexity: search.google.com cited
Claude: no web citations returned

Twenty queries across three platforms gives you 60 data points per platform after normalization. Expect significant variation. Perplexity typically cites more sources per response than ChatGPT. Claude in standard mode does not browse the web by default, so limit those tests to Claude Search or note when web access is enabled.

Step 3: Calculate Your Citation Rate#

The math is straightforward. Count how many of your queries returned your domain as a cited source. Divide by total queries. Multiply by 100.

queries_tested = 20
times_cited = 3
 
citation_rate = (times_cited / queries_tested) * 100
# Result: 15.0%

Calculate this separately per platform since rates diverge. A site might appear in 25% of Perplexity results and 5% of ChatGPT results, which points to different structural issues (Perplexity weighs recency and structured data; ChatGPT Search weighs Bing index signals and domain authority).

For a 95% confidence interval on your measured rate, use the Wilson score interval. At n=20 and 15% measured rate, your true rate is likely between 5% and 36%. That is a wide range. At n=50 with 15% measured, the CI narrows to 7%-28%. This is why sample size matters before making infrastructure changes.

Step 4: Run the Infrastructure Scan#

Manual citation testing tells you your current rate. Infrastructure scanning tells you why.

The citability.dev free scan checks 10 signals across four categories:

  • Crawl access: robots.txt permissions for GPTBot, ClaudeBot, PerplexityBot. Sitemap existence and validity.
  • Rendering: Whether your content is visible in raw HTML (server-side rendered) or requires JavaScript execution to appear.
  • Structured data: JSON-LD schema coverage at site level and per-page level. Presence of Article, FAQPage, HowTo markup.
  • Content signals: Answer-first layout, heading hierarchy, freshness indicators (dateModified schema, recent publication dates).

Cross-referencing your citation rate with your infrastructure score reveals the pattern. Sites with 8+ infrastructure checks passing and low citation rates typically have a content structure problem: technically accessible but not answer-formatted. Sites with low infrastructure scores and low citation rates have the simpler problem: fix the infrastructure first, then reassess.

Step 5: Interpret the Gap Between Platforms#

Citation rate gaps between platforms are diagnostic. Here is what the gaps typically mean:

High Perplexity, low ChatGPT: Your content is indexed but your Bing authority is weaker than your broader web footprint. Perplexity uses multiple sources including direct crawls; ChatGPT Search relies more heavily on Bing's index. Backlink building and Bing Webmaster Tools registration help close this gap.

High ChatGPT, low Perplexity: Your content ranks well in Bing but may lack the structured data and real-time freshness signals Perplexity weights. Adding FAQPage schema and updating content within the last 90 days helps.

Low on all platforms: Infrastructure or content structure problem. Run the scan before changing anything else.

Consistent 0% across all platforms: You are almost certainly blocked by robots.txt, not indexed by Bing, or your content is client-side rendered without SSR. These are fixable in hours.

Step 6: Repeat and Track#

AI citation rates are not static. Major model updates (GPT-4o, Claude 3.5 Sonnet, Gemini 2.0) shift citation patterns. When OpenAI updated its web search integration in late 2025, dozens of sites saw citation rates shift by 15+ percentage points in either direction.

Set a quarterly calendar reminder. Run the same 20 queries each quarter using the same logging format. Track rate by platform and by topic cluster. Quarterly data over two or three cycles reveals which content types your site gets cited for and which gaps remain.

The query set should stay consistent so you are measuring the same thing each quarter. If you change your query set, note it in your log so you can account for the change when comparing periods.

What to Do With Your Baseline#

A measured baseline is only useful if it triggers action. The decision tree is simple:

  • Citation rate above 15%: Strong. In our baseline the best-structured site reached 15% while a Domain-Authority-92 site sat at 5%, so clearing 15% means you are outperforming far larger sites. Focus on expanding topic coverage.
  • Citation rate 5 to 15%: Working, with room to grow. Run the scan, fix the failing checks, and retest in 30 days.
  • Citation rate 1 to 5%: Structural gaps. Fix crawl access and structured data first, then content structure. High visibility with citation in this band is normal: you are getting mentioned but not linked as the source.
  • Citation rate 0%: Diagnose before anything else. Check robots.txt for GPTBot blocks, confirm your content is server-rendered rather than client-only, and check that Bing has indexed your domain. Fix the blocking issue before any content work.

The guide I wish I had when I started this: measure first, then fix. Every hour spent "optimizing for AI" without a baseline is potentially wasted effort. The baseline tells you which lever to pull. The full step-by-step methodology, including the 20-query panel template and per-engine scoring approach, is also published as a comprehensive tutorial on freeCodeCamp: How to Measure Your AI Citation Rate Across ChatGPT, Perplexity, and Claude.

Run the free citability scan on your domain to get your infrastructure score alongside your manual citation rate baseline. The scan takes under two minutes and shows exactly which signals are failing and why they matter for AI citation likelihood. You will have a clearer picture of your AI visibility in the time it takes to read a single blog post.

The methodology behind the scan is documented at citability.dev/methodology if you want to understand the scoring criteria before running it. For the engine-level differences in citation behavior that this measurement protocol abstracts over (Perplexity quotes liberally; ChatGPT quotes selectively), see Chudi's Perplexity vs ChatGPT Citation Rules on chudi.dev.

Topics:ai-citability·measurement·tutorial·answer-engine-optimization

Chudi Nnorukam

AI-Visible Web Architect

Builds chudi.dev and citability.dev. Authored the AI Visibility Readiness Framework. Contributor at freeCodeCamp /news.

chudi.dev|Published |Updated

Check your AI visibility

Free scan. No account required. Results in 10 seconds.

Start Free Scan