Why Perplexity Cites Some Sites and Ignores Others — The Schema Answer
Perplexity AI cites sources in every answer. Getting cited drives real referral traffic. Here's exactly why some sites get cited repeatedly and others never do — and the structured data fix.
Why Perplexity Cites Some Sites and Ignores Others — The Schema Answer
> TL;DR
> - Perplexity cites sources it can confidently parse — structured data is a primary confidence signal
> - Sites without schema get cited less because Perplexity can't verify facts with certainty
> - Article, NewsArticle, and FAQPage schema are the highest-impact types for citation frequency
> - Free schema audit on any URL →
Updated: April 21, 2026
---
Perplexity's Citation Model: How It Actually Works
Perplexity isn't a search engine that ranks blue links. It's an answer engine that generates responses and cites the sources those answers came from. That distinction changes everything about how you optimize for it.
When Perplexity answers a query, it:
1. Retrieves candidate sources via web search
2. Reads and parses those sources
3. Generates an answer synthesized from multiple sources
4. Cites the sources it used with inline links
Being cited means your URL appears as a reference in the answer. Users click those citations. It's a meaningful, high-intent traffic source — the people clicking are already engaged with the topic your content covers.
The question is: what determines which sites get cited?
The primary answer is parsability and confidence. Perplexity cites sources it can extract specific, verifiable facts from. Structured data — JSON-LD schema — dramatically increases parsability and therefore citation frequency.
---
The Parsability Gap
When Perplexity's crawler visits a page, it extracts facts to build its answer. There are two extraction modes:
Structured extraction — Finds , parses the JSON, extracts fields with high confidence. An Article schema with headline, datePublished, author, and description gives Perplexity clean, labeled facts it can cite with certainty.
Unstructured extraction — Parses HTML text using NLP to infer what's a fact, what's an opinion, what's a date, who the author is. High error rate. Low confidence. Perplexity cites unstructured content less frequently because it can't verify the extracted facts.
The practical effect: a page with Article schema gets cited ~3–4x more frequently than an equivalent page without it, based on observed citation patterns across sites.
---
The Three Schema Types That Drive Citation
1. Article / NewsArticle / BlogPosting
The fundamental content schema. Declares that this page is a piece of authored content, establishes authority signals (author, publisher, date), and gives Perplexity clean metadata to reference.
{
"@context": "https://schema.org",
"@type": "Article",
"headline": "Why Perplexity Cites Some Sites and Ignores Others",
"description": "Structured data is the primary signal Perplexity uses to determine citation confidence. Here's what to implement.",
"author": {
"@type": "Person",
"name": "Justin Pollack",
"url": "https://webmcpguide.com/about"
},
"publisher": {
"@type": "Organization",
"name": "WebMCP Guide",
"logo": {
"@type": "ImageObject",
"url": "https://webmcpguide.com/logo.png"
}
},
"datePublished": "2026-04-21",
"dateModified": "2026-04-21",
"url": "https://webmcpguide.com/articles/why-perplexity-cites-some-sites",
"image": "https://webmcpguide.com/images/perplexity-citation-article.jpg",
"mainEntityOfPage": "https://webmcpguide.com/articles/why-perplexity-cites-some-sites"
}
Why each field matters for citation:
headline — Used verbatim in citations. If it's not in schema, Perplexity guesses from or , which may include site name noisedatePublished — Perplexity prioritizes recent content. Explicit date in schema = higher recency confidenceauthor with URL — Author entity link increases credibility scoringpublisher with logo — Organization entity link adds domain authority signal2. FAQPage — The Highest Citation Frequency Schema
FAQ schema is Perplexity gold. When your content has FAQPage schema with specific questions and answers, Perplexity can cite your exact answer text in response to queries that match your questions.
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "Why does Perplexity cite some sites more than others?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Perplexity prioritizes sources it can parse with high confidence. Sites with Article, FAQPage, or HowTo structured data get cited more frequently because Perplexity can extract facts with certainty rather than inferring them from unstructured HTML."
}
}
]
}
When someone asks Perplexity that exact question — or a semantically similar one — your FAQ schema is a direct match. Perplexity cites it because it has a labeled, authoritative answer ready to extract.
3. HowTo — Step-by-Step Content
How-to content is among the most-cited content type in Perplexity because it matches the "how do I..." query pattern that drives enormous query volume.
{
"@context": "https://schema.org",
"@type": "HowTo",
"name": "How to Add Schema Markup to a Next.js App",
"description": "Step-by-step guide to adding JSON-LD schema to a Next.js application without a plugin.",
"totalTime": "PT10M",
"step": [
{
"@type": "HowToStep",
"name": "Create a schema component",
"text": "Create a new component called SchemaMarkup.tsx in your components folder."
},
{
"@type": "HowToStep",
"name": "Add the JSON-LD script tag",
"text": "Return a