Skip to main content
← Back to Articles

Article Schema for LLM Citation Optimization: Get Cited by ChatGPT, Perplexity, and Claude

Article, NewsArticle, Podcast, and VideoObject schema are how LLMs identify authoritative content to cite. Here's the complete guide to structured data for content publishers in 2026.

By Web MCP Guide•April 21, 2026•7 min read


Article Schema for LLM Citation Optimization: Get Cited by ChatGPT, Perplexity, and Claude

> TL;DR
> - LLMs cite sources they can parse with confidence — structured data is the confidence signal
> - Article, FAQPage, HowTo, Podcast, and VideoObject schema each serve different citation use cases
> - Author and publisher entity schema builds long-term citation authority
> - Add citation-optimized schema to your content →

Updated: April 21, 2026

---

Why LLMs Cite Some Content and Not Others

Every time ChatGPT uses Browse, Perplexity answers a question, or Claude summarizes current information, it's making citation decisions: which sources to use, which to trust, which to quote.

Those decisions aren't random. LLMs weight sources by:

  • Parsability — can we extract clean, labeled facts?

  • Authority — is this a recognized publisher with consistent metadata?

  • Recency — is this content current?

  • Specificity — does this content directly answer the query?
  • Structured data improves all four. Schema makes facts parsable. publisher and author entity links build authority. datePublished and dateModified signal recency. FAQPage and HowTo schema target specific query patterns with precision.

    This guide covers every content schema type and how to implement each for maximum LLM citation frequency.

    ---

    Article Schema: The Foundation

    Every piece of content you publish should have Article schema (or a subtype). This is the foundational declaration that tells LLMs: this is authored, published content with a specific topic.

    {
    "@context": "https://schema.org",
    "@type": "Article",
    "headline": "Article Schema for LLM Citation Optimization",
    "description": "How to implement structured data for content that gets cited by ChatGPT, Perplexity, and Claude.",
    "author": {
    "@type": "Person",
    "name": "Justin Pollack",
    "url": "https://webmcpguide.com/author/justin",
    "sameAs": [
    "https://twitter.com/justinpollack",
    "https://linkedin.com/in/justinpollack"
    ]
    },
    "publisher": {
    "@type": "Organization",
    "name": "WebMCP Guide",
    "url": "https://webmcpguide.com",
    "logo": {
    "@type": "ImageObject",
    "url": "https://webmcpguide.com/logo.png",
    "width": 600,
    "height": 60
    }
    },
    "datePublished": "2026-04-21",
    "dateModified": "2026-04-21",
    "url": "https://webmcpguide.com/articles/article-schema-llm-citation-optimization",
    "image": {
    "@type": "ImageObject",
    "url": "https://webmcpguide.com/images/article-schema-guide.jpg",
    "width": 1200,
    "height": 630
    },
    "mainEntityOfPage": {
    "@type": "WebPage",
    "@id": "https://webmcpguide.com/articles/article-schema-llm-citation-optimization"
    },
    "articleSection": "Schema & Structured Data",
    "keywords": ["article schema", "LLM citation", "structured data", "JSON-LD"]
    }

    Article Subtypes — When to Use Each

    Article — General-purpose. Use for guides, tutorials, opinion pieces, and explainers.

    NewsArticle — Time-sensitive reporting. Use for news coverage, announcements, and current events. LLMs weight NewsArticle more heavily for recency-sensitive queries.

    BlogPosting — Blog content, personal essays, less formal writing. Signals lower authority weight than Article — use Article for any content you want cited in professional/technical contexts.

    TechArticle — Technical documentation, developer guides, API references. Signals high technical authority. Use this for developer-focused content.

    ScholarlyArticle — Research-grade content with citations. High authority weight but rarely appropriate for marketing/SEO content.

    For most content, use Article or TechArticle. Reserve NewsArticle for time-sensitive pieces. Avoid BlogPosting for content you want cited as authoritative.

    ---

    Author Entity Schema: Building Personal Citation Authority

    The author field in article schema is more powerful than it looks. When an LLM sees the same author entity across multiple articles, it builds a profile: this person publishes on these topics, at this level of specificity, with this track record.

    Over time, author entity recognition means LLMs cite your content specifically because of authorship — not just because the article happens to be on topic.

    Author page schema (Person type):

    {
    "@context": "https://schema.org",
    "@type": "Person",
    "name": "Justin Pollack",
    "url": "https://webmcpguide.com/author/justin",
    "description": "Founder of WebMCP Guide. Writing about AI infrastructure, structured data, and agent-to-agent commerce.",
    "image": "https://webmcpguide.com/author/justin-photo.jpg",
    "sameAs": [
    "https://twitter.com/justinpollack",
    "https://linkedin.com/in/justinpollack",
    "https://github.com/justinpollack"
    ],
    "knowsAbout": [
    "Model Context Protocol",
    "JSON-LD structured data",
    "AI agent development",
    "E-commerce SEO"
    ],
    "worksFor": {
    "@type": "Organization",
    "name": "WebMCP Guide",
    "url": "https://webmcpguide.com"
    }
    }

    Add this to your author page. Link to it from every article's author.url field. Over time, LLMs connect the author entity to the topics in knowsAbout — improving citation frequency for content in those areas.

    ---

    FAQPage Schema: The Highest-Frequency Citation Type

    No schema type drives citation more consistently than FAQPage. The reason is structural: LLMs answer questions. FAQ schema contains questions and answers. The match is exact.

    Add FAQPage as a separate, additional schema block on any article that covers specific questions:

    {
    "@context": "https://schema.org",
    "@type": "FAQPage",
    "mainEntity": [
    {
    "@type": "Question",
    "name": "What is the best schema type for getting cited by Perplexity?",
    "acceptedAnswer": {
    "@type": "Answer",
    "text": "FAQPage schema drives the highest citation frequency because questions in schema directly match the query patterns LLMs process. Article schema builds foundational authority. Together they maximize citation across both direct question queries and broader topic queries."
    }
    },
    {
    "@type": "Question",
    "name": "How long does it take for schema changes to affect LLM citations?",
    "acceptedAnswer": {
    "@type": "Answer",
    "text": "Perplexity typically reflects schema changes within 2–4 weeks. Google AI Overviews take 2–6 weeks. ChatGPT's indexed knowledge updates on a longer cycle (weeks to months), but Browse mode reflects changes faster."
    }
    }
    ]
    }

    Write your FAQ questions to match how people actually query. Not "What is article schema?" — but "Why does Perplexity cite some sites more than others?" The question phrasing in schema should match the natural language query pattern you're targeting.

    ---

    HowTo Schema: Procedural Citation

    How-to content is the second-highest citation frequency type after FAQ. Implement HowTo schema on any step-by-step content:

    {
    "@context": "https://schema.org",
    "@type": "HowTo",
    "name": "How to Add Article Schema to a Next.js Site",
    "description": "Add JSON-LD Article schema to a Next.js app using a reusable component.",
    "totalTime": "PT15M",
    "supply": [
    { "@type": "HowToSupply", "name": "Next.js project" },
    { "@type": "HowToSupply", "name": "Article metadata (title, author, date)" }
    ],
    "step": [
    {
    "@type": "HowToStep",
    "position": 1,
    "name": "Create the schema component",
    "text": "Create ArticleSchema.tsx in your components directory.",
    "image": "https://webmcpguide.com/images/step1-schema-component.jpg"
    },
    {
    "@type": "HowToStep",
    "position": 2,
    "name": "Pass article metadata as props",
    "text": "Accept headline, datePublished, author, and description as props and construct the schema object."
    },
    {
    "@type": "HowToStep",
    "position": 3,
    "name": "Inject into page head",
    "text": "Use dangerouslySetInnerHTML to render the script tag. Place it in the component on each article page."
    }
    ]
    }

    ---

    Podcast Schema: Audio Content for LLM Citation

    Podcasts are systematically under-structured. Most show notes pages have no schema, which means LLMs can't cite specific episodes in relevant answers — even when the content is highly relevant.

    {
    "@context": "https://schema.org",
    "@type": "PodcastEpisode",
    "name": "MCP and the Agent-to-Agent Web with Justin Pollack",
    "description": "Deep dive on how Model Context Protocol enables agent-to-agent commerce, how schema fits into the stack, and why most SaaS products are invisible to AI agents right now.",
    "url": "https://yourpodcast.com/episodes/mcp-agent-web",
    "associatedMedia": {
    "@type": "AudioObject",
    "contentUrl": "https://yourpodcast.com/audio/episode-42.mp3",
    "duration": "PT52M"
    },
    "partOfSeries": {
    "@type": "PodcastSeries",
    "name": "The AI Infrastructure Podcast",
    "url": "https://yourpodcast.com"
    },
    "datePublished": "2026-04-21",
    "author": {
    "@type": "Person",
    "name": "Justin Pollack"
    },
    "keywords": ["MCP", "agent-to-agent commerce", "structured data", "AI infrastructure"]
    }

    ---

    VideoObject Schema: Video Content for LLM Citation

    YouTube and video content are increasingly cited by Perplexity and ChatGPT. Structured data on your video pages — whether YouTube embeds or native video — improves citation frequency.

    {
    "@context": "https://schema.org",
    "@type": "VideoObject",
    "name": "How to Add SoftwareApplication Schema to Your SaaS Product",
    "description": "Step-by-step tutorial: adding complete SoftwareApplication JSON-LD schema to a SaaS product page for AI agent discoverability.",
    "thumbnailUrl": "https://webmcpguide.com/thumbnails/saas-schema-tutorial.jpg",
    "uploadDate": "2026-04-21",
    "duration": "PT8M30S",
    "contentUrl": "https://webmcpguide.com/videos/saas-schema-tutorial.mp4",
    "embedUrl": "https://youtube.com/embed/XXXXX",
    "publisher": {
    "@type": "Organization",
    "name": "WebMCP Guide"
    }
    }

    ---

    Citation Optimization Checklist for Content Publishers

    Per article:

  • [ ] Article (or appropriate subtype) with headline, datePublished, author, publisher

  • [ ] FAQPage block with 3–5 questions matching target query patterns

  • [ ] HowTo block for any step-by-step sections

  • [ ] dateModified updated whenever content is revised
  • Site-wide:

  • [ ] Organization schema on homepage with logo and url

  • [ ] Author pages with Person schema and sameAs links

  • [ ] Consistent publisher declaration across all articles

  • [ ] PerplexityBot, GPTBot, ClaudeBot allowed in robots.txt
  • Advanced:

  • [ ] Podcast / PodcastEpisode schema on show notes pages

  • [ ] VideoObject schema on video pages

  • [ ] breadcrumb schema on all content pages
  • → Automate all of this with schema injection →

    ---

    Related articles:

  • Why Perplexity Cites Some Sites and Ignores Others

  • Structured Data for AI Content Discovery

  • JSON-LD Product Schema: Complete Guide

  • Free AI Visibility Audit →