Not all structured data helps AI visibility equally. Here are the schema types that actually influence AI citation rates — and the ones that waste your time.
Most brands add schema markup because their SEO tool flagged it as missing. They implement whatever the plugin or checklist suggests, mark it done, and move on. FAQPage here, Article there, maybe a LocalBusiness block if someone remembered. Job done.
The problem is there's a meaningful difference between schema that helps Googlebot understand your pages and schema that actually influences whether an AI model cites you. The overlap is smaller than most people assume — and the gap is getting more expensive to ignore.
AI models aren't crawling your schema to generate rich results. They're using structured data as a signal to understand what your brand is, what you know, and whether you're a credible source on a topic. That's a different job than feeding Google's FAQ carousel. And the schema types that do it well are not always the ones that SEO tools prioritise.
Google's primary use of schema is to power rich results. It wants to know: does this page have a recipe? A product rating? An event? Match the template, get the featured snippet or carousel slot.
AI models trained on web data did something different. They learned from billions of pages — including the structured metadata — and developed an understanding of entity relationships. Organization → Person → Article. Product → Review → Author. FAQ question → direct answer. These chains matter because they help the model understand who is saying what and whether it's trustworthy.
This is why two sites can have identical content but one consistently gets cited and the other doesn't. One has clean, complete, linked entity schema. The other has generic, incomplete markup that doesn't actually describe the entities behind the content.
Schema for AI visibility is less about triggering a rich result and more about building a machine-readable identity layer for your brand.
FAQPage is probably the single highest-ROI schema type for AI citation. The reason is obvious once you think about it: AI models answer questions. FAQPage schema is literally a list of questions with direct answers. It's already in the format the model wants to output. When your content has clean FAQPage schema, you're essentially handing the AI a pre-packaged answer it can quote or paraphrase directly.
The key is specificity. "What is [product]?" with a one-line answer is borderline useless. "What's the difference between [product A] and [product B]?" with a 3–4 sentence answer that includes specific details, numbers, or comparisons — that's what gets lifted.
HowTo schema works for similar reasons. Step-by-step structured content is extremely easy for AI to extract and present. If your content walks through a process and you've marked it up with HowTo schema — with each step as a discrete named item — you're giving the model a citation-ready format. The pages on this site that rank in Perplexity for procedural queries almost always have either FAQPage or HowTo markup.
Article with linked Person schema is where E-E-A-T meets structured data. An Article that has an author field pointing to a Person with a defined @id, jobTitle, knowsAbout list, and sameAs links to verified profiles (LinkedIn, author pages) — that's a real signal. It connects your content to a specific human with specific expertise. An Article with author: "Admin" or no author at all tells the model nothing it can verify.
This is one area where AI models genuinely do evaluate expertise differently from Google. Google gave you authority credit through PageRank and backlinks. AI models look for entity coherence: does this author appear consistently across multiple credible sources? Does their knowledge graph entry connect to legitimate affiliations? Proper Person schema is the technical layer that makes that legible.
Organization schema belongs on every site and most sites have it — but most implementations are incomplete. The description field matters more than people realise. It's what the model reads to understand what your company does and in which category. A vague description like "We help businesses grow online" tells an AI model almost nothing it can use when someone asks for a recommendation in your space. A specific, structured description that names your category, your primary capability, and who you serve — that's what feeds the model's understanding of where you fit.
BreadcrumbList is underrated for AI indexing specifically. It tells models how your content is organised — which topics you own, how they cluster, what sits under what. This contributes to topical authority signals that influence citation frequency on topic clusters. If the model sees 30 articles on technical SEO, all properly breadcrumbed under a consistent category, it reads that as a signal that this site has sustained depth on the topic.
Generic WebPage markup — basically worthless from an AI citation perspective. It says "this is a web page." The model already knows that. Unless you're extending it with meaningful about, mentions, or mainEntity properties, it adds no useful information to the entity graph.
AggregateRating without substantive review schema — adding a 4.8-star rating without backing it with actual Review markup that includes reviewer names, dates, and review bodies is hollow. AI models have absorbed enough web data to recognise the pattern of fake or inflated ratings. Schema that looks complete on the surface but lacks substance doesn't earn citation trust.
Event schema is genuinely useful — if your content is about events. If you're adding it just to have another schema type in the source, it contributes nothing. Schema bloat is a real thing. Having 15 different schema types on a page doesn't help if most of them are vaguely implemented. A page with three complete, properly linked schema blocks will outperform one with eight incomplete ones every time.
The most common schema mistake I see isn't the wrong type — it's partial implementation. You have the FAQPage block, but the answers are one sentence. You have the Article schema, but the author is a string instead of a linked Person entity. You have the Organization schema, but the description is empty or generic.
Completeness matters because AI models use schema to fill in knowledge gaps. An incomplete schema block says "we started describing this entity but ran out of steam." It creates an incomplete picture that the model either ignores or treats as lower-confidence.
The two properties most often left blank are description and sameAs. Both are critical for AI. description tells the model what the entity is. sameAs links it to other web representations — Wikipedia, LinkedIn, Crunchbase, Wikidata — that the model can cross-reference. If your Organization schema has sameAs links to three verified external sources, the model has multiple confirmation points for the entity. If it has none, you're asking the model to take your word for it.
The @id property is the other underused one. Using consistent @id URIs across your schema blocks — the same @id for your Organization referenced from your Articles, your Person schema, your SiteNavigationElement — lets you build a proper entity graph that the model can traverse. Without @id linking, each schema block is an island. With it, you're building a connected knowledge structure.
Run your site through Google's Rich Results Test for basic validation, but don't stop there — it only checks for rich result eligibility, not AI completeness. For AI-specific schema audit, you need to look at:
@id?Surfaceable's SEO audit checks schema completeness as part of its 50+ signal scan — including whether your Article schema has linked author entities and whether your Organization block has a usable description. If you want a baseline on where you actually stand, that's the quickest way to find out.
The practical takeaway: stop thinking of schema as a box to tick. The right structured data, fully implemented and properly linked, is one of the more direct levers you have for influencing how AI models understand and cite your brand. It's not magic — it won't override thin content or a lack of topical authority — but on a site that already has good content, it's often the difference between being cited and being invisible.
Surfaceable is built for
Try Surfaceable
See how often ChatGPT, Claude, Gemini, and Perplexity mention your brand — and get a full technical SEO audit. Free to start.
Get started free →Schema Markup for AI: JSON-LD Strategies That Get You Cited by LLMs
Learn how to use schema markup and JSON-LD to improve your chances of being cited by ChatGPT, Perplexity, Claude, and Google AI Overviews.
Agentic SEO: How to Make Your Brand Discoverable by AI Agents
AI agents are browsing the web and making decisions on behalf of users. Here's what agentic SEO is, why it matters now, and 5 concrete steps to make your brand agent-discoverable.
We Just Built a Full SEO Platform — and It Starts at £29
Surfaceable now covers technical SEO, rank tracking, backlinks, keyword research, competitor intelligence, traffic analytics, brand mentions, and AI visibility across five platforms. Here's what that means for your stack — and your budget.