Schema Markup for GEO: The 4 Schema Types That Move AI Citation Rates

Alejandro Rioja

June 28, 2026 8 min read

TL;DR

Schema markup was a Google rich-result tiebreaker in 2024. In 2026 it’s a primary signal for AI engines (ChatGPT, Perplexity, Google AI Overviews, Claude) — they parse JSON-LD to extract clean, citable answers from pages. Four schema types do 90% of the work for GEO: Article, FAQPage, HowTo, and Speakable.

Free newsletter

Every Wednesday. 28,400+ operators. Zero fluff.

Open Table of contents

Why schema matters more for AI engines than it ever did for Google
Schema #1: Article
Schema #2: FAQPage
Schema #3: HowTo
Schema #4: Speakable
Step-by-step: adding schema to a pillar post in 20 minutes
Schema mistakes that hurt AI citation rates
The schema priority order if you’re starting from scratch
Schema Markup for GEO — 2026 FAQ
Updated for May 2026

Why schema matters more for AI engines than it ever did for Google

Classic Google has always been able to read your page without schema. Schema was a tiebreaker — it earned you rich results (review stars, FAQ accordions, HowTo cards), but the underlying ranking didn’t depend on it.

Generative engines have a different problem. They’re not ranking your page; they’re extracting an answer from it. Without structured data, they have to infer the structure from the raw HTML — possible but lossy and error-prone. With JSON-LD, the extraction is clean. The page becomes machine-readable in a way that raw HTML isn’t.

In my testing, adding the four schema types in this post to a previously schema-free pillar post moved AI-engine citation rates by single-digit-to-mid-teens percentage points. That’s real money for a piece of work that takes 20 minutes per post.

Schema #1: Article

Article schema is the foundation. Every blog post on your site should have it. It tells engines who wrote the post, when it was published, when it was last updated, what category it belongs to, and what the headline is.

What to include: headline, datePublished, dateModified, author (with name and URL), publisher (Organization), image, mainEntityOfPage, description.

The fields that AI engines actually use: author + dateModified are the high-signal ones. AI engines lean on author for trust attribution; dateModified for freshness. Get both right.

Mistake to avoid: Setting dateModified to “now” on every page load (some plugins do this) — engines see through it. Only update dateModified when you actually change the content.

Schema #2: FAQPage

FAQPage schema marks up question-and-answer pairs so engines can extract them cleanly. Despite Google’s 2023 removal of FAQ rich results from classic SERPs, generative engines still parse FAQPage and use it heavily.

What to include: mainEntity array of Question objects, each with name (the question) and acceptedAnswer (the answer text).

When to use it: any page with an actual FAQ section. Don’t add FAQPage schema to a page without an FAQ — that’s a quality signal worth preserving.

The leverage point: match the literal phrasing of the questions to how people search. “How long does it take to rank in Google?” gets cited; “Time-to-rank considerations” doesn’t, even if the underlying answer is the same.

Mistake to avoid: stuffing the FAQ with marketing-style answers. Engines want short, direct, factual answers. If your answer reads like a sales pitch, it gets cited less.

Schema #3: HowTo

HowTo schema marks up step-by-step procedures. It’s the most underused of the four because it requires a real procedure to mark up — but on pages that have one, it’s high-leverage.

What to include: name (the procedure name), step array (each with name, text, optionally image and url to a section anchor), totalTime, supply, tool.

When to use it: playbook posts, tutorials, anything with numbered steps. Generative engines pull HowTo schema directly into their answers when a user asks a “how to” query.

The leverage point: use clear, action-verb step names. “Install the plugin” not “Step 1.” Engines lift the step.name into their summaries.

Mistake to avoid: marking up a numbered list as HowTo when it’s not actually a procedure. A “10 reasons SEO matters” listicle isn’t a HowTo — it’s an Article with an itemList. Mismatched schema is worse than no schema.

Schema #4: Speakable

Speakable schema is the one most teams skip. It tells voice/audio AI assistants which specific sentence on your page to read aloud when answering a voice query. As voice and AI-assistant queries converge in 2026, Speakable is becoming a real GEO move.

What to include: a SpeakableSpecification object inside your Article schema, with cssSelector or xpath pointing to the sentence(s) you want read aloud. Usually that’s your TL;DR block.

When to use it: any pillar post with a clean TL;DR-style answer at the top. The selector points to that block.

The leverage point: Speakable + a clean TL;DR is what wins voice-assistant queries. Without it, the assistant has to guess which sentence to read.

Mistake to avoid: pointing Speakable at content that’s not actually a clean answer. The assistant will read whatever you point it at; if it’s awkward, the answer is awkward.

Step-by-step: adding schema to a pillar post in 20 minutes

Add Article schema with author + datePublished + dateModified + headline + image + mainEntityOfPage. Most SEO plugins (RankMath, Yoast) do this automatically; verify in Google’s Rich Results Test that it’s present and correct.
Identify the FAQ section. If there isn’t one, add one — 3–7 questions with literal user-phrasing.
Add FAQPage schema wrapping the FAQ. RankMath and Yoast can generate this from existing FAQ blocks; verify the output.
Identify the step-by-step section. If the post has numbered steps that describe a procedure, mark them up as HowTo. Skip if the post doesn’t have one.
Identify the TL;DR block at the top of the post. Add a Speakable specification pointing to its CSS selector.
Validate with Google’s Rich Results Test (search.google.com/test/rich-results) and Schema.org’s validator. Both should report no errors.
Re-fetch the page in your AI engine of choice 2–4 weeks later and check whether citation rate moved. Manual sampling is still the most reliable measurement.

Schema mistakes that hurt AI citation rates

Schema that doesn’t match the content. FAQPage on a page without an FAQ. HowTo on a listicle. Article author set to a name that doesn’t appear in the byline. All of these are quality signals engines flag.
Multiple competing Article objects on the same page. Some plugins double-emit — verify in your page source that there’s exactly one Article block per post.
Stale dateModified. Either too old (signals the page is unmaintained) or set to “now” on every load (signals manipulation). Update only when you actually change content.
Empty or placeholder fields. “John Doe” as author, generic Organization with no logo, headline that doesn’t match the actual H1. Engines validate these against page content.
Schema buried in JavaScript. Some AI crawlers don’t render JS reliably. Server-side render JSON-LD into the page head or right before </body>.

The schema priority order if you’re starting from scratch

Article on every post. Non-negotiable. Most plugins do this; verify it’s correct.
FAQPage on every post with an FAQ. Add an FAQ section to your top 20 pillar posts; mark them up.
HowTo on every playbook post with numbered steps. Don’t force it where it doesn’t belong.
Speakable on every pillar post with a clean TL;DR. Adds 30 seconds of work per post.

That sequence covers ~90% of the schema-driven GEO lift available in 2026. The remaining 10% (Person, Organization, Course, SoftwareApplication, Product) is niche-specific and worth adding case-by-case.

Schema Markup for GEO — 2026 FAQ

Do all AI engines use schema markup in 2026?

The major ones — ChatGPT, Perplexity, Google AI Overviews, Claude — all parse JSON-LD when present. Smaller engines vary. The cost of adding schema is low enough that doing it once covers all engines.

Which SEO plugin handles schema best for GEO in 2026?

RankMath and Yoast both cover the basics well — Article, FAQPage, HowTo. RankMath’s HowTo block is slightly easier to use; Yoast’s FAQ block is tighter. Either is fine. For Speakable, you’ll likely need a custom snippet — neither plugin handles it natively as of 2026.

Does schema help with classic Google rankings?

Indirectly. Google has stated schema isn’t a direct ranking signal, but rich results that schema enables improve CTR, and AI Overview citations from schema’d pages are now a measurable signal that feeds back into traffic.

Should I add schema to category pages and tag archives?

Article schema, no — those aren’t articles. CollectionPage or BreadcrumbList schema, yes — they help engines understand site structure.

How do I check if my schema is actually working?

Google’s Rich Results Test for syntax validation. Manual AI-engine sampling for citation impact. Specialized tools (Schema App, Schema Pro) for ongoing monitoring at scale. The combination is more reliable than any single tool.

Want help building this on your own site? Read the full SEO + GEO playbook or get in touch — I run AI SEO + GEO consulting projects for operator teams that want to compound visibility across both classic Google and AI engines.

Updated for May 2026

SEO in 2026 is unrecognizable from the 2020-era playbook. Three shifts that matter for anything written before mid-2024:

AI Overviews are the new SERP zero position. Google’s AI Overviews default to roughly 60% of US informational queries, eating most “what is” / “how to” CTR. Optimizing for citation inside the AI Overview is now as important as ranking #1.
GEO (Generative Engine Optimization) is the working term for cross-engine optimization — getting cited inside ChatGPT, Perplexity, Claude, and Gemini answers. ~12% of high-intent commercial queries in late 2025 sample studies showed a direct-citation flow from these engines (vs. zero pre-2023).
E-E-A-T (now E^3-A-T, Experience + Expertise + Establishment + Authoritativeness + Trustworthiness) continues to be the framing Google uses internally — “Establishment” was the 2024 addition emphasizing brand-level signals.

Tool landscape (May 2026): Ahrefs and Semrush both shipped Generative Engine tracking. Surfer SEO + the Topical Authority crowd added GEO scoring. Screaming Frog still the standard crawler. AlsoAsked, Keyword Insights, and Frase shifted heavily into AI-Overview snippet engineering.

If this post predates May 2024, treat its core advice as the Google-search baseline and layer the GEO playbook on top.

Free tool: paste a URL into the schema generator to get valid Article, FAQPage, HowTo, and BreadcrumbList JSON-LD without hand-writing it.

Keep reading

GEO

How to Get Your Brand Cited Inside ChatGPT Answers (2026)

Updated for 2026. Getting cited by ChatGPT isn't about ranking — it's about being the cleanest source to lift. Write a direct-answer TL;DR, mark up entities, earn third-party mentions, and keep your facts machine-checkable.

SEO

ChatGPT Search vs Google: A Side-by-Side Test on 50 Head Terms

Updated for 2026. I ran the same 50 head terms in ChatGPT search and Google (with AI Overviews) and tracked which sources each engine cited. Source overlap was about 40% — the rest of the time the two engines surfaced completely different sources.

SEO

How to Write a TL;DR That Gets Cited by AI Engines (Step-by-Step)

Updated for 2026. A TL;DR isn’t a summary of your post — it’s a direct answer to the head query, written in 2–4 sentences, structured so an AI engine can lift it verbatim into its answer. Open with the takeaway, follow with the why, end with the constraint or caveat.