{"id":"cdcd44f8-e4a7-4788-a27b-48e612f867a7","shortId":"BA5VfU","kind":"skill","title":"seo-schema","tagline":"Detect existing JSON-LD structured data on a page, validate against Google's rich-result requirements, and generate missing schema markup (Article, Product, LocalBusiness, FAQPage, BreadcrumbList). Produces paste-ready `<script type=\"application/ld+json\">` blocks. Use when the us","description":"# Schema Markup\n\nDetect, validate, and generate Schema.org JSON-LD for a page. Output is paste-ready `<script>` blocks the user can drop into their CMS or page template, plus a validation report on what's currently present and what's broken.\n\n## Prerequisites\n\n- **Required for detect/validate paths:** `mcp__firecrawl-mcp__firecrawl_scrape` (raw HTML access). WebFetch returns markdown only — every `<script type=\"application/ld+json\">` block is stripped before the skill ever sees it. Without Firecrawl, the skill can still generate new schema from intent detection (steps 4–6) but cannot detect or validate what's already on the page (steps 2–3, 7).\n- Optional: SE Ranking MCP server (used in step 7 for benchmarking competitor schema).\n- User provides: a target URL. Optionally a hint about page intent (\"this is a product page\", \"this is a how-to\") if the URL pattern doesn't make it obvious.\n\n## Process\n\n1. **Fetch HTML** `mcp__firecrawl-mcp__firecrawl_scrape` (preferred) or degrade\n   - **Cost note.** Firecrawl: 1 credit for the target URL, +10 credits if step 7 (competitor benchmark) runs (1 per top-10 SERP result). User may pass `--no-firecrawl` to force the degraded path (generate-only mode) for credit conservation.\n   - **If Firecrawl available:** scrape the target URL. For SPAs, pass `waitFor: 2000` (or a CSS selector for the main content) so the JS-rendered DOM is captured. Use the response's `html` for JSON-LD parsing in step 2 and `metadata` for canonical/robots cross-reference.\n   - **If Firecrawl unavailable:** skip steps 2, 3, and 7 entirely (they all need raw HTML). Steps 4–6 still run — the skill becomes \"generate-only\", producing recommended JSON-LD blocks from intent detection without comparing to what's on the page. Surface clearly in `SCHEMA.md`: `Existing-schema detection: skipped — Firecrawl required (WebFetch returns markdown only). Install via extensions/firecrawl/install.sh.`\n   - Even with Firecrawl: if JSON-LD blocks appear only after JS render, flag in the output: \"JS-rendered schema may not be detected by all crawlers — server-side render JSON-LD where possible.\"\n\n2. **Detect existing schema** (requires Firecrawl HTML from step 1)\n   - From the returned `html`: extract every `<script type=\"application/ld+json\">` block.\n   - Parse each as JSON. Report syntax errors.\n   - List each detected `@type`.\n   - Also detect Microdata (`itemscope`/`itemprop`) and RDFa (`typeof`/`property`) — flag as legacy and recommend migration to JSON-LD (Google's stated preference).\n   - **If step 1 degraded:** skip this step. Record `Existing-schema detection skipped` in `01-detected.md`.\n\n3. **Validate against Google's spec**\n   - Load `references/google-rich-results.md`.\n   - For each detected `@type`, check required and recommended properties.\n   - Surface common errors: missing `@context`, dates not in ISO 8601, prices as numbers instead of strings, `availability` as plain text instead of schema.org URL, telephone not in international format.\n\n4. **Detect page intent**\n   - From URL pattern (`/blog/`, `/products/`, `/contact/`, `/how-to/`, `/faq/`).\n   - From `<title>` and `<h1>` tone.\n   - From content signals (numbered list of steps → HowTo; visible Q&A blocks → FAQPage; price + buy button → Product; address + hours → LocalBusiness).\n   - If multiple intents detected, generate schema for each.\n\n5. **Generate missing JSON-LD**\n   - For each detected intent without matching valid schema, load the relevant template from `templates/`:\n     - `article.json` — for editorial/blog content\n     - `product.json` — for product/SKU pages\n     - `local-business.json` — for brick-and-mortar landing pages\n     - `faq-page.json` — for explicit Q&A blocks (gov/health allowlist only — see references/google-rich-results.md)\n     - `breadcrumb-list.json` — for any page with breadcrumb navigation\n   - Fill template fields from the live HTML (title → headline, h2s → mainEntity questions, etc.).\n   - Mark any field that couldn't be auto-filled as `{REPLACE: ...}` so the user knows to complete it.\n   - **Don't generate `HowTo`** — Google retired HowTo rich results in September 2023 (mobile + desktop). The schema can still ship for semantic clarity, but expect zero rich-result uplift; flag this in the recommendation rationale rather than treating HowTo as a live option.\n\n6. **Validate generated JSON-LD**\n   - Re-run the same validation rubric from step 3 on the generated blocks.\n   - Surface any required fields still marked `{REPLACE: ...}`.\n\n7. **Optional: benchmark against top SERP results** `DATA_getSerpResults` + `mcp__firecrawl-mcp__firecrawl_scrape`\n   - Identify the page's primary keyword (from `<title>` or user input).\n   - Pull top 10 organic results.\n   - **If Firecrawl available:** scrape each of the top 10 (10 Firecrawl credits). For each, parse JSON-LD blocks from the returned `html` and list detected `@type`s. This produces real schema data, not inferences from markdown.\n   - **If Firecrawl unavailable:** skip the benchmark — WebFetch's markdown strips all schema blocks, so any \"detection\" from it would be guesswork. Write `Competitor benchmark skipped — Firecrawl required to read JSON-LD from competitor pages.` into `04-competitor-benchmark.md`.\n   - Surface \"schema types used by 6+ of the top 10 that this page is missing.\" High-signal addition list. (Only emitted when benchmark ran.)\n\n8. **Synthesise** `SCHEMA.md`\n   - Validation report (existing schema, pass/fail per block).\n   - Recommended additions (with rationale linking back to step 4 detection or step 7 benchmark).\n   - Generated `<script>` blocks ready to paste.\n\n## Output format\n\nCreate a folder `seo-schema-{target-slug}-{YYYYMMDD}/` with:\n\n```\nseo-schema-{target-slug}-{YYYYMMDD}/\n├── 01-detected.md           (existing schema, validation results)\n├── 02-recommended.md        (which types this page should add and why)\n├── 03-generated/\n│   ├── article.jsonld\n│   ├── faq-page.jsonld\n│   └── ... (per generated type)\n├── 04-competitor-benchmark.md  (only if step 7 ran)\n└── SCHEMA.md                (deliverable: paste-ready blocks + install instructions)\n```\n\n`SCHEMA.md` follows this shape:\n\n```markdown\n# Schema Markup: {URL}\n\n> Snapshot dated {YYYY-MM-DD}.\n\n## Currently present\n- `Article` — valid ✓\n- `BreadcrumbList` — invalid ✗ (missing `position` on item 2)\n- ...\n\n## Recommended additions\n- `FAQPage` — page has 6 visible Q&A blocks but no FAQ schema. Adding this is eligible for FAQ rich results (subject to Google's 2024+ tightening — see references/google-rich-results.md).\n- `HowTo` — ...\n\n## Paste these into the `<head>` of the page\n\n### FAQPage\n\\`\\`\\`html\n<script type=\"application/ld+json\">\n{\n  \"@context\": \"https://schema.org\",\n  \"@type\": \"FAQPage\",\n  \"mainEntity\": [...]\n}\n</script>\n\\`\\`\\`\n\n### ... (per generated block)\n\n## Validation pass\n- All generated blocks parse cleanly ✓\n- All required fields filled ({n} {REPLACE: ...} placeholders remain — see below)\n- {REPLACE: ...} placeholders to fill manually:\n  - article.jsonld → `image` (need a hero image URL ≥ 1200×800)\n  - ...\n\n## Install\n1. Copy each `<script>` block above.\n2. Paste into the `<head>` of the relevant page (or the global `<head>` template, gated by page type).\n3. Test with [Google's Rich Results Test](https://search.google.com/test/rich-results).\n4. Submit the URL to GSC for re-indexing if changes are critical.\n```\n\n## Tips\n\n- JSON-LD goes in `<head>` or top of `<body>`. Don't bury it.\n- Test every generated block in [Google's Rich Results Test](https://search.google.com/test/rich-results) before shipping. The validation in step 3/6 follows the published spec but Google's actual test is authoritative.\n- `references/google-rich-results.md` is dated. If it's >6 months old when you run this skill, flag staleness in the output and recommend the user verify against current docs.\n- **Don't mark up content that isn't visibly on the page.** Google penalises hidden-content schema. If a page doesn't actually have FAQs visible, don't generate FAQPage schema.\n- For Article schema, `image` is required. If the page has no obvious hero image, leave the `{REPLACE: hero image URL}` placeholder rather than guessing.\n- The skill is read-mostly on the SE Ranking side: zero SE Ranking credits unless step 7 (competitor benchmark) is requested — that adds ~5–10 SE Ranking credits for `DATA_getSerpResults`. Firecrawl costs are separate: 1 credit for the target URL, +10 credits when step 7 runs.\n- **Verify after deploy:** once the generated schema is pasted into your CMS and re-deployed, re-run this skill on the same URL — the new run's \"Currently present\" section reflects the live state and confirms the schema actually rendered (vs sitting in the CMS but not yet pushed). Ad-hoc alternative: invoke `seo-firecrawl` on the URL and grep `META.md` for the expected `@type`s.","tags":["seo","schema","skills","seranking","agent-skills","ai-search","anthropic","backlinks","claude","claude-code","claude-plugin","claude-skills"],"capabilities":["skill","source-seranking","skill-seo-schema","topic-agent-skills","topic-ai-search","topic-anthropic","topic-backlinks","topic-claude","topic-claude-code","topic-claude-plugin","topic-claude-skills","topic-content-brief","topic-ga4","topic-keyword-research","topic-mcp"],"categories":["seo-skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/seranking/seo-skills/seo-schema","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add seranking/seo-skills","source_repo":"https://github.com/seranking/seo-skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 9 github stars · SKILL.md body (8,832 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:08:37.072Z","embedding":null,"createdAt":"2026-05-18T13:14:11.080Z","updatedAt":"2026-05-18T19:08:37.072Z","lastSeenAt":"2026-05-18T19:08:37.072Z","tsv":"'1':71 '1200':68 '800':69 'articl':27 'article.jsonld':61 'block':38,43 'breadcrumblist':31 'clean':45 'copi':72 'data':10 'detect':4 'exist':5 'faqpag':30 'field':48 'fill':49,59 'generat':23,37,42 'googl':16 'hero':65 'imag':62,66 'instal':70 'json':7 'json-ld':6 'ld':8 'localbusi':29 'manual':60 'markup':26 'miss':24 'n':50 'need':63 'page':13 'pars':44 'pass':40 'past':34 'paste-readi':33 'per':36 'placehold':52,57 'produc':32 'product':28 'readi':35 'remain':53 'replac':51,56 'requir':21,47 'result':20 'rich':19 'rich-result':18 'schema':3,25 'see':54 'seo':2 'seo-schema':1 'skill' 'skill-seo-schema' 'source-seranking' 'structur':9 'topic-agent-skills' 'topic-ai-search' 'topic-anthropic' 'topic-backlinks' 'topic-claude' 'topic-claude-code' 'topic-claude-plugin' 'topic-claude-skills' 'topic-content-brief' 'topic-ga4' 'topic-keyword-research' 'topic-mcp' 'url':67 'valid':14,39","prices":[{"id":"7182bd4e-7cec-408c-a510-a832f0ad8d41","listingId":"cdcd44f8-e4a7-4788-a27b-48e612f867a7","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"seranking","category":"seo-skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:14:11.080Z"}],"sources":[{"listingId":"cdcd44f8-e4a7-4788-a27b-48e612f867a7","source":"github","sourceId":"seranking/seo-skills/seo-schema","sourceUrl":"https://github.com/seranking/seo-skills/tree/main/skills/seo-schema","isPrimary":false,"firstSeenAt":"2026-05-18T13:14:11.080Z","lastSeenAt":"2026-05-18T19:08:37.072Z"}],"details":{"listingId":"cdcd44f8-e4a7-4788-a27b-48e612f867a7","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"seranking","slug":"seo-schema","github":{"repo":"seranking/seo-skills","stars":9,"topics":["agent-skills","ai-search","answer-engine-optimization","anthropic","backlinks","claude","claude-code","claude-plugin","claude-skills","content-brief","ga4","generative-engine-optimization","keyword-research","mcp","mcp-server","search-console","seo","seo-tools","seranking","site-audit"],"license":"mit","html_url":"https://github.com/seranking/seo-skills","pushed_at":"2026-05-11T20:07:40Z","description":"Claude SEO Skills — production Claude Agent Skills for the SE Ranking MCP server. Content briefs, AI Search share of voice, audits, backlink gaps, keyword clusters, schema, sitemap, GEO, and more.","skill_md_sha":"9b3d607bef105a76eba2546c0075195a88704d9f","skill_md_path":"skills/seo-schema/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/seranking/seo-skills/tree/main/skills/seo-schema"},"layout":"multi","source":"github","category":"seo-skills","frontmatter":{"name":"seo-schema","description":"Detect existing JSON-LD structured data on a page, validate against Google's rich-result requirements, and generate missing schema markup (Article, Product, LocalBusiness, FAQPage, BreadcrumbList). Produces paste-ready `<script type=\"application/ld+json\">` blocks. Use when the user asks for \"schema markup\", \"structured data\", \"JSON-LD\", \"rich results\", \"schema validation\", or \"fix the schema on this page\"."},"skills_sh_url":"https://skills.sh/seranking/seo-skills/seo-schema"},"updatedAt":"2026-05-18T19:08:37.072Z"}}