Skill · quality 0.45

Extract schema.org, Open Graph, and JSON-LD metadata from web pages for indexing

Uses extruct to pull machine-readable metadata from raw HTML so an agent can classify, deduplicate, or enrich pages without brittle full-page parsing. It is best for metadata harvesting workflows, not for crawling an entire site or rendering JavaScript-heavy pages.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/extract-schema-org-open-graph-and-json-ld-metadata-from-web-pages-for-indexing

What it does

Extracts schema.org, Open Graph, and JSON-LD metadata from web pages so the pages can be indexed.

Prerequisites

Python 3 environment

Installation

Use the upstream install or setup path that matches your environment:

pip install extruct
pip install 'extruct[cli]'
pip install -r requirements-dev.txt

Requirements and caveats from upstream:

PyPI package: https://pypi.python.org/pypi/extruct
rdflib (linked from the upstream README): https://pypi.python.org/pypi/rdflib/

Basic usage or getting-started notes:

First fetch the HTML using python-requests and then feed the response body to extruct.

Source: https://github.com/scrapinghub/extruct
Extracted from upstream docs: https://raw.githubusercontent.com/scrapinghub/extruct/HEAD/README.rst
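
The upstream usage note (fetch the HTML with python-requests, then feed the response body to extruct) can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation. The function names extract_metadata, fetch_and_extract, jsonld_types, and fingerprint are hypothetical, as is the choice to pass the final response URL as base_url and to restrict extraction to three syntaxes. The two helpers at the end show one possible way to use the result for classification and deduplication.

```python
import hashlib
import json


def extract_metadata(html, url):
    """Run extruct over raw HTML and return its result dict."""
    # Third-party import kept local so the pure helpers below remain
    # usable even where extruct is not installed.
    import extruct

    # Limiting syntaxes is an assumption made to match this skill's scope;
    # extruct can extract other syntaxes as well.
    return extruct.extract(
        html,
        base_url=url,
        syntaxes=["json-ld", "microdata", "opengraph"],
    )


def fetch_and_extract(url, timeout=10):
    """Fetch a page with requests, then hand the body to extruct."""
    import requests

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    # response.url is the final URL after redirects (assumed base URL).
    return extract_metadata(response.text, response.url)


def jsonld_types(metadata):
    """Collect @type values of top-level JSON-LD items (hypothetical helper)."""
    types = []
    for item in metadata.get("json-ld", []):
        if not isinstance(item, dict):
            continue
        value = item.get("@type", [])
        values = value if isinstance(value, list) else [value]
        types.extend(v for v in values if isinstance(v, str))
    return sorted(set(types))


def fingerprint(metadata):
    """Hash the metadata independently of key order, e.g. for deduplication."""
    canonical = json.dumps(metadata, sort_keys=True, default=str)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Calling fetch_and_extract on a URL requires network access plus the extruct and requests packages. jsonld_types and fingerprint operate on the returned dict only, so they can be exercised offline on stored results.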

Documentation

https://github.com/scrapinghub/extruct#readme

Source

Agent Skill Exchange

Capabilities

skill, source-agentskillexchange, skill-extract-schema-org-open-graph-and-json-ld-metadata-from-web-pages-for-indexing, topic-agent-skills, topic-ai-agents, topic-ai-tools, topic-awesome-list, topic-claude-code, topic-codex, topic-cursor, topic-llm, topic-mcp, topic-npx-skills, topic-openclaw, topic-skills-catalog

Install

Install: npx skills add agentskillexchange/skills

Source: https://github.com/agentskillexchange/skills/tree/main/skills/extract-schema-org-open-graph-and-json-ld-metadata-from-web-pages-for-indexing

skills.sh: https://skills.sh/agentskillexchange/skills/extract-schema-org-open-graph-and-json-ld-metadata-from-web-pages-for-indexing

Transport: skills-sh

Protocol: skill

Quality

0.45 / 1.00

Deterministic score 0.45 from registry signals: indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,238 chars)

Provenance

Indexed from: github

Enriched: 2026-05-18 19:10:24Z · deterministic:skill-github:v1 · v1

First seen: 2026-05-18

Last seen: 2026-05-18

Agent access

JSON: https://clawmart.sh/api/listings/KTPHCF