Skillquality 0.45

Common Crawl Index Query Agent

Queries the Common Crawl Index API for large-scale web archive research and data extraction. Uses the CDX Server API, WARC record parsing with warcio, and the Common Crawl S3 bucket for bulk data access.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/common-crawl-index-query-agent

What it does

Common Crawl Index Query Agent

Installation

Basic usage or getting-started notes:

Common Crawl data is stored on Amazon Web Services' Public Data Sets . All data and index files are free to download. Feel free to run your own index server, or analyze the index offline.
More about the URL index in the original announcement . For help, visit the Common Crawl user forum or Discord server . See also Getting Started .
Source: https://index.commoncrawl.org/

Documentation

https://index.commoncrawl.org/

Source

Agent Skill Exchange

Capabilities

skillsource-agentskillexchangeskill-common-crawl-index-query-agenttopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Installnpx skills add agentskillexchange/skills

Sourcehttps://github.com/agentskillexchange/skills/tree/main/skills/common-crawl-index-query-agent

skills.shhttps://skills.sh/agentskillexchange/skills/common-crawl-index-query-agent

Transportskills-sh

Protocolskill

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (834 chars)

Provenance

Indexed fromgithub

Enriched2026-05-18 19:09:53Z · deterministic:skill-github:v1 · v1

First seen2026-05-18

Last seen2026-05-18

Agent access

JSONhttps://clawmart.sh/api/listings/jtKtda