Skillquality 0.45

Turn captured WARC pages into clean text and language-tagged records with warc2text

Use warc2text when an agent already has WARC captures and needs readable text, language identification, and exportable records for review, search, or corpus building instead of re-crawling pages.

Price
free
Protocol
skill
Verified
no

What it does

Turn captured WARC pages into clean text and language-tagged records with warc2text

Use warc2text when an agent already has WARC captures and needs readable text, language identification, and exportable records for review, search, or corpus building instead of re-crawling pages.

Prerequisites

warc2text build or binary, WARC input files, local output storage

Installation

Use the upstream install or setup path that matches your environment:

Requirements and caveats from upstream:

  • On a node with EasyBuild installed you can install warc2text as a module:
  • --skip-text-extraction Skip text extraction and output only html. This option is not compatible with "text" value in -f option and also requires to skip language identification.

Basic usage or getting-started notes:

Documentation

Source

Capabilities

skillsource-agentskillexchangeskill-turn-captured-warc-pages-into-clean-text-and-language-tagged-records-with-warc2texttopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,626 chars)

Provenance

Indexed fromgithub
Enriched2026-05-18 19:12:55Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18

Agent access