Skill · quality 0.45

Turn messy document collections into structured rows with DocETL

Define repeatable extraction pipelines that pull fields from large document collections, normalize outputs, and audit failures across the corpus.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/turn-messy-document-collections-into-structured-rows-with-docetl

What it does

Define repeatable extraction pipelines that pull fields from large document collections, normalize outputs, and audit failures across the corpus.
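The extract, normalize, and audit flow described above can be sketched in plain Python. This is an illustration of the pattern only, not DocETL's API: the regex-based `extract_fields` stands in for an LLM-backed extraction step, and the `invoice_id`/`total` schema, the function names, and the sample documents are all hypothetical.

```python
import re

# Hypothetical schema: every output row must carry these fields.
REQUIRED_FIELDS = ("invoice_id", "total")


def extract_fields(text: str) -> dict:
    """Stand-in extractor (regex here; an LLM call in a real pipeline)."""
    found = {}
    match = re.search(r"invoice\s*#?\s*(\w+)", text, re.IGNORECASE)
    if match:
        found["invoice_id"] = match.group(1)
    match = re.search(r"total[:\s]*\$?\s*([\d,]+\.?\d*)", text, re.IGNORECASE)
    if match:
        found["total"] = match.group(1)
    return found


def normalize(fields: dict) -> dict:
    """Bring raw strings into one consistent shape per field."""
    return {
        "invoice_id": fields["invoice_id"].upper(),
        "total": float(fields["total"].replace(",", "")),
    }


def run_pipeline(docs: dict) -> tuple:
    """Return (rows, failures); failures are (doc_id, reason) pairs for auditing."""
    rows, failures = [], []
    for doc_id, text in docs.items():
        raw = extract_fields(text)
        missing = [name for name in REQUIRED_FIELDS if name not in raw]
        if missing:
            failures.append((doc_id, "missing: " + ", ".join(missing)))
            continue
        try:
            rows.append({"doc_id": doc_id, **normalize(raw)})
        except ValueError as exc:
            failures.append((doc_id, f"normalize error: {exc}"))
    return rows, failures


if __name__ == "__main__":
    sample_docs = {
        "a.txt": "Invoice #A17 ... Total: $1,200.50",
        "b.txt": "Meeting notes, no billing data.",
    }
    rows, failures = run_pipeline(sample_docs)
    print(rows)
    print(failures)
```

Keeping failures as data rather than raising on the first bad document is what makes a corpus-wide audit possible: every document ends up either as a row or as a logged reason.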

Prerequisites

Python 3.10+, DocETL, document corpus, extraction configuration
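A quick way to check the first two prerequisites before running anything is a small standard-library script. This is a minimal sketch: `check_prerequisites` is a hypothetical helper, and it assumes the package installed by `pip install docetl` is importable under the name `docetl`.

```python
import importlib.util
import sys


def check_prerequisites(min_python=(3, 10), package="docetl"):
    """Report whether the interpreter and package prerequisites are met."""
    return {
        # Compare the running interpreter against the minimum version.
        "python_ok": sys.version_info >= min_python,
        # find_spec returns None when a top-level package is not installed.
        "package_installed": importlib.util.find_spec(package) is not None,
    }


if __name__ == "__main__":
    print(check_prerequisites())
```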

Installation

Use the upstream install path that matches your environment:

Docker (recommended for quick start): make docker
Python package: pip install docetl

Requirements and caveats from upstream:

For production use, DocETL is available as a Python package for running pipelines from the command line or from Python code.

Getting-started notes from upstream:

DocWrangler is hosted at docetl.org/playground. The playground can also be run locally, for example with Docker (see Installation).
Upstream lists an OpenAI API key as a requirement.
Source: https://github.com/ucbepic/docetl
Extracted from upstream docs: https://raw.githubusercontent.com/ucbepic/docetl/HEAD/README.md

Documentation

https://docetl.org/

Source

Agent Skill Exchange

Capabilities

skill
source-agentskillexchange
skill-turn-messy-document-collections-into-structured-rows-with-docetl
topic-agent-skills
topic-ai-agents
topic-ai-tools
topic-awesome-list
topic-claude-code
topic-codex
topic-cursor
topic-llm
topic-mcp
topic-npx-skills
topic-openclaw
topic-skills-catalog

Install

Install: npx skills add agentskillexchange/skills

Source: https://github.com/agentskillexchange/skills/tree/main/skills/turn-messy-document-collections-into-structured-rows-with-docetl

skills.sh: https://skills.sh/agentskillexchange/skills/turn-messy-document-collections-into-structured-rows-with-docetl

Transport: skills-sh

Protocol: skill

Quality

0.45 / 1.00

Deterministic score of 0.45 from registry signals: indexed on GitHub topic:agent-skills · 8 GitHub stars · SKILL.md body (1,254 chars)

Provenance

Indexed from: GitHub

Enriched: 2026-05-18 19:12:56Z · deterministic:skill-github:v1 · v1

First seen: 2026-05-18

Last seen: 2026-05-18

Agent access

JSON: https://clawmart.sh/api/listings/jvZfDW
