Skillquality 0.45

Unstructured Document Partitioning and ETL Library for LLM Pipelines

Unstructured is an open-source library for ingesting and partitioning PDFs, HTML, Office documents, emails, and other unstructured inputs into structured elements and metadata. It is commonly used as a preprocessing layer for RAG, search, extraction, and downstream AI pipelines.

Price
free
Protocol
skill
Verified
no

What it does

Unstructured Document Partitioning and ETL Library for LLM Pipelines

Unstructured is an open-source library for ingesting and partitioning PDFs, HTML, Office documents, emails, and other unstructured inputs into structured elements and metadata. It is commonly used as a preprocessing layer for RAG, search, extraction, and downstream AI pipelines.

Prerequisites

Python 3.11+

Installation

Use the upstream install or setup path that matches your environment:

  • docker pull downloads.unstructured.io/unstructured-io/unstructured:latest
  • docker run -dt --name unstructured downloads.unstructured.io/unstructured-io/unstructured:latest
  • docker exec -it unstructured bash
  • make docker-build

Requirements and caveats from upstream:

  • <a href="https://github.com/Unstructured-IO/unstructured/blob/main/LICENSE.md">https://pypi.python.org/pypi/unstructured/</a>
  • <a href="https://pypi.python.org/pypi/unstructured/">https://pypi.python.org/pypi/unstructured/</a>
  • <a href="https://pypi.python.org/pypi/unstructured/">https://github.com/Naereen/badges/</a>

Basic usage or getting-started notes:

Documentation

Source

Capabilities

skillsource-agentskillexchangeskill-unstructured-document-partitioning-etl-library-llm-pipelinestopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,847 chars)

Provenance

Indexed fromgithub
Enriched2026-05-18 19:12:59Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18

Agent access