Skillquality 0.45

Extract structured text, metadata, tables, and images from mixed documents through an MCP server with Kreuzberg

Expose one document-extraction surface to MCP-compatible agents so they can normalize PDFs, Office files, images, HTML, and other mixed inputs before downstream review or indexing.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/extract-structured-text-metadata-tables-and-images-from-mixed-documents-through-an-mcp-server-with-kreuzberg

What it does

Extract structured text, metadata, tables, and images from mixed documents through an MCP server with Kreuzberg

Expose one document-extraction surface to MCP-compatible agents so they can normalize PDFs, Office files, images, HTML, and other mixed inputs before downstream review or indexing.

Prerequisites

Kreuzberg install or container image, document files to process, MCP-compatible client

Installation

Use the upstream install or setup path that matches your environment:

npx skills add kreuzberg-dev/kreuzberg

Requirements and caveats from upstream:

<img src="https://img.shields.io/pypi/v/kreuzberg?label=Python&color=007ec6" alt="Python">
<a href="https://www.npmjs.com/package/@kreuzberg/node">
<img src="https://img.shields.io/npm/v/@kreuzberg/node?label=Node.js&color=007ec6" alt="Node.js">

Basic usage or getting-started notes:

Each language binding provides comprehensive documentation with examples and best practices. Choose your platform to get started:
Scripting Languages:
Ruby – RubyGems package, idiomatic Ruby API, native bindings
Source: https://github.com/kreuzberg-dev/kreuzberg
Extracted from upstream docs: https://raw.githubusercontent.com/kreuzberg-dev/kreuzberg/HEAD/README.md

Documentation

https://github.com/kreuzberg-dev/kreuzberg#readme

Source

Agent Skill Exchange

Capabilities

skillsource-agentskillexchangeskill-extract-structured-text-metadata-tables-and-images-from-mixed-documents-through-an-mcp-server-with-kreuzbergtopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Installnpx skills add agentskillexchange/skills

Sourcehttps://github.com/agentskillexchange/skills/tree/main/skills/extract-structured-text-metadata-tables-and-images-from-mixed-documents-through-an-mcp-server-with-kreuzberg

skills.shhttps://skills.sh/agentskillexchange/skills/extract-structured-text-metadata-tables-and-images-from-mixed-documents-through-an-mcp-server-with-kreuzberg

Transportskills-sh

Protocolskill

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,574 chars)

Provenance

Indexed fromgithub

Enriched2026-05-18 19:10:25Z · deterministic:skill-github:v1 · v1

First seen2026-05-18

Last seen2026-05-18

Agent access

JSONhttps://clawmart.sh/api/listings/T3CDwY