Skill · quality 0.45

Apache Tika Document Parser Agent

Extracts text and metadata from 1000+ file formats using the Apache Tika server REST API. Handles PDF OCR through Tesseract integration, Office document parsing, and email archive extraction with MIME type detection.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/apache-tika-document-parser-agent

What it does

Apache Tika Document Parser Agent

Installation

Requirements and caveats from upstream:

N.B. Docker is used for tests in tika-integration-tests. If Docker is not installed, those tests are skipped.

Basic usage or getting-started notes:

Parse a file in Java: see the upstream README linked below.
Source: https://github.com/apache/tika
Extracted from upstream docs: https://raw.githubusercontent.com/apache/tika/HEAD/README.md
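
The listing describes parsing through the Tika server REST API, so one way to parse a file from Java is to send it to a running server. The following is a minimal sketch, not the upstream example. It assumes a Tika server listening at `http://localhost:9998`, and the class and method names (`TikaRequests`, `text`, `textWithOcr`, `metadata`, `detect`) are hypothetical. The `/tika`, `/meta`, and `/detect/stream` paths and the `X-Tika-PDFOcrStrategy` and `X-Tika-OCRLanguage` headers follow the Tika server documentation as best understood here; check them against the server version you run. OCR also requires Tesseract to be installed on the server side.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpRequest.BodyPublishers;
import java.net.http.HttpResponse;
import java.net.http.HttpResponse.BodyHandlers;
import java.nio.file.Files;
import java.nio.file.Path;

public class TikaRequests {
    // Assumption: a Tika server is running locally on this address.
    public static final String DEFAULT_BASE = "http://localhost:9998";

    /** PUT /tika: ask the server for the plain-text content of a document. */
    public static HttpRequest text(String base, byte[] doc) {
        return HttpRequest.newBuilder(URI.create(base + "/tika"))
                .header("Accept", "text/plain")
                .PUT(BodyPublishers.ofByteArray(doc))
                .build();
    }

    /** PUT /tika with OCR hints for a scanned PDF (header names are an assumption to verify). */
    public static HttpRequest textWithOcr(String base, byte[] pdf, String language) {
        return HttpRequest.newBuilder(URI.create(base + "/tika"))
                .header("Accept", "text/plain")
                .header("Content-Type", "application/pdf")
                .header("X-Tika-PDFOcrStrategy", "ocr_only")
                .header("X-Tika-OCRLanguage", language)
                .PUT(BodyPublishers.ofByteArray(pdf))
                .build();
    }

    /** PUT /meta: ask the server for document metadata as JSON. */
    public static HttpRequest metadata(String base, byte[] doc) {
        return HttpRequest.newBuilder(URI.create(base + "/meta"))
                .header("Accept", "application/json")
                .PUT(BodyPublishers.ofByteArray(doc))
                .build();
    }

    /** PUT /detect/stream: ask the server to detect the MIME type from the bytes. */
    public static HttpRequest detect(String base, byte[] doc) {
        return HttpRequest.newBuilder(URI.create(base + "/detect/stream"))
                .PUT(BodyPublishers.ofByteArray(doc))
                .build();
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 1) {
            System.err.println("usage: java TikaRequests <file>");
            return;
        }
        byte[] doc = Files.readAllBytes(Path.of(args[0]));
        HttpClient client = HttpClient.newHttpClient();

        // Detect the MIME type first, then extract the text.
        HttpResponse<String> type = client.send(detect(DEFAULT_BASE, doc), BodyHandlers.ofString());
        HttpResponse<String> body = client.send(text(DEFAULT_BASE, doc), BodyHandlers.ofString());

        System.out.println("Detected type: " + type.body().trim());
        System.out.println(body.body());
    }
}
```

Keeping request construction separate from sending means the request shapes can be inspected without a server, and a caller can swap `text` for `textWithOcr` or `metadata` without touching the transport code.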

Source

Agent Skill Exchange

Capabilities

skill · source-agentskillexchange · skill-apache-tika-document-parser-agent · topic-agent-skills · topic-ai-agents · topic-ai-tools · topic-awesome-list · topic-claude-code · topic-codex · topic-cursor · topic-llm · topic-mcp · topic-npx-skills · topic-openclaw · topic-skills-catalog

Install

Install: npx skills add agentskillexchange/skills

Source: https://github.com/agentskillexchange/skills/tree/main/skills/apache-tika-document-parser-agent

skills.sh: https://skills.sh/agentskillexchange/skills/apache-tika-document-parser-agent

Transport: skills-sh

Protocol: skill

Quality

0.45 / 1.00

Deterministic score 0.45 from registry signals: indexed on GitHub topic:agent-skills · 8 GitHub stars · SKILL.md body (792 chars)

Provenance

Indexed from: github

Enriched: 2026-05-18 19:09:23Z · deterministic:skill-github:v1 · v1

First seen: 2026-05-18

Last seen: 2026-05-18

Agent access

JSON: https://clawmart.sh/api/listings/a3y2DE