Tabula PDF Table Extraction Agent
Uses Tabula Java library via tabula-py bindings to detect and extract tables from PDF documents. Supports both lattice and stream extraction modes with configurable area coordinates and outputs to pandas DataFrames or CSV.
What it does
Tabula PDF Table Extraction Agent
Uses Tabula Java library via tabula-py bindings to detect and extract tables from PDF documents. Supports both lattice and stream extraction modes with configurable area coordinates and outputs to pandas DataFrames or CSV.
Installation
Requirements and caveats from upstream:
Basic usage or getting-started notes:
-
Commandline Usage Examples
-
[-g] [-h] [-i] [-l] [-n] [-o <OUTFILE>] [-p <PAGES>] [-r] [-s
-
<PASSWORD>] [-t] [-u] [-v]
-
Extracted from upstream docs: https://raw.githubusercontent.com/tabulapdf/tabula-java/HEAD/README.md
Source
Capabilities
Install
Quality
deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (957 chars)