Tabula PDF Table Extractor
Extracts structured tables from PDF documents using tabula-java, with lattice and stream detection modes. Outputs CSV, JSON, or pandas DataFrames, with automatic column type inference via tabula-py.
What it does
Extracts structured tables from PDF documents using tabula-java, with lattice and stream detection modes. Outputs CSV, JSON, or pandas DataFrames, with automatic column type inference via tabula-py.
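As a rough illustration of the two detection modes, the sketch below assumes the Python wrapper is the tabula-py package (imported as `tabula`) and that its `read_pdf` function accepts `lattice`, `stream`, and `pages` keyword arguments. The helper names `mode_kwargs` and `extract_tables` are hypothetical and not part of any upstream API.

```python
from typing import Any, Dict, List


def mode_kwargs(mode: str) -> Dict[str, bool]:
    """Map a detection mode name to lattice/stream keyword arguments."""
    if mode not in ("lattice", "stream"):
        raise ValueError(f"unknown mode: {mode!r}")
    # Exactly one of the two flags is True.
    return {"lattice": mode == "lattice", "stream": mode == "stream"}


def extract_tables(pdf_path: str, mode: str = "lattice", pages: str = "all") -> List[Any]:
    """Return the tables found in pdf_path (assumed: one DataFrame per table)."""
    # Lazy import: needs tabula-py installed and a Java runtime on PATH.
    import tabula

    return tabula.read_pdf(pdf_path, pages=pages, **mode_kwargs(mode))
```

A call might look like `tables = extract_tables("report.pdf", mode="stream")`, where `report.pdf` is a placeholder file name. Lattice mode generally suits tables with ruled cell borders, while stream mode relies on whitespace between columns.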
Installation
Requirements and caveats from upstream:
Basic usage or getting-started notes:
- Commandline Usage Examples
- [-g] [-h] [-i] [-l] [-n] [-o <OUTFILE>] [-p <PAGES>] [-r] [-s <PASSWORD>] [-t] [-u] [-v]
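The option list above can be turned into a command line programmatically. The following is a minimal sketch that builds an argument vector using only flags that appear in the fragment. It assumes that `-l` selects lattice mode, `-t` selects stream mode, `-g` enables area guessing, and that `-p`, `-o`, and `-s` take the pages, output file, and password. The function name `build_tabula_args` and the default jar path `tabula.jar` are hypothetical.

```python
from typing import List, Optional


def build_tabula_args(
    pdf_path: str,
    jar_path: str = "tabula.jar",  # hypothetical location of the tabula-java jar
    pages: Optional[str] = None,
    outfile: Optional[str] = None,
    password: Optional[str] = None,
    lattice: bool = False,
    stream: bool = False,
    guess: bool = False,
) -> List[str]:
    """Build an argv list for invoking tabula-java through `java -jar`."""
    if lattice and stream:
        raise ValueError("choose either lattice or stream, not both")
    args = ["java", "-jar", jar_path]
    if lattice:
        args.append("-l")  # assumed: force lattice-mode extraction
    if stream:
        args.append("-t")  # assumed: force stream-mode extraction
    if guess:
        args.append("-g")  # assumed: guess the table area on each page
    if pages is not None:
        args += ["-p", pages]
    if outfile is not None:
        args += ["-o", outfile]
    if password is not None:
        args += ["-s", password]
    args.append(pdf_path)  # the input PDF goes last
    return args
```

The resulting list can be passed to `subprocess.run(args, check=True)`. Passing a list rather than a shell string avoids quoting problems with file names that contain spaces.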
Extracted from upstream docs: https://raw.githubusercontent.com/tabulapdf/tabula-java/HEAD/README.md
Quality
Deterministic score 0.45 from registry signals: indexed on GitHub topic:agent-skills · 8 GitHub stars · SKILL.md body (924 chars)