Parquet Column Statistics Profiler
Profiles Apache Parquet files using pyarrow metadata APIs to extract column statistics, row group distributions, and encoding efficiency metrics without reading full datasets.
What it does
Parquet Column Statistics Profiler
Profiles Apache Parquet files using pyarrow metadata APIs to extract column statistics, row group distributions, and encoding efficiency metrics without reading full datasets.
Installation
Use the upstream install or setup path that matches your environment:
- $ npm install parquetjs
Requirements and caveats from upstream:
- This project requires a major overhaul, as well as handling and sorting through dozens of issues and prs.
- fully asynchronous, pure node.js implementation of the Parquet file format
- To use parquet.js with node.js, install it using npm:
Basic usage or getting-started notes:
-
Once you have installed the parquet.js library, you can import it as a single
-
Extracted from upstream docs: https://raw.githubusercontent.com/ironSource/parquetjs/HEAD/README.md
Source
Capabilities
Install
Quality
deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,031 chars)