Profile and clean large CSV datasets from the terminal with qsv
Inspect, profile, normalize, and diff large CSV files before loading them into downstream analytics or automation workflows.
What it does
qsv is a command-line toolkit for working with CSV data. This skill uses it to inspect, profile, normalize, and diff large CSV files in the terminal before they are loaded into downstream analytics or automation workflows.
Prerequisites
A qsv binary on your PATH and one or more CSV datasets to work on.
Installation
Use the upstream install or setup path that matches your environment:
- cargo install qsv --locked --features all_features
- brew install qsv
- conda install conda-forge::qsv
- git clone https://github.com/dathere/qsv.git
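One way to pick among the install paths listed above is to check which tools are already on the PATH. The sketch below only prints a suggestion and installs nothing. The function name suggest_qsv_install is hypothetical, and the order of preference is an assumption, not upstream guidance.

```shell
#!/bin/sh
# Hypothetical helper: print the install command from the list above
# that matches the first package manager found on PATH.
suggest_qsv_install() {
    if command -v qsv >/dev/null 2>&1; then
        echo "qsv is already installed"
    elif command -v brew >/dev/null 2>&1; then
        echo "brew install qsv"
    elif command -v conda >/dev/null 2>&1; then
        echo "conda install conda-forge::qsv"
    else
        # No known package manager found: fall back to a source checkout.
        echo "git clone https://github.com/dathere/qsv.git"
    fi
}

suggest_qsv_install
```

The function always prints exactly one line, so its output can be reviewed before anything is run.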
Requirements and caveats from upstream:
- py: Create a new computed column or filter rows by evaluating a Python expression on every row of a CSV file. Python's [f-strings](https://www.freecodecamp.org/news/python-f-strings-tutori...
- Commands marked 🔣 in the upstream README require UTF-8 encoded input.
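Because some commands require UTF-8 input, it can help to check a file's encoding before handing it to qsv. The sketch below uses the standard iconv utility rather than any qsv feature. The function name is_utf8, the sample file names, and the ISO-8859-1 source encoding are all assumptions made for illustration.

```shell
#!/bin/sh
# is_utf8 FILE: succeed only if FILE decodes cleanly as UTF-8.
is_utf8() {
    iconv -f UTF-8 -t UTF-8 "$1" >/dev/null 2>&1
}

# Two throwaway samples: the same rows in UTF-8 and in ISO-8859-1.
printf 'name,city\nZo\303\253,Z\303\274rich\n' > utf8_sample.csv
printf 'name,city\nZo\353,Z\374rich\n' > latin1_sample.csv

for f in utf8_sample.csv latin1_sample.csv; do
    if is_utf8 "$f"; then
        echo "$f: valid UTF-8"
    else
        # Assumption: the non-UTF-8 file is ISO-8859-1. Adjust -f to the
        # real source encoding of your data.
        echo "$f: not UTF-8, converting"
        iconv -f ISO-8859-1 -t UTF-8 "$f" > "${f%.csv}.utf8.csv"
    fi
done
```

The converted copy is written next to the original, so the source file is left untouched.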
Basic usage or getting-started notes:
- luau: Create multiple new computed columns, filter rows, compute aggregations and build complex...
- moarstats: Add up to an additional 55 statistical measures, including extended outlier, robust & bivariate statistics to an existing stats CSV file. ([example](docs/moarstats/NYC_31...
- schema: Infer either a JSON Schema Validation Draft 2020-12 ([Example](https://gith...
Source: https://github.com/dathere/qsv
Extracted from upstream docs: https://raw.githubusercontent.com/dathere/qsv/HEAD/README.md
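The notes above name individual commands without showing a full pass over a file. As a rough sketch of an inspect, profile, clean, and diff sequence, the example below uses the qsv subcommands headers, count, stats, frequency, dedup, and diff. It assumes a full qsv build in which all of them are available. The file names and sample rows are invented for illustration.

```shell
#!/bin/sh
# Small throwaway dataset so the sketch is self-contained.
printf 'id,city,amount\n1,Oslo,10\n2,Lima,25\n2,Lima,25\n3,Pune,\n' > orders_raw.csv

if command -v qsv >/dev/null 2>&1; then
    qsv headers orders_raw.csv                    # list column names
    qsv count orders_raw.csv                      # number of data rows
    qsv stats orders_raw.csv                      # per-column summary statistics
    qsv frequency orders_raw.csv                  # value counts per column
    qsv dedup orders_raw.csv > orders_clean.csv   # drop duplicate rows
    qsv diff orders_raw.csv orders_clean.csv      # show what cleaning changed
else
    echo "qsv not found on PATH; skipping profiling steps"
fi
```

Writing the cleaned rows to a new file keeps the raw input available for the final diff.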
Quality
deterministic score 0.45 from registry signals: indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,791 chars)