{"id":"892a024b-c5d3-40de-972a-da226cd16026","shortId":"dDtEQd","kind":"skill","title":"Parquet Schema Extractor for S3","tagline":"Extracts and validates Parquet file schemas from Amazon S3 using the PyArrow library and AWS S3 SDK (boto3). Compares schemas across multiple partitions to detect schema drift and incompatible type changes. Outputs a schema diff report with partition paths and affected column det","description":"# Parquet Schema Extractor for S3\n\nExtracts and validates Parquet file schemas from Amazon S3 using the PyArrow library and AWS S3 SDK (boto3). Compares schemas across multiple partitions to detect schema drift and incompatible type changes. Outputs a schema diff report with partition paths and affected column details.\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- $ npm install parquetjs\n\nRequirements and caveats from upstream:\n- This project requires a major overhaul, as well as handling and sorting through dozens of issues and prs.\n- fully asynchronous, pure node.js implementation of the Parquet file format\n- To use parquet.js with node.js, install it using npm:\n\nBasic usage or getting-started notes:\n- ------------\n- --------------------\n- Once you have installed the parquet.js library, you can import it as a single\n\n- Source: https://github.com/ironSource/parquetjs\n- Extracted from upstream docs: https://raw.githubusercontent.com/ironSource/parquetjs/HEAD/README.md\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/parquet-schema-extractor-for-s3/)","tags":["parquet","schema","extractor","for","skills","agentskillexchange","agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex"],"capabilities":["skill","source-agentskillexchange","skill-parquet-schema-extractor-for-s3","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/parquet-schema-extractor-for-s3","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,135 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:11:41.474Z","embedding":null,"createdAt":"2026-05-18T13:18:17.629Z","updatedAt":"2026-05-18T19:11:41.474Z","lastSeenAt":"2026-05-18T19:11:41.474Z","tsv":"'/ironsource/parquetjs':178 '/ironsource/parquetjs/head/readme.md':185 '/skills/parquet-schema-extractor-for-s3/)':192 'across':26,74 'affect':46,94 'agent':187 'agentskillexchange.com':191 'agentskillexchange.com/skills/parquet-schema-extractor-for-s3/)':190 'amazon':13,61 'asynchron':136 'aw':20,68 'basic':154 'boto3':23,71 'caveat':114 'chang':36,84 'column':47,95 'compar':24,72 'det':48 'detail':96 'detect':30,78 'diff':40,88 'doc':182 'dozen':130 'drift':32,80 'environ':108 'exchang':189 'extract':6,54,179 'extractor':3,51 'file':10,58,143 'format':144 'fulli':135 'get':158 'getting-start':157 'github.com':177 'github.com/ironsource/parquetjs':176 'handl':126 'implement':139 'import':170 'incompat':34,82 'instal':97,101,110,150,164 'issu':132 'librari':18,66,167 'major':121 'match':106 'multipl':27,75 'node.js':138,149 'note':160 'npm':109,153 'output':37,85 'overhaul':122 'parquet':1,9,49,57,142 'parquet.js':147,166 'parquetj':111 'partit':28,43,76,91 'path':44,92,104 'project':118 'prs':134 'pure':137 'pyarrow':17,65 'raw.githubusercontent.com':184 'raw.githubusercontent.com/ironsource/parquetjs/head/readme.md':183 'report':41,89 'requir':112,119 's3':5,14,21,53,62,69 'schema':2,11,25,31,39,50,59,73,79,87 'sdk':22,70 'setup':103 'singl':174 'skill':188 'skill-parquet-schema-extractor-for-s3' 'sort':128 'sourc':175,186 'source-agentskillexchange' 'start':159 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'type':35,83 'upstream':100,116,181 'usag':155 'use':15,63,98,146,152 'valid':8,56 'well':124","prices":[{"id":"9f941bc8-3b85-4d2d-86e0-be4ac3005e32","listingId":"892a024b-c5d3-40de-972a-da226cd16026","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:18:17.629Z"}],"sources":[{"listingId":"892a024b-c5d3-40de-972a-da226cd16026","source":"github","sourceId":"agentskillexchange/skills/parquet-schema-extractor-for-s3","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/parquet-schema-extractor-for-s3","isPrimary":false,"firstSeenAt":"2026-05-18T13:18:17.629Z","lastSeenAt":"2026-05-18T19:11:41.474Z"}],"details":{"listingId":"892a024b-c5d3-40de-972a-da226cd16026","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"parquet-schema-extractor-for-s3","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"90278c98782dda0cd0460a258a809ba1669d24a8","skill_md_path":"skills/parquet-schema-extractor-for-s3/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/parquet-schema-extractor-for-s3"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"Parquet Schema Extractor for S3","description":"Extracts and validates Parquet file schemas from Amazon S3 using the PyArrow library and AWS S3 SDK (boto3). Compares schemas across multiple partitions to detect schema drift and incompatible type changes. Outputs a schema diff report with partition paths and affected column details."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/parquet-schema-extractor-for-s3"},"updatedAt":"2026-05-18T19:11:41.474Z"}}