Skillquality 0.45

Apache Spark DataFrame ETL Pipeline

Automates PySpark DataFrame transformations including schema inference, partition pruning, and Delta Lake merge operations. Integrates with AWS Glue Data Catalog and Apache Iceberg table formats for lakehouse architectures.

Price
free
Protocol
skill
Verified
no

What it does

Apache Spark DataFrame ETL Pipeline

Automates PySpark DataFrame transformations including schema inference, partition pruning, and Delta Lake merge operations. Integrates with AWS Glue Data Catalog and Apache Iceberg table formats for lakehouse architectures.

Installation

Requirements and caveats from upstream:

  • high-level APIs in Scala, Java, Python, and R (Deprecated), and an optimized engine that
  • Interactive Python Shell

  • Alternatively, if you prefer Python, you can use the Python shell:

Basic usage or getting-started notes:

Source

Capabilities

skillsource-agentskillexchangeskill-spark-dataframe-etl-pipelinetopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (935 chars)

Provenance

Indexed fromgithub
Enriched2026-05-18 19:12:35Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18

Agent access