data-engineering
Data engineering patterns for ETL pipelines, data warehousing, Apache Spark, and data quality validation
Content Preview
---
name: data-engineering
description: Data engineering patterns for ETL pipelines, data warehousing, Apache Spark, and data quality validation
---
# Data Engineering
## ETL Pipeline Pattern
```python
from datetime import datetime
from dataclasses import dataclass


@dataclass
class PipelineResult:
    records_extracted: int
    records_transformed: int
    records_loaded: int
    errors: list[str]
    duration_seconds: float


class OrderPipeline:
    def __init__(self, source_db, warehouse_d…
```
How to Use
Recommended: Install to project (local)
mkdir -p .claude/skills
curl -o .claude/skills/data-engineering.md \
  https://raw.githubusercontent.com/rohitg00/awesome-claude-code-toolkit/main/skills/data-engineering/SKILL.md
Skill is scoped to this project only. Add .claude/skills/ to your .gitignore if you don't want to commit it.
Alternative: Clone full repo
git clone https://github.com/rohitg00/awesome-claude-code-toolkit
Then reference at skills/data-engineering/SKILL.md
Related Skills
data-engineering-data-driven-feature
Build features guided by data insights, A/B testing, and continuous measurement using specialized agents for analysis, implementation, and experimentation.
data-ai · data · engineering · driven
by sickn33 (Antigravity) · antigravity-awesome-skills
data-engineering-data-pipeline
You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective data pipelines for batch and streaming data processing.
infrastructure · data · engineering · pipeline
by sickn33 (Antigravity) · antigravity-awesome-skills
data
Data engineering for Apache Airflow and Astronomer. Author DAGs with best practices, debug pipeline failures, trace data lineage, profile tables, migrate Airflow 2 to 3, and manage local and cloud deployments.
plugin · plugin · marketplace · data
by Anthropic · anthropic-official-plugins
data-engineer
Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms.
infrastructure · data
by sickn33 (Antigravity) · antigravity-awesome-skills