·data-engineering-best-practices
{}

data-engineering-best-practices

Data engineering best practices: medallion architecture, dataset lifecycle, partitioning, file sizing, schema evolution, and append/overwrite/merge patterns across Polars, PyArrow, DuckDB, Delta Lake, and Iceberg. Use when designing production data pipelines or reviewing data platform decisions.

7Installs·2Trend·@legout

Installation

$npx skills add https://github.com/legout/data-platform-agent-skills --skill data-engineering-best-practices

How to Install data-engineering-best-practices

Quickly install data-engineering-best-practices AI skill to your development environment via command line

  1. Open Terminal: Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.)
  2. Run Installation Command: Copy and run this command: npx skills add https://github.com/legout/data-platform-agent-skills --skill data-engineering-best-practices
  3. Verify Installation: Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code, Cursor, or OpenClaw

Source: legout/data-platform-agent-skills.

SKILL.md

View raw

Use this skill for production architecture and standards decisions: storage layout, lifecycle, incremental semantics, schema evolution, quality checks, and cost/performance tradeoffs.

Do not skip Silver validation for convenience; silent quality drift is costly.

| Append | strictly new immutable events | simplest, cheapest | | Partition overwrite | deterministic reprocessing for date/key slice | safe for backfills | | Merge/Upsert | corrections/late updates/deletes | needs key + conflict semantics |

Data engineering best practices: medallion architecture, dataset lifecycle, partitioning, file sizing, schema evolution, and append/overwrite/merge patterns across Polars, PyArrow, DuckDB, Delta Lake, and Iceberg. Use when designing production data pipelines or reviewing data platform decisions. Source: legout/data-platform-agent-skills.

Facts (cite-ready)

Stable fields and commands for AI/search citations.

Install command
npx skills add https://github.com/legout/data-platform-agent-skills --skill data-engineering-best-practices
Category
{}Data Analysis
Verified
First Seen
2026-02-22
Updated
2026-03-10

Browse more skills from legout/data-platform-agent-skills

Quick answers

What is data-engineering-best-practices?

Data engineering best practices: medallion architecture, dataset lifecycle, partitioning, file sizing, schema evolution, and append/overwrite/merge patterns across Polars, PyArrow, DuckDB, Delta Lake, and Iceberg. Use when designing production data pipelines or reviewing data platform decisions. Source: legout/data-platform-agent-skills.

How do I install data-engineering-best-practices?

Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/legout/data-platform-agent-skills --skill data-engineering-best-practices Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code, Cursor, or OpenClaw

Where is the source repository?

https://github.com/legout/data-platform-agent-skills