Lakehouse table formats add ACID transactions, schema evolution, and time travel to data lakes stored on object storage (S3, GCS, Azure Blob Storage). This skill covers the three major open table formats: Delta Lake, Apache Iceberg, and Apache Hudi.
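
To make the transaction and time-travel ideas concrete, here is a minimal toy model in plain Python. It is a sketch under simplifying assumptions, not the on-disk layout or API of any of the three formats: `ToyTable`, `commit`, and `snapshot` are hypothetical names, and the file names are made up. It only illustrates the general pattern of an append-only commit log in which each commit records the data files added and removed, so that any past version can be rebuilt by replaying the log.

```python
from dataclasses import dataclass, field


@dataclass
class ToyTable:
    """Toy append-only commit log (illustrative only, not a real table format)."""

    commits: list = field(default_factory=list)

    def commit(self, add=(), remove=()):
        # A commit becomes visible all at once when it is appended to the log.
        self.commits.append({"add": list(add), "remove": list(remove)})
        return len(self.commits) - 1  # version number of the new commit

    def snapshot(self, version=None):
        # Replay the log up to `version` to find the files that make up the table.
        if version is None:
            version = len(self.commits) - 1
        live = set()
        for entry in self.commits[: version + 1]:
            live |= set(entry["add"])
            live -= set(entry["remove"])
        return sorted(live)


table = ToyTable()
v0 = table.commit(add=["part-000.parquet"])
v1 = table.commit(add=["part-001.parquet"])
# Rewriting part-000 adds a replacement file and removes the old one in one commit.
v2 = table.commit(add=["part-002.parquet"], remove=["part-000.parquet"])

latest = table.snapshot()       # files visible at the newest version
first = table.snapshot(v0)      # "time travel" back to the first commit
print(latest)
print(first)
```

Because old files are only marked as removed in the log rather than overwritten, earlier versions stay readable until some cleanup process deletes them, which is what makes time travel possible in this model.
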

| Feature | Delta Lake | Apache Iceberg | Apache Hudi |
|---|---|---|---|
| ACID Transactions | ✅ | ✅ | ✅ |
| Time Travel | ✅ | ✅ | ✅ |
| Schema Evolution | ✅ | Advanced (branching) | ✅ |
| Primary Ecosystem | Spark/Databricks | Engine-agnostic | Spark (CDC focus) |
| Write Optimization | Copy-on-write | CoW, Merge-on-Read | CoW, Merge-on-Read |
| Python API | deltalake (pure), PySpark | pyiceberg (pure) | PySpark only |

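
The "Write Optimization" row contrasts copy-on-write (CoW) with merge-on-read (MoR). The following plain-Python sketch shows the trade-off in miniature. It is a conceptual illustration only: the dictionaries stand in for data files, and `cow_update`, `mor_update`, and `mor_read` are hypothetical helpers, not functions from Delta Lake, Iceberg, or Hudi.

```python
def cow_update(base_file, updates):
    """Copy-on-write: rewrite the whole file with the updates applied."""
    new_file = dict(base_file)  # copy every row, changed or not
    new_file.update(updates)
    return new_file             # readers simply read the new file


def mor_update(delta_log, updates):
    """Merge-on-read: append the changes to a small delta log; the base file is untouched."""
    return delta_log + [updates]


def mor_read(base_file, delta_log):
    """Readers pay the cost instead: merge the base file with the deltas at query time."""
    merged = dict(base_file)
    for delta in delta_log:
        merged.update(delta)
    return merged


base = {1: "alice", 2: "bob", 3: "carol"}  # row id -> value
updates = {2: "robert"}

cow_file = cow_update(base, updates)       # expensive write, cheap read
deltas = mor_update([], updates)           # cheap write, more work on read
mor_view = mor_read(base, deltas)

assert cow_file == mor_view                # both strategies expose the same data
```

In this model CoW does more work per write and keeps reads simple, while MoR keeps writes small and shifts the merge to readers, which is why MoR is usually paired with some form of periodic compaction.
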
Lakehouse table formats: Delta Lake, Apache Iceberg, and Apache Hudi for ACID transactions, schema evolution, and time travel on data lakes. Source: legout/data-platform-agent-skills.