Tips and Tricks #188: Use dbt for Maintainable Data Transformations
Build modular, tested, documented data transformations with dbt.
Designing intelligent systems, one layer at a time.
Build modular, tested, documented data transformations with dbt.
Use table partitioning to dramatically speed up queries on large datasets.
Use MERGE (upsert) for safe, rerunnable data pipelines that handle duplicates gracefully.
Calculate running totals, rankings, and moving averages efficiently with SQL window functions.
When LinkedIn open-sourced Kafka in 2011, few predicted it would become the de facto standard for real-time data streaming. Fourteen years later, Kafka processes trillions of messages daily across organizations…
Something remarkable happened in the Python ecosystem over the past year. After decades of incremental improvements, we’ve witnessed a fundamental shift in how data engineers approach their craft. The tools…