Data Engineering Lab
Pipelines.
A hands-on resource for data engineers. Learn through real-world write-ups and access members-only deep-dives.
Learn
Deep-dive articles and members-only guides on dbt, Polars, and modern data stack architecture.
Lab notes
Honest write-ups from a working homelab — the experiments, the failures, and what actually ships to production.
Recent posts
- From Pandas to Polars: A Data Engineer's Migration Guide
A practical walkthrough of migrating production data pipelines from pandas to Polars — covering lazy frames, expressions, schema inference, encoding, and method-by-method translations.
- Salesforce Data Loader: From v56 to v65 with Bulk API 2.0 — A 4× Speedup 🔒 Members
How a few config.properties tweaks, a proper log4j2.properties, and a small PowerShell wrapper turned a slow nightly load into a fast async pipeline.
- Hexagonal Architecture in Python
Building a market data app with hexagonal architecture in Python, with a clean separation of concerns and maintainable code.