Data Engineering Lab
Pipelines. Courses. Consulting.
A hands-on resource for data engineers — built by someone who has shipped production pipelines at Bloomberg and Mercer. Learn through real-world write-ups, access members-only deep-dives, or bring me in to solve your data infrastructure problems.
Learn
Deep-dive articles and members-only guides on dbt, Polars, Kafka, and modern data stack architecture.
Lab notes
Honest write-ups from a working homelab — the experiments, the failures, and what actually ships to production.
Hire
Available for freelance engagements and consulting. I build data infrastructure that investment teams trust with real numbers.
See my work →Recent posts
-
Polars Streaming and Out-of-Core: When Your Data Won't Fit
Practical guide to Polars' streaming engine — the 2025 rewrite, sinks, partitioned outputs, multiplexed pipelines, and how to process datasets that don't fit in RAM on a single node.
-
From Pandas to Polars: A Data Engineer's Migration Guide
A practical walkthrough of migrating production data pipelines from pandas to Polars — covering lazy frames, expressions, schema inference, encoding, and method-by-method translations.
-
Hexagonal Architecture in Python
Building a market data app with hexagonal architecture in Python. Clean separation of concerns and maintainable code.