Archives

All Posts

Creativity Loves Constraints: Lessons from the Data Engineering Trenches

April 16, 2025

We love to dream big in data. Real-time everything. Auto-scaling infrastructure. Infinite flexibility. A tool for every use case. But back in the real world? You’re dealing with budget approvals, half-documented APIs, slow dashboards, and a team of two trying to wrangle fifteen tools. And that’s not a failure; that’s the job. In fact, that’s…
Microsoft Fabric: Finally a Way to Get Sh*t Done in Data Without Fighting the Stack

March 31, 2025

I recently joined an organization that runs entirely on the Microsoft stack—a shift for me, coming from AWS environments where I relied on third-party tools for data integration and orchestration. Frankly, I knew this was going to be a challenge. In the past, working with native Microsoft cloud tools meant stitching together brittle pipelines, jumping…
Do You Really Need Data Modeling? A Practical Look

February 5, 2025

For years, data modeling has been the foundation of structured reporting, ensuring performance, consistency, and efficiency. But today, the landscape has changed. With cheap storage, powerful processing, and modern BI tools that enable flexible, real-time analysis, is data modeling still necessary, or has it become just one of many options? Many organizations, especially startups, are…
Insights, Not Infrastructure: The True Goal of Data Engineering

January 17, 2025

“No one wants to use software. They just want to catch Pokémon.” This quote from The Staff Engineer’s Path nails a key truth: people don’t care about the tools, just the results. In data engineering, this couldn’t be more relevant. Business teams don’t want to wrestle with raw data or learn SQL; they want clear,…
Demystifying Real-Time Reporting

December 23, 2024

Real-time reporting is about making decisions based on data the moment it’s created. As businesses strive for faster insights, BI teams are often tasked with handling these requests, particularly in lean tech startups where developer resources are stretched thin. However, assigning these requests to BI teams often results in frustration and inefficiency. To deliver effective…
Streamline Your API Workflows with DuckDB

November 27, 2024

DuckDB outperforms Pandas for API integrations by addressing key pain points: it enforces schema consistency, prevents data type mismatches, and handles deduplication efficiently with built-in database operations. Unlike Pandas, DuckDB offers persistent local storage, enabling you to work beyond memory constraints and handle large datasets seamlessly. It also supports downstream SQL transformations and exports to…
Unlocking Spanish Fluency: Avoiding Common Pitfalls with Polysemous Words

October 31, 2024

Polysemous words, such as “get” or “put,” carry multiple meanings in English, making them versatile and efficient in conversation. For instance, “get” can mean to retrieve something (“I’ll get that”), to understand something (“I don’t get it”), or to arrive somewhere (“When will we get there?”). This flexibility makes polysemous words powerful tools in English,…
Revolutionizing Data Engineering: The Zero ETL Movement

September 24, 2024

Imagine you’re a chef running a bustling restaurant. In the traditional world of data (or in this case, food), you’d order ingredients from various suppliers, wait for deliveries, sort through shipments, and prep everything before you can even start cooking. It’s time-consuming, prone to errors, and by the time the dish reaches your customers, those…
The Modern Data Stack: Still Too Complicated

August 30, 2024

In the quest to make data-driven decisions, what seems like a straightforward process of moving data from source systems to a central analytical workspace often explodes in complexity and overhead. This post explores why the modern data stack remains too complicated and how various tools and services attempt to address these challenges today.
Boost Your Spanish Vocabulary: Using ChatGPT for Effective Mnemonics

July 15, 2024

Imagine trying to remember the Spanish word for in-laws — suegros. Instead of rote memorization, picture your in-laws swaying side to side in a silly manner, while you watch with an exaggerated expression of disgust. This humorous scene, combined with the phonetic cue sway gross, creates a vivid mental image that effortlessly etches the word…