ATHING
Deep Cuts.
Technical writing on data engineering, AI infrastructure, automation, and strategy. No surface-level takes. No sponsored opinions. Just the stuff that actually matters when you're building.
Data Architecture
Idempotency in Data Pipelines: The Property That Separates Reliable Systems from Fragile Ones
Most pipeline failures aren't caused by bad code — they're caused by code that wasn't designed to run twice. Idempotency is the property that makes reruns safe, and most teams underestimate how hard it is to get right.
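The core idea is that a rerun must leave the target in the same state as a single run. A minimal sketch, using SQLite's `ON CONFLICT` upsert as a stand-in for whatever keyed-write mechanism your target store offers; the table and column names here are invented for illustration:

```python
import sqlite3

def load_daily_totals(conn, rows):
    """Idempotent load: a keyed upsert means re-running the same
    batch leaves the table exactly as one run would have."""
    conn.executemany(
        # ON CONFLICT turns a blind INSERT into an upsert, so a retry
        # or duplicate delivery cannot create duplicate rows.
        "INSERT INTO daily_totals (day, account, total) VALUES (?, ?, ?) "
        "ON CONFLICT (day, account) DO UPDATE SET total = excluded.total",
        rows,
    )
    conn.commit()

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE daily_totals (day TEXT, account TEXT, total REAL, "
    "PRIMARY KEY (day, account))"
)
batch = [("2024-06-01", "acme", 120.0), ("2024-06-01", "globex", 75.5)]
load_daily_totals(conn, batch)
load_daily_totals(conn, batch)  # simulated rerun: same input, no duplicates
count = conn.execute("SELECT COUNT(*) FROM daily_totals").fetchone()[0]
```

The hard part in real pipelines is choosing the key: without a natural key that uniquely identifies each output row, there is nothing for the upsert to conflict on.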
Designing for Backfill: The Capability Your Pipeline Needs Before It Goes Live
Designing for Backfill: The Capability Your Pipeline Needs Before It Goes Live
Every data pipeline will eventually need to reprocess historical data. The ones that weren't designed for it make that day expensive, slow, and stressful.
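"Designed for backfill" usually comes down to one property: the processing date is an explicit parameter, not an implicit "now". A hedged sketch with hypothetical function names; reprocessing history is then just the daily entry point in a loop:

```python
from datetime import date, timedelta

def run_partition(day: date) -> str:
    """Process exactly one day's partition. A stand-in for a real
    transform; what matters is that the date is passed in, so the
    same code serves both the scheduled run and a backfill."""
    return f"processed {day.isoformat()}"

def backfill(start: date, end: date) -> list:
    """Reprocess a historical range by replaying the daily entry point."""
    out = []
    d = start
    while d <= end:
        out.append(run_partition(d))
        d += timedelta(days=1)
    return out

results = backfill(date(2024, 1, 1), date(2024, 1, 3))
```

Pipelines that call `now()` inside the transform cannot be replayed this way; that is the design decision that makes the eventual backfill cheap or expensive.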
Lakehouse vs Data Warehouse: When the Classic Architecture Isn't Enough
The data warehouse isn't dead. But for a growing number of organisations, it was never the right answer in the first place. Understanding when to move beyond it is one of the most important architectural decisions a data team makes.
Intelligent Automation
Orchestration vs Choreography: Two Patterns, One Hard Choice
Both patterns coordinate distributed work. They fail in very different ways, and most teams only discover which one they chose when something goes wrong.
Event-Driven vs Scheduled: Choosing the Right Trigger Model for Your Workflows
Scheduling is familiar. Event-driven is powerful. Choosing the wrong one for your context creates problems that compound over time and are surprisingly expensive to undo.
AI & Machine Learning
Why Most ML Models Never Reach Production
The model is rarely the problem. The infrastructure around it — pipelines, feature consistency, deployment tooling, monitoring — is where most AI projects quietly die.
RAG vs Fine-Tuning: A Practical Guide to Choosing the Right Approach
Both techniques make language models more useful for your specific domain. They solve different problems, and reaching for the wrong one wastes months and produces worse results.
Feature Stores: The Missing Layer Between Your Data Team and Your ML Team
Without a feature store, your data scientists recompute the same features in isolation. Your models train on data that doesn't match what they'll see in production. Both problems are expensive.
Business Intelligence
The Self-Serve BI Trap: Why Most Implementations Quietly Fail
Self-serve analytics sounds like the answer to every data team's capacity problem. In practice, most rollouts produce dashboards nobody trusts and questions nobody can answer.
The Metrics Layer: Why Your Business Logic Doesn't Belong in Your BI Tool
When the definition of 'revenue' lives inside a Looker model, a Tableau workbook, and a Metabase query — and they all disagree — you don't have a tooling problem. You have a metrics problem.
Systems Integration
The Strangler Fig Pattern: Migrating Legacy Systems Without the Big Bang
Big bang migrations fail more often than they succeed. The strangler fig pattern lets you replace a legacy system incrementally — without a single high-risk cutover date that keeps everyone awake.
API-First vs Event-Driven Integration: A Practical Decision Framework
APIs and event streams both move data between systems. The choice between them shapes your architecture for years and determines how your systems behave under failure.
Data Governance & Quality
Data Contracts: The Interface Between Teams That Nobody Wrote Down
When an upstream team changes a column name, the downstream pipeline breaks. Data contracts are how you prevent that — and how you assign accountability when it happens anyway.
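A data contract is just the expected schema written down and checked at the boundary. A minimal sketch, with an invented contract for a hypothetical `orders` feed; the point is that an upstream rename surfaces as a named violation instead of a broken pipeline downstream:

```python
# Hypothetical contract for an upstream 'orders' feed: columns and types
CONTRACT = {"order_id": int, "customer": str, "amount": float}

def check_contract(record: dict, contract: dict) -> list:
    """Return a list of violations instead of letting bad records
    flow silently into downstream transforms."""
    problems = []
    for col, typ in contract.items():
        if col not in record:
            problems.append(f"missing column: {col}")
        elif not isinstance(record[col], typ):
            problems.append(
                f"{col}: expected {typ.__name__}, "
                f"got {type(record[col]).__name__}"
            )
    return problems

# An upstream rename ('customer' -> 'customer_name') is caught at the boundary
bad = {"order_id": 1, "customer_name": "acme", "amount": 9.99}
violations = check_contract(bad, CONTRACT)
```

In practice the contract lives in version control and the check runs in CI or at ingestion, so the accountability question ("who changed the schema, and did they know?") has an answer.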
Data Quality Checks Belong in the Pipeline, Not the Dashboard
Finding a data quality issue in a dashboard means the bad data is already in production, downstream systems have already consumed it, and the damage is already done.
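Moving the check into the pipeline means running it between transform and load, and failing the run before anything is written. A hedged sketch with invented rules and field names; the mechanism, not the specific checks, is the point:

```python
def validate(rows):
    """Run quality checks between transform and load; raising here
    fails the pipeline run before bad data reaches the warehouse
    or any dashboard built on top of it."""
    for i, r in enumerate(rows):
        if r["amount"] < 0:
            raise ValueError(f"row {i}: negative amount {r['amount']}")
        if r["currency"] not in {"USD", "EUR", "GBP"}:
            raise ValueError(f"row {i}: unknown currency {r['currency']!r}")
    return rows

clean = validate([{"amount": 10.0, "currency": "USD"}])
try:
    validate([{"amount": -5.0, "currency": "USD"}])
    blocked = False
except ValueError:
    blocked = True  # the bad batch never reaches the load step
```

A failed run is an incident for the data team; a wrong number on a dashboard is an incident for the whole business.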
Custom Application Development
Build vs Buy for Data-Intensive Applications: A Framework for the Decision
Off-the-shelf tools work until they don't. Custom builds are expensive until they aren't. The decision is more nuanced than most teams make it, and the consequences last for years.
The Operational Data Store: Bridging the Gap Between Transactional and Analytical Systems
Your OLTP database can't handle analytical queries without slowing down. Your data warehouse can't handle real-time operational reads. The ODS sits between them — and most teams skip it until they can't.
Data Strategy Consulting
Your Data Strategy Needs a Cost Model, Not Just a Roadmap
A data roadmap tells you what you want to build. A cost model tells you what it will actually take. Most organisations have one and not the other, and the one they're missing is usually the one that matters.
The Data Maturity Model: Where Most Organisations Actually Are
Most companies rate themselves higher on the data maturity curve than the evidence supports. The gap between self-assessment and reality is where data budgets disappear.