Data infrastructure

The Zero-Dependency Data Architecture Blueprint

The DuckDB + Parquet + Python position: why managed platforms extract value at contract renewal, and the exact stack that eliminates the exposure.

Stack evaluation

The Anti-Hype Tech Stack Decision Matrix

Comparison tables for every major tooling decision, a 20-point over-engineering checklist, and a translation glossary for vendor sales phrases.

Migration reference

Migrating Postgres and Redshift to DuckDB

Diagnostic baseline, compatibility matrix, step-by-step migration sequence, and the failure modes that appear in every migration at scale.

Compliance reference

GCC Data Compliance Field Reference

Data localisation requirements and technical implementation for UAE, Saudi Arabia, and Bahrain. PDPL coverage, cloud region mapping, and breach notification timelines.

dbt reference

The dbt Production Handbook

What no dbt tutorial covers: project structure, the twelve model anti-patterns, a strategic testing framework, Slim CI, and documentation that gets read.

Cost analysis

The Data Engineering Cloud Cost Playbook

Where the money actually leaks: a 20-point audit, AWS and GCP high-impact cuts, Databricks cost controls, and a monitoring setup that catches anomalies before the invoice.

Architecture analysis

Data Mesh Without the Hype

What data mesh actually requires, when it is genuinely justified, and how to implement it without burning down the architecture you already have.

Privacy and sovereignty

Privacy-First Data Engineering

PII risk audit, pseudonymisation toolkit with Python code, GDPR requirements as engineering tasks, and the analytics-safe data layer pattern.
