Writing - Articles & Decisions | Ridzwan Haron

April 26, 2026 Building an Airflow + S3 Pipeline That Runs Entirely on Your Laptop How I built a production-style Airflow 3.2.1 data ingestion pipeline with S3, without needing a single AWS credential, thanks to LocalStack and Docker.

data-engineeringairflowdockers3localstack

Read article April 23, 2025 How the Tableau Metadata API Saved Me from Manual Impact Analysis When upstream changes threatened dozens of dashboards, the Metadata API turned hours of manual tracing into a single query.

data-engineeringtableauimpact-analysis

Read article April 22, 2025 Parquet and Iceberg: Why I Stopped Treating My Data Lake Like a Folder Columnar file formats and table abstractions turned my messy S3 data lake into something I can actually query and trust.

data-engineeringparqueticebergdata-lake

Read article April 21, 2025 Redshift Taught Me That Storage Format Matters More Than Query Logic Why moving from row-based to columnar storage in Redshift was the single biggest performance leap I've seen.

data-engineeringredshiftaws

Read article April 20, 2025 What dbt Actually Does for You dbt didn't just change how I write SQL — it changed how I think about data transformations.

data-engineeringdbt