Data engineering garden

Ankur Kelkar

Data Architect & Analytics Engineer

10+ years of building robust data infrastructures, pipelines, and analytical platforms. Over a decade of turning massive raw data into resilient, clean analytics pipelines and cost-efficient cloud infrastructures.

Core Technology Expertise

A look at the languages, databases, and orchestration platforms I deploy to orchestrate production analytics architectures.

Databricks

Unified analytics engine, Delta Lake, Delta Live Tables (DLT), and Spark compute optimization.

Snowflake

Enterprise data warehousing, multi-cluster compute, tasks, stored procedures, and zero-copy clones.

dbt

Analytics engineering, dimensional modeling, incremental strategies, and mesh architectures.

Python

ETL pipelines, custom extraction packages, pandas, PySpark scripting, and automated workflows.

AWS & Clouds

S3 lake architectures, Lambda serverless, SQS/SNS pipelines, EventBridge scheduler, and GCP/GCS setups.

PostgreSQL & SQL

Advanced query tuning, indexing, stored procedures, execution plan analysis, and migration.

Featured Tutorial Series

Witty, comprehensive, and documentation-cited step-by-step guides written for builders who want to master data tooling.

New Release

๐Ÿš€ Databricks Lakehouse: Zero to Hero

Master the modern Databricks Lakehouse platform. In this 10-part guide, we cover cluster configurations, Delta table optimizations, streaming with Auto Loader, Delta Live Tables (DLT) pipelines, Unity Catalog security, and workflows scheduling.

Delta Lake Spark SQL Unity Catalog
10-PART COMPREHENSIVE SERIES Read Series โ†’
Deep-Dive

๐Ÿš€ ClickHouse Mastery

Master the fastest analytical column store database on the planet. From columnar compression under the hood to index modeling with MergeTree engines, materializing views, cluster sharding, and raw query optimizations.

OLAP MergeTree Performance
10-PART EXPANDED SERIES (10-15m READS) Read Series โ†’

About Ankur

I'm a Data Architect with over 10 years of experience designing data pipelines, data mesh architectures, and cost optimization initiatives. I have designed pipelines and data platforms for companies in travel tech, marketing tech, and healthcare analytics, achieving significant compute savings and slashing data latency.