Airflow / Prefect / Dagster
Orchestrate and monitor pipelines.
Data Engineering · Pipelines & Warehousing
Data engineering is the backbone of modern data-driven organisations. This roadmap covers the skills to design, build, and maintain robust pipelines, warehouses, and infrastructure at scale.
What you'll get
Learn how to build reliable data systems: clean ingestion, trustworthy models, and production-ready pipelines with monitoring and governance.
This track moves quickly. Having the basics will help you focus on system design and scale.
Recommended before you start
A modern data stack blends orchestration, modelling, and scalable compute — then deploys it cleanly.
Airflow / Prefect / Dagster
Orchestrate and monitor pipelines.
dbt
Transform, test, and document analytics models.
Airbyte / Fivetran
Connector-based ingestion at speed.
PostgreSQL / MySQL
Operational sources + deep SQL practice.
Snowflake / BigQuery / Redshift
Warehouses and scalable analytics.
Spark + Kafka
Big data processing and streaming.
A clear phase-based path — we intentionally don't show weeks so you can progress at your pace.
Build end-to-end systems that demonstrate engineering maturity: reliability, performance, and clean modelling.
Reliable batch pipeline
Warehouse modelling with dbt
Real-time stream + CDC
Cloud deployment
Share your background and goals — we'll recommend the best starting phase and a first project to build.