Module-6 – ADF & Synapse

Module-6 – ADF & Synapse

🎯 Objectives

  • Build advanced ETL/ELT pipelines in Azure Data Factory.
  • Tune Synapse serverless & dedicated SQL pools for performance & cost.
  • Automate CI/CD for data workloads.

🗓️ Weekly Plan

  • Week 21 – ADF fundamentals: pipelines, activities, parameters.
  • Week 22 – Mapping Data Flows: transformations & expression builder.
  • Week 23 – Synapse Analytics: serverless vs dedicated tuning.
  • Week 24 – CI/CD: ARM templates, Git integration, releases.

🔑Key Concepts

ADF Core Concepts

  • Linked services, datasets, pipelines, triggers.

Integration Runtimes

  • Azure vs self-hosted IR, cost & performance trade-offs.

Mapping Data Flows

  • Derived columns, aggregations, join behavior.

Synapse Analytics

  • Serverless SQL pool vs dedicated SQL pool: MPP, DWUs, distributions.

CI/CD Practices

  • ARM templates, Git branch strategies, release pipelines.

🔨 Mini-Projects

  • End-to-End ADF: API → ADLS → Databricks → Synapse.
  • Complex Data Flow: multi-step transformation & error handling.
  • Pipeline CI: deploy via Azure DevOps or GitHub Actions.

📚 Resources

  • Azure Data Factory documentation
  • Azure Synapse performance tuning guide
  • Azure DevOps Pipelines docs