Module-5 – Azure Lakehouse
Module-5 – Azure Lakehouse
🎯 Objectives
- Secure & configure ADLS Gen2 storage.
- Use Azure Databricks & Unity Catalog for governance.
- Implement Bronze/Silver/Gold medallion pattern.
🗓️ Weekly Plan
- Week 17 – ADLS Gen2: hierarchical namespace, RBAC, firewall.
- Week 18 – Databricks: workspace, clusters, notebooks, jobs.
- Week 19 – Unity Catalog: metastore, schemas, permissions.
- Week 20 – Lakehouse best practices & folder layout.
🔑Key Concepts
ADLS Gen2 Features
- Hierarchical namespace, storage tiers, lifecycle policies.
Security & Networking
- RBAC vs POSIX ACLs, service principals, private endpoints.
Databricks Administration
- All-purpose vs job clusters, autoscaling, init scripts.
Unity Catalog Governance
- Centralized metastore, grant/revoke, lineage.
Medallion Architecture
- Bronze (raw), Silver (clean), Gold (aggregates).
🔨 Mini-Projects
- Secure ADLS: implement ACLs, private endpoint, firewall rules.
- DBX Ingest Job: schedule notebook to load into Bronze.
- Governance Demo: set up Unity Catalog roles & access.
📚 Resources
- ADLS Gen2 documentation
- Databricks Unity Catalog guide
- Microsoft Fabric Lakehouse whitepapers