DP-601T00 Implementing a Lakehouse with Microsoft Fabric
This course builds foundational data engineering skills on Microsoft Fabric, with a focus on the lakehouse concept. It covers Apache Spark for distributed data processing, Delta Lake tables for efficient data management, versioning, and reliability, and data ingestion and orchestration with Dataflows Gen2 and Data Factory pipelines. A combination of lectures and hands-on exercises prepares you to work with lakehouses in Microsoft Fabric.
By the end of this course, learners will be able to:
- Explain Microsoft Fabric fundamentals and Lakehouse architecture.
- Create and manage a Fabric Lakehouse with files and tables.
- Ingest and prepare data using Dataflows Gen2.
- Process and query data with Apache Spark, Spark SQL, and SQL.
- Create and manage Delta Lake tables for reliability.
- Orchestrate data pipelines with Data Factory.
- Apply the Medallion (Bronze, Silver, Gold) architecture.
- Visualize Lakehouse data in Spark notebooks and Power BI.
The primary audience for this course is data professionals who are familiar with data modeling, extraction, and analytics and who want to learn about Lakehouse architecture, the Microsoft Fabric platform, and how to enable end-to-end analytics with these technologies.
You should be familiar with basic data concepts and terminology.
- Explore end-to-end analytics with Microsoft Fabric
- Data teams and Microsoft Fabric
- Enable and use Microsoft Fabric
- Explore the Microsoft Fabric Lakehouse
- Work with Microsoft Fabric Lakehouses
- Explore and transform data in a lakehouse
- Prepare to use Apache Spark
- Run Spark code
- Work with data in a Spark dataframe
- Work with data using Spark SQL
- Visualize data in a Spark notebook
- Understand Delta Lake
- Create delta tables
- Work with delta tables in Spark
- Use delta tables with streaming data
- Understand Dataflows Gen2 in Microsoft Fabric
- Explore Dataflows Gen2 in Microsoft Fabric
- Integrate Dataflows Gen2 and Pipelines in Microsoft Fabric
- Understand pipelines
- Use the Copy Data activity
- Use pipeline templates
- Run and monitor pipelines