πŸ”’ Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided DP-600 Domain 2
Domain 2 β€” Module 1 of 14 7%
8 of 29 overall

DP-600 Study Guide

Domain 1: Maintain a Data Analytics Solution

  • Workspace Access Controls
  • Row-Level & Object-Level Security
  • Sensitivity Labels & Endorsement
  • Git Version Control in Fabric
  • Deployment Pipelines: Dev β†’ Test β†’ Prod
  • Impact Analysis & Dependencies
  • XMLA Endpoint & Reusable Assets

Domain 2: Prepare Data

  • Microsoft Fabric: The Big Picture Free
  • Lakehouses: Your Data Foundation Free
  • Warehouses in Fabric Free
  • Choosing the Right Data Store Free
  • Data Connections & OneLake Catalog
  • Shortcuts & OneLake Integration
  • Ingesting Data: Dataflows Gen2 & Pipelines
  • Star Schema Design Free
  • SQL Objects: Views, Functions & Stored Procedures
  • Transforming Data: Reshape & Enrich
  • Data Quality & Cleansing
  • Querying with SQL
  • Querying with KQL
  • Querying with DAX

Domain 3: Implement and Manage Semantic Models

  • Semantic Models: Storage Modes
  • Relationships & Advanced Modeling
  • DAX Essentials: Variables & Functions
  • Calculation Groups & Field Parameters
  • Large Models & Composite Models
  • Direct Lake Mode
  • DAX Performance Optimization
  • Incremental Refresh

DP-600 Study Guide

Domain 1: Maintain a Data Analytics Solution

  • Workspace Access Controls
  • Row-Level & Object-Level Security
  • Sensitivity Labels & Endorsement
  • Git Version Control in Fabric
  • Deployment Pipelines: Dev β†’ Test β†’ Prod
  • Impact Analysis & Dependencies
  • XMLA Endpoint & Reusable Assets

Domain 2: Prepare Data

  • Microsoft Fabric: The Big Picture Free
  • Lakehouses: Your Data Foundation Free
  • Warehouses in Fabric Free
  • Choosing the Right Data Store Free
  • Data Connections & OneLake Catalog
  • Shortcuts & OneLake Integration
  • Ingesting Data: Dataflows Gen2 & Pipelines
  • Star Schema Design Free
  • SQL Objects: Views, Functions & Stored Procedures
  • Transforming Data: Reshape & Enrich
  • Data Quality & Cleansing
  • Querying with SQL
  • Querying with KQL
  • Querying with DAX

Domain 3: Implement and Manage Semantic Models

  • Semantic Models: Storage Modes
  • Relationships & Advanced Modeling
  • DAX Essentials: Variables & Functions
  • Calculation Groups & Field Parameters
  • Large Models & Composite Models
  • Direct Lake Mode
  • DAX Performance Optimization
  • Incremental Refresh
Domain 2: Prepare Data Free ⏱ ~14 min read

Microsoft Fabric: The Big Picture

What is Microsoft Fabric, why does it exist, and how does OneLake unify all your analytics data? Your foundation for everything in DP-600.

What is Microsoft Fabric?

β˜• Simple explanation

Imagine a shopping mall for data.

Before Fabric, doing analytics was like visiting separate shops scattered across town β€” one shop for storing raw files, another for building data warehouses, a third for creating reports, and a fourth for running machine learning. Each shop had its own parking lot (storage), its own loyalty card (security), and its own opening hours (management).

Microsoft Fabric is the shopping mall that puts all these shops under one roof. One parking lot (OneLake), one loyalty card (Entra ID), one management office (admin portal). The shops still specialise β€” but they share everything underneath.

For the DP-600 exam, you need to understand the mall layout and know which shop to visit for each task.

Microsoft Fabric is a unified SaaS analytics platform that integrates data engineering, data warehousing, real-time intelligence, data science, and business intelligence into a single product. It is built on top of OneLake β€” a single, organisation-wide data lake that all Fabric workloads share.

Fabric is not a rebrand of existing services. It is a new architecture that replaces the need to stitch together Azure Data Lake Storage, Azure Synapse Analytics, Azure Data Factory, and Power BI as separate services. Lakehouse and warehouse data is stored in OneLake as Delta Parquet format, and all compute engines β€” Spark, SQL, KQL, and the Analysis Services engine β€” can access it. OneLake can also store non-Delta files (CSV, JSON, images) in the Files section of a lakehouse.

Key architectural decisions: Fabric is capacity-based (F SKUs or Power BI Premium P SKUs), uses Entra ID for identity, and supports governance through Microsoft Purview integration. Workspaces are the unit of collaboration, access control, and deployment.

The problem Fabric solves

Before Fabric, enterprise analytics looked like this:

ChallengeOld WorldFabric World
StorageMultiple storage accounts, duplicated data everywhereOneLake β€” one copy of data, shared across all engines
SecurityDifferent permissions per service, hard to auditEntra ID everywhere, sensitivity labels, workspace RBAC
Data movementCopy data between services (expensive, slow, error-prone)Engines read directly from OneLake β€” no copies needed
GovernanceSeparate tools for lineage, classification, complianceMicrosoft Purview integration across all Fabric items
BillingPer-service billing, hard to predict costsCapacity-based β€” one pool of compute for everything
πŸ’‘ Scenario: Meet our four analytics professionals

Throughout this course, you will follow four characters facing real Fabric challenges:

πŸ›’ Anita Patel β€” Senior Analytics Engineer at FreshCart, a national grocery chain with 2,000+ stores. She builds sales dashboards, demand forecasting, and inventory analytics. Her data: daily POS transactions, supplier feeds, loyalty programme data.

🏒 James Okafor β€” BI Architect at Summit Consulting, a 400-person consulting firm managing analytics for 15+ clients. He designs multi-workspace environments with strict governance and reusable templates.

πŸ₯ Dr. Sarah Nguyen β€” Health Data Analyst at Pacific Health Network, a 6,000-person hospital system. She builds patient outcome dashboards and clinical reporting while navigating strict compliance requirements.

πŸ’° Raj Mehta β€” Fabric Platform Admin at Atlas Capital, a mid-size financial services firm. He manages Fabric capacity, monitors performance, and ensures every report meets audit requirements.

OneLake: One lake to rule them all

OneLake is the foundation of everything in Fabric. Every workspace, every lakehouse, every warehouse writes to OneLake.

Key OneLake concepts

  • One per tenant β€” your entire organisation shares a single OneLake (like OneDrive is one drive per user, OneLake is one lake per tenant)
  • Delta Parquet format β€” table data in lakehouses and warehouses is stored as Delta tables (Parquet files with a transaction log). OneLake can also hold non-Delta files.
  • No data duplication β€” Spark, SQL, KQL, and Power BI all read the same physical files
  • Shortcuts β€” virtual pointers to data in external storage (Azure Data Lake, S3, Google Cloud Storage) without copying it into OneLake
  • Governed by default β€” Entra ID authentication, sensitivity labels, and Purview integration from day one
πŸ’‘ Exam tip: OneLake vs Azure Data Lake Storage

The exam may compare OneLake with Azure Data Lake Storage Gen2 (ADLS Gen2). Key differences: OneLake is tenant-scoped (one per org, not per subscription), uses Delta format by default (not raw Parquet/CSV), and is managed by Fabric (no storage account setup needed). ADLS Gen2 is still used for external data accessed via shortcuts.

Fabric workloads at a glance

Fabric bundles several workloads under one roof. Each workload has its own engine but shares OneLake storage.

All six workloads share OneLake storage and Entra ID security
WorkloadWhat It DoesEngineKey Items
Data EngineeringBuild and manage lakehouses, notebooks, Spark jobsApache SparkLakehouse, Notebook, Spark Job
Data WarehouseSQL-based analytics warehouse with full T-SQL supportFabric SQL EngineWarehouse, SQL Endpoint
Data FactoryData integration β€” pipelines and Dataflows Gen2 for ETL/ELTData movement enginePipeline, Dataflow Gen2
Power BISemantic models, reports, dashboards for business usersAnalysis ServicesSemantic Model, Report, Dashboard
Real-Time IntelligenceStreaming analytics and time-series dataKQL (Kusto)Eventhouse, KQL Queryset
Data ScienceMachine learning experiments and modelsApache Spark + ML librariesExperiment, ML Model

Which workload for which task?

If you need to…Use this workload
Store raw files and build Delta tablesData Engineering (Lakehouse)
Write complex SQL queries with joins and aggregationsData Warehouse
Move data from external sources into FabricData Factory (Pipeline or Dataflow Gen2)
Build interactive dashboards for business usersPower BI
Analyse streaming IoT or event data in near real-timeReal-Time Intelligence
Train and deploy ML modelsData Science

Workspaces: Your unit of everything

Workspaces are how you organise Fabric items. Think of them as project folders with built-in security and collaboration.

  • One workspace = one team or project β€” all related items live together
  • Access control β€” workspace roles (Admin, Member, Contributor, Viewer) control who can do what
  • Git integration β€” connect a workspace to a Git repo for version control
  • Deployment pipelines β€” promote items from Dev β†’ Test β†’ Prod workspaces
  • Capacity assignment β€” each workspace runs on an assigned Fabric capacity (F SKU)
πŸ’‘ Scenario: James sets up a client workspace

James at Summit Consulting creates a separate workspace for each client engagement. Client A’s workspace has its own lakehouse, warehouse, and semantic models β€” completely isolated from Client B. He assigns each workspace to the appropriate capacity and configures Git integration so his team can track changes.

This pattern β€” one workspace per client or project β€” is the foundation of multi-tenant governance in Fabric.

Fabric items you need to know

ItemWhat It IsWhere It Lives
LakehouseA storage container with Delta tables and a SQL analytics endpointData Engineering
WarehouseA fully managed SQL data warehouseData Warehouse
Semantic ModelA Power BI data model that defines measures, relationships, and business logicPower BI
NotebookAn interactive coding environment (PySpark, SQL, R)Data Engineering / Data Science
PipelineAn orchestration tool that moves and transforms data on a scheduleData Factory
Dataflow Gen2A low-code/no-code data transformation tool (Power Query Online)Data Factory
EventhouseA real-time analytics database for streaming dataReal-Time Intelligence
KQL QuerysetA Kusto Query Language query editor for Eventhouse dataReal-Time Intelligence
Question

What is OneLake?

Click or press Enter to reveal answer

Answer

OneLake is Microsoft Fabric's single, tenant-wide data lake. All Fabric workloads β€” Spark, SQL, KQL, Power BI β€” read from and write to the same OneLake storage. Data is stored in open Delta Parquet format. There is one OneLake per organisation (like OneDrive is one drive per user).

Click to flip back

Question

What format does OneLake store data in by default?

Click or press Enter to reveal answer

Answer

Delta Parquet. Delta tables use Parquet files for columnar storage plus a transaction log for ACID transactions. This open format means any engine (Spark, SQL, KQL) can read the same data without conversion.

Click to flip back

Question

How many OneLakes does an organisation have?

Click or press Enter to reveal answer

Answer

One. OneLake is tenant-scoped β€” the entire organisation shares a single OneLake instance. Individual workspaces, lakehouses, and warehouses are containers within that one OneLake.

Click to flip back

Question

What is the difference between a Fabric workspace and a capacity?

Click or press Enter to reveal answer

Answer

A workspace is a logical container for organising Fabric items (lakehouses, models, reports) with role-based access control. A capacity is the compute power (F SKU) that runs the workloads. Workspaces are assigned to capacities β€” one capacity can serve multiple workspaces.

Click to flip back

Knowledge Check

Anita at FreshCart needs to store raw CSV files from 2,000 stores, transform them into structured tables, and build Power BI dashboards. She wants a single platform without managing separate Azure services. Which statement best describes what Microsoft Fabric offers?

Knowledge Check

James at Summit Consulting manages analytics for 15 clients. He needs to ensure each client's data is isolated with its own access controls. Which Fabric concept should he use as the primary isolation boundary?

Knowledge Check

Raj at Atlas Capital is evaluating Fabric. His CTO asks: 'What stops us from needing to copy financial data between services?' Which OneLake feature addresses this concern?

🎬 Video coming soon


Next up: Lakehouses: Your Data Foundation β€” the core storage item in Fabric, where your raw data becomes structured Delta tables.

← Previous

XMLA Endpoint & Reusable Assets

Next β†’

Lakehouses: Your Data Foundation

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.