🔒 Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided DP-900 Domain 4
Domain 4 — Module 3 of 8 38%
22 of 27 overall

DP-900 Study Guide

Domain 1: Core Data Concepts

  • Your First Look at Data Free
  • Data File Formats: CSV, JSON, Parquet & More Free
  • Databases: Relational vs Non-Relational Free
  • Transactional Workloads: Keeping Data Consistent Free
  • Analytical Workloads: Finding the Insights Free
  • Data Roles: DBA, Engineer & Analyst Free
  • The Azure Data Landscape Free

Domain 2: Relational Data on Azure

  • Relational Data: Tables, Keys & Relationships
  • Normalization: Why Duplicate Data is Bad
  • SQL Basics: SELECT, INSERT, UPDATE, DELETE
  • Database Objects: Views, Indexes & More
  • Azure SQL: Your Database in the Cloud
  • Open-Source Databases on Azure
  • Choosing the Right Azure Database

Domain 3: Non-Relational Data on Azure

  • Azure Blob Storage: Files in the Cloud
  • Azure Files & Table Storage
  • Azure Cosmos DB: The Global Database
  • Cosmos DB APIs: SQL, MongoDB & More
  • Choosing Non-Relational Storage

Domain 4: Analytics on Azure

  • Data Ingestion & Processing
  • Analytical Data Stores: Data Lakes, Warehouses & Lakehouses
  • Microsoft Fabric & Azure Databricks
  • Batch vs Streaming: Two Speeds of Data
  • Real-Time Analytics on Azure
  • Power BI: See Your Data
  • Data Models in Power BI
  • Choosing the Right Visualization

DP-900 Study Guide

Domain 1: Core Data Concepts

  • Your First Look at Data Free
  • Data File Formats: CSV, JSON, Parquet & More Free
  • Databases: Relational vs Non-Relational Free
  • Transactional Workloads: Keeping Data Consistent Free
  • Analytical Workloads: Finding the Insights Free
  • Data Roles: DBA, Engineer & Analyst Free
  • The Azure Data Landscape Free

Domain 2: Relational Data on Azure

  • Relational Data: Tables, Keys & Relationships
  • Normalization: Why Duplicate Data is Bad
  • SQL Basics: SELECT, INSERT, UPDATE, DELETE
  • Database Objects: Views, Indexes & More
  • Azure SQL: Your Database in the Cloud
  • Open-Source Databases on Azure
  • Choosing the Right Azure Database

Domain 3: Non-Relational Data on Azure

  • Azure Blob Storage: Files in the Cloud
  • Azure Files & Table Storage
  • Azure Cosmos DB: The Global Database
  • Cosmos DB APIs: SQL, MongoDB & More
  • Choosing Non-Relational Storage

Domain 4: Analytics on Azure

  • Data Ingestion & Processing
  • Analytical Data Stores: Data Lakes, Warehouses & Lakehouses
  • Microsoft Fabric & Azure Databricks
  • Batch vs Streaming: Two Speeds of Data
  • Real-Time Analytics on Azure
  • Power BI: See Your Data
  • Data Models in Power BI
  • Choosing the Right Visualization
Domain 4: Analytics on Azure Premium ⏱ ~14 min read

Microsoft Fabric & Azure Databricks

Microsoft Fabric is the modern analytics platform. Azure Databricks is the big data powerhouse. Learn what each does and when to choose one over the other.

The two big analytics platforms

☕ Simple explanation

Microsoft Fabric is the all-in-one analytics kitchen. Azure Databricks is the specialised gourmet workshop.

Fabric has everything in one room — oven, fridge, dishwasher, dining table. You can prepare food, cook, plate, and serve without leaving the kitchen. Databricks is a professional-grade workshop for expert chefs — more powerful tools for complex dishes, but you bring your own plates and dining room.

Most teams start with Fabric. Teams with advanced data engineering or data science needs may use Databricks alongside it.

Microsoft Fabric is a SaaS analytics platform that unifies data engineering, data warehousing, real-time analytics, and business intelligence into a single service — all built on a shared storage layer (OneLake). Azure Databricks is a PaaS platform built on Apache Spark, optimised for large-scale data engineering, data science, and machine learning. Both handle large-scale analytics, but with different approaches to management, integration, and target audience.

Microsoft Fabric

Fabric is Microsoft’s unified analytics platform — a single SaaS service that replaces the need for multiple separate analytics tools.

What’s inside Fabric:

WorkloadWhat It Does
Data FactoryBuild data pipelines (ETL/ELT) with 200+ connectors
Data EngineeringApache Spark for large-scale data processing
Data WarehouseSQL-based analytical queries on structured data
Real-Time IntelligenceStream processing and real-time analytics
Data ScienceMachine learning experiments and model training
Power BIDashboards, reports, and data visualisation

The unifying layer: OneLake All Fabric workloads store data in OneLake — a single, organisation-wide data lake. No data duplication between services. A data engineer writes to OneLake, and a data analyst queries the same data in Power BI — no copies, no movement.

Priya’s FreshMart uses Fabric for everything:

  • Data Factory pipelines pull nightly data from 50 stores
  • Data Engineering cleans and transforms using Spark notebooks
  • Data Warehouse serves structured queries
  • Power BI dashboards connect directly to the warehouse
  • All data lives in one OneLake — no silos

Azure Databricks

Databricks is an Apache Spark-based analytics platform — jointly developed by Microsoft and Databricks, Inc. It runs on Azure infrastructure but is managed by Databricks.

Key capabilities:

  • Collaborative notebooks — Python, SQL, Scala, R in shared workspaces
  • Apache Spark — distributed data processing for massive datasets
  • Delta Lake — ACID transactions on data lakes
  • Machine learning — MLflow for experiment tracking, model management
  • Unity Catalog — data governance across all workspaces

When Databricks excels:

  • Advanced data engineering with complex Spark jobs
  • Data science and ML model training at scale
  • Multi-cloud environments (also runs on AWS and GCP)
  • Teams with deep Spark/Python expertise
Microsoft Fabric vs Azure Databricks
FeatureMicrosoft FabricAzure Databricks
TypeSaaS (fully managed)PaaS (Spark-based)
AudienceAll data roles (analysts to engineers)Data engineers and data scientists
StorageOneLake (built-in, unified)ADLS Gen2 (external, you configure)
Power BIIntegrated nativelyConnect externally
SparkIncluded (Fabric Spark)Core strength (Databricks Spark)
Multi-cloudAzure onlyAzure, AWS, GCP
Learning curveLower (guided experiences)Higher (requires Spark knowledge)
Best forEnd-to-end analytics in one platformAdvanced data engineering and ML
ℹ️ Can you use both together?

Absolutely — and many organisations do. A common pattern:

  • Databricks for complex data engineering and ML model training (where Spark expertise shines)
  • Fabric for data warehousing, real-time analytics, and Power BI reporting (where the integrated experience shines)
  • OneLake shortcuts connect Fabric to Databricks storage, so data flows between them without copying

Fabric and Databricks aren’t competitors for every scenario — they complement each other.

ℹ️ What about Azure Synapse Analytics?

You may see Azure Synapse Analytics mentioned in older materials. Synapse was Microsoft’s previous unified analytics service — it combined SQL pools, Spark pools, and data integration.

Microsoft Fabric is the newer unified analytics platform, positioned as the preferred choice for most new analytics projects. Fabric takes the same ideas (unified analytics, SQL + Spark, integrated pipelines) and delivers them as a simpler SaaS experience with OneLake. Some Synapse features (like dedicated SQL pools) still exist for existing workloads.

For the DP-900 exam, focus on Fabric as the current platform. You may see Synapse mentioned in passing but Fabric is what’s tested.

💡 Exam tip: Fabric vs Databricks selection
  • “End-to-end analytics platform” → Fabric
  • “Unified storage with OneLake” → Fabric
  • “Power BI integrated natively” → Fabric
  • “Advanced Spark-based data engineering” → Databricks
  • “Machine learning at scale” → Databricks
  • “Multi-cloud (Azure + AWS + GCP)” → Databricks
  • “SaaS with minimal setup” → Fabric

Flashcards

Question

What is Microsoft Fabric?

Click or press Enter to reveal answer

Answer

A SaaS analytics platform that unifies data engineering, data warehousing, real-time analytics, data science, and Power BI — all on a single storage layer called OneLake. It's the all-in-one analytics solution on Azure.

Click to flip back

Question

What is Azure Databricks?

Click or press Enter to reveal answer

Answer

An Apache Spark-based analytics platform for large-scale data engineering, data science, and machine learning. It runs on Azure infrastructure, supports Python/SQL/Scala/R notebooks, and works across Azure, AWS, and GCP.

Click to flip back

Question

What is OneLake?

Click or press Enter to reveal answer

Answer

Microsoft Fabric's unified storage layer — a single, organisation-wide data lake built on ADLS Gen2. All Fabric workloads (engineering, warehouse, Power BI) share OneLake, eliminating data silos and copies.

Click to flip back

Knowledge check

Knowledge Check

FreshMart wants a single platform for data pipelines, warehousing, real-time analytics, and Power BI dashboards — with minimal infrastructure management. Which service?

Knowledge Check

A data science team needs to train machine learning models using Apache Spark across massive datasets, and they also need the platform to work on AWS (not just Azure). Which service?

🎬 Video coming soon

Next up: Batch vs Streaming: Two Speeds of Data — not all data arrives the same way.

← Previous

Analytical Data Stores: Data Lakes, Warehouses & Lakehouses

Next →

Batch vs Streaming: Two Speeds of Data

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.