πŸ”’ Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided DP-420 Domain 1
Domain 1 β€” Module 1 of 11 9%
1 of 28 overall

DP-420 Study Guide

Domain 1: Design and Implement Data Models

  • Cosmos DB β€” The Big Picture Free
  • Designing Your Data Model Free
  • Partition Key Strategy Free
  • Synthetic and Hierarchical Partition Keys Free
  • Relationships β€” Embedding vs Referencing Free
  • SDK Connectivity and Client Configuration Free
  • SDK CRUD Operations and Transactions Free
  • SQL Queries in Cosmos DB Free
  • SDK Query Pagination and LINQ Free
  • Server-Side Programming Free
  • Transactions in Practice Free

Domain 2: Design and Implement Data Distribution

  • Global Replication and Failover
  • Consistency Levels: Five Choices, Real Trade-Offs
  • Multi-Region Writes and Conflict Resolution

Domain 3: Integrate and Move Data

  • Change Feed with Azure Functions and Processors
  • Analytical Workloads: Synapse Link and Fabric Mirroring
  • Data Movement: ADF, Kafka, and Spark Connectors

Domain 4: Optimize Query and Operation Performance

  • Indexing Policies: Range, Spatial, and Composite
  • Request Units and Query Cost Optimization
  • Integrated Cache and Dedicated Gateway
  • Change Feed Patterns: Materialized Views and Estimator

Domain 5: Maintain an Azure Cosmos DB Solution

  • Monitoring: Metrics, Logs, and Alerts
  • Backup and Restore: Periodic vs Continuous
  • Network Security: Firewalls, VNets, and Private Endpoints
  • Data Security: Encryption, Keys, and RBAC
  • Cost Optimization: Throughput Modes and RU Strategy
  • DevOps: Infrastructure as Code and Deployments
  • Exam Strategy and Cross-Domain Review

DP-420 Study Guide

Domain 1: Design and Implement Data Models

  • Cosmos DB β€” The Big Picture Free
  • Designing Your Data Model Free
  • Partition Key Strategy Free
  • Synthetic and Hierarchical Partition Keys Free
  • Relationships β€” Embedding vs Referencing Free
  • SDK Connectivity and Client Configuration Free
  • SDK CRUD Operations and Transactions Free
  • SQL Queries in Cosmos DB Free
  • SDK Query Pagination and LINQ Free
  • Server-Side Programming Free
  • Transactions in Practice Free

Domain 2: Design and Implement Data Distribution

  • Global Replication and Failover
  • Consistency Levels: Five Choices, Real Trade-Offs
  • Multi-Region Writes and Conflict Resolution

Domain 3: Integrate and Move Data

  • Change Feed with Azure Functions and Processors
  • Analytical Workloads: Synapse Link and Fabric Mirroring
  • Data Movement: ADF, Kafka, and Spark Connectors

Domain 4: Optimize Query and Operation Performance

  • Indexing Policies: Range, Spatial, and Composite
  • Request Units and Query Cost Optimization
  • Integrated Cache and Dedicated Gateway
  • Change Feed Patterns: Materialized Views and Estimator

Domain 5: Maintain an Azure Cosmos DB Solution

  • Monitoring: Metrics, Logs, and Alerts
  • Backup and Restore: Periodic vs Continuous
  • Network Security: Firewalls, VNets, and Private Endpoints
  • Data Security: Encryption, Keys, and RBAC
  • Cost Optimization: Throughput Modes and RU Strategy
  • DevOps: Infrastructure as Code and Deployments
  • Exam Strategy and Cross-Domain Review
Domain 1: Design and Implement Data Models Free ⏱ ~14 min read

Cosmos DB β€” The Big Picture

Understand the Azure Cosmos DB resource hierarchy, NoSQL API, throughput models, global distribution, SLAs, the emulator, and when to choose free tier vs serverless vs provisioned.

What is Azure Cosmos DB?

β˜• Simple explanation

Think of Cosmos DB as a planet-scale filing cabinet. You have one account (the office), which holds databases (rooms), which hold containers (filing drawers), which hold items (individual documents). The magic? You can replicate your filing cabinet to offices around the world, and anyone reads from the nearest copy β€” in single-digit milliseconds.

You pay for how fast the drawers open (throughput in RU/s) and how much paper you store (GB). That’s really it.

Azure Cosmos DB is a fully managed, globally distributed, multi-model NoSQL database. Key characteristics:

  • Multi-model: NoSQL (Core), MongoDB, Cassandra, Gremlin, Table β€” all on the same engine. The DP-420 exam focuses on the NoSQL API.
  • Turnkey global distribution: Replicate to any Azure region with a click. Multi-region writes available.
  • Elastic scale: Throughput and storage scale independently, with no practical upper limit.
  • Guaranteed SLAs: Latency, availability, throughput, and consistency β€” backed by financial SLAs.
  • Schema-agnostic: Items are JSON documents. No schema enforcement at the engine level.

Priya’s scenario: choosing Cosmos DB for NovaSaaS

πŸš€ Priya at NovaSaaS is architecting a multi-tenant SaaS platform serving 200K users across 12 countries. Her requirements:

  • Low latency globally β€” users in Sydney, London, and SΓ£o Paulo all expect sub-10ms reads
  • Elastic scale β€” Black Friday traffic is 5Γ— normal; she can’t over-provision year-round
  • Multi-tenant isolation β€” each tenant’s data stays logically separated
  • 99.99%+ availability β€” SaaS customers expect always-on

Cosmos DB checks every box. Let’s see how it’s structured.

Resource hierarchy

Every Cosmos DB deployment follows this hierarchy:

Azure Subscription
  └── Cosmos DB Account          ← endpoint + keys, billing root
        └── Database              ← logical namespace, optional shared throughput
              └── Container       ← the "table" β€” partitioned, indexed, queried
                    └── Item      ← a JSON document (max 2 MB)
LevelWhat it doesKey facts
AccountTop-level resource. Holds the endpoint URI and keys.One account can hold multiple APIs (but you choose one API at creation).
DatabaseLogical grouping of containers.Optional: you can provision shared throughput at this level.
ContainerWhere data lives. Think of it as a table or collection.Has a partition key, indexing policy, TTL, unique keys, throughput.
ItemA single JSON document.Max 2 MB per item. Identified by id + partition key value.

Priya creates one account, one database novasaas, and separate containers per domain β€” users, projects, billing.

Throughput: database vs container

AspectDatabase throughput (shared)Container throughput (dedicated)
RU/s poolShared across all containers in the databaseDedicated to a single container
Minimum400 RU/s (or 1000 with autoscale)400 RU/s (or 1000 with autoscale)
Best forMany containers with variable, low trafficHigh-traffic containers that need guaranteed capacity
OverrideIndividual containers can opt out with dedicated throughputN/A β€” each container has its own RU/s
Cost efficiencyGreat when most containers are idleBetter when one container dominates traffic

Priya uses shared throughput at the database level for dev/test environments (saves cost), and dedicated container throughput in production for her projects container that handles 80% of traffic.

πŸ’‘ Exam tip: RU/s minimum with shared throughput

When you provision shared throughput, the minimum is 400 RU/s (manual) or 1,000 RU/s (autoscale max). Each container in the shared pool is guaranteed at least 100 RU/s. So if you have 10 containers sharing 1,000 RU/s, each gets at least 100 RU/s β€” the remaining is distributed on demand.

You can also mix: have some containers on the shared pool and others on dedicated throughput within the same database.

Global distribution and SLAs

Cosmos DB was built for global distribution from day one:

SLASingle-regionMulti-region (single-write)Multi-region (multi-write)
Availability99.99%99.99%99.999%
Read latency< 10 ms at p99< 10 ms at p99< 10 ms at p99
Write latency< 10 ms at p99< 10 ms at p99< 10 ms at p99

The 99.999% SLA (five nines) requires multi-region with multi-write enabled. That’s ~5 minutes of downtime per year.

Priya enables multi-region reads across Australia East, UK South, and Brazil South, with writes pinned to Australia East for now. She’ll enable multi-write when her team is ready to handle conflict resolution.

Free tier vs serverless vs provisioned

FeatureFree tierServerlessProvisioned
ThroughputFirst 1,000 RU/s free (then standard pricing)Auto-scales per request, billed per RU consumedYou set RU/s (manual or autoscale)
StorageFirst 25 GB freeUnlimited (within partition limits)Unlimited (within partition limits)
Best forDev/test, learning, small appsBursty, unpredictable workloadsProduction, predictable traffic
Global distributionβœ… Supported❌ Single-region onlyβœ… Supported
Max per subscription1 accountUnlimitedUnlimited
Idle costFree (within limits)Zero β€” pay only when usedMinimum RU/s cost always applies

Ravi’s mistake: Ravi spun up a provisioned account for a weekend hackathon and left 10,000 RU/s running all week. β€œWhy is our Azure bill $400?” Serverless would have cost under $2 for his light traffic.

πŸ’‘ Exam tip: serverless limitations

Serverless is great for dev/test but has real limitations: single-region only (no global distribution), no dedicated throughput guarantees, and each physical partition supports at most 5,000 RU/s. The exam loves testing whether serverless supports multi-region β€” it doesn’t. Note: you can now convert a serverless account to provisioned throughput in-place (NoSQL API).

The Cosmos DB emulator

The emulator runs locally on Windows, macOS (Docker), or Linux (Docker). It provides a local endpoint that behaves like the real service:

Endpoint:  https://localhost:8081
Key:       C2y6yDjf5/R+ob0N8A7Cgv30VRDJIWEHLM+4QDU5DE2nQ9nDuVTqobD4b8mGGyPMbIZnqyMsEcaGQy67XIw/Jw==

That key is the same for every emulator instance β€” it’s a well-known fixed key, not a secret. The exam may test this.

🎬 Video walkthrough

🎬 Video coming soon

Cosmos DB Big Picture β€” DP-420 Module 1

Cosmos DB Big Picture β€” DP-420 Module 1

~14 min

Flashcards

Question

What is the Cosmos DB resource hierarchy from top to bottom?

Click or press Enter to reveal answer

Answer

Account β†’ Database β†’ Container β†’ Item. The account is the billing and endpoint root. Databases are logical namespaces. Containers hold partitioned data. Items are individual JSON documents (max 2 MB).

Click to flip back

Question

What availability SLA does Cosmos DB offer with multi-region multi-write?

Click or press Enter to reveal answer

Answer

99.999% (five nines) β€” approximately 5 minutes of downtime per year. Single-region and multi-region single-write both offer 99.99% (four nines).

Click to flip back

Question

Can serverless Cosmos DB accounts use global distribution?

Click or press Enter to reveal answer

Answer

No. Serverless is single-region only. If you need multi-region replication, you must use provisioned throughput (manual or autoscale).

Click to flip back

Question

What is the well-known emulator key?

Click or press Enter to reveal answer

Answer

C2y6yDjf5/R+ob0N8A7Cgv30VRDJIWEHLM+4QDU5DE2nQ9nDuVTqobD4b8mGGyPMbIZnqyMsEcaGQy67XIw/Jw== β€” it's the same fixed key for every emulator instance. It's not a secret.

Click to flip back

Knowledge check

Knowledge Check

Priya needs to choose a throughput model for NovaSaaS production. Her 'projects' container handles 80% of traffic. Which approach is best?

Knowledge Check

Ravi sets up a Cosmos DB account for a weekend prototype. He expects very light, unpredictable traffic, doesn't need multi-region, and hasn't used a free tier account in this subscription. Which model is cheapest?

Knowledge Check

Which of these is true about the Cosmos DB emulator?


Next up: Design Data Model β€” learn how to think access-pattern-first and design containers that match your queries, not your entity-relationship diagrams.

Next β†’

Designing Your Data Model

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.