πŸ”’ Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided AZ-305 Domain 3
Domain 3 β€” Module 5 of 5 100%
18 of 30 overall

AZ-305 Study Guide

Domain 1: Design Identity, Governance, and Monitoring Solutions

  • Monitoring & Logging Design
  • Choosing Authentication Methods
  • Designing Identity Management
  • Authorizing Access to Resources
  • Secrets, Keys & Certificates
  • Governance at Scale
  • Compliance & Identity Governance

Domain 2: Design Data Storage Solutions

  • Relational Data: Choosing Your SQL Platform
  • Database Performance & Scalability
  • Database Security & Compliance
  • Cosmos DB & Semi-Structured Data
  • Blob, Data Lake & Azure Files
  • Data Integration & Analytics

Domain 3: Design Business Continuity Solutions

  • Recovery Objectives: RPO, RTO & SLA Free
  • Backup & Recovery for Compute Free
  • Backup for Databases & Unstructured Data Free
  • High Availability for Compute Free
  • High Availability for Data Free

Domain 4: Design Infrastructure Solutions

  • Compute Design: VMs & When to Use Them
  • Container Solutions: AKS, ACI & Container Apps
  • Serverless & Batch Processing
  • Messaging Architecture
  • Event-Driven Architecture
  • API Integration & Caching
  • App Configuration & Automated Deployment
  • Migration Strategy & Assessment
  • Executing Migrations
  • Network Connectivity: Internet & Hybrid
  • Network Security & Performance
  • Load Balancing & Routing

AZ-305 Study Guide

Domain 1: Design Identity, Governance, and Monitoring Solutions

  • Monitoring & Logging Design
  • Choosing Authentication Methods
  • Designing Identity Management
  • Authorizing Access to Resources
  • Secrets, Keys & Certificates
  • Governance at Scale
  • Compliance & Identity Governance

Domain 2: Design Data Storage Solutions

  • Relational Data: Choosing Your SQL Platform
  • Database Performance & Scalability
  • Database Security & Compliance
  • Cosmos DB & Semi-Structured Data
  • Blob, Data Lake & Azure Files
  • Data Integration & Analytics

Domain 3: Design Business Continuity Solutions

  • Recovery Objectives: RPO, RTO & SLA Free
  • Backup & Recovery for Compute Free
  • Backup for Databases & Unstructured Data Free
  • High Availability for Compute Free
  • High Availability for Data Free

Domain 4: Design Infrastructure Solutions

  • Compute Design: VMs & When to Use Them
  • Container Solutions: AKS, ACI & Container Apps
  • Serverless & Batch Processing
  • Messaging Architecture
  • Event-Driven Architecture
  • API Integration & Caching
  • App Configuration & Automated Deployment
  • Migration Strategy & Assessment
  • Executing Migrations
  • Network Connectivity: Internet & Hybrid
  • Network Security & Performance
  • Load Balancing & Routing
Domain 3: Design Business Continuity Solutions Free ⏱ ~18 min read

High Availability for Data

SQL Always On, failover groups, Cosmos DB multi-region, and geo-redundant storage β€” design data architectures that survive failures without losing a single transaction.

Designing data high availability

β˜• Simple explanation

Compute HA keeps your app running. Data HA keeps your data accessible. They’re separate concerns β€” your VMs might survive a failure, but if the database is down, the app is useless.

Key patterns: Zone-redundant databases (survive data centre failure), failover groups (automatic regional failover for SQL), Cosmos DB multi-region writes (globally distributed, always available), and geo-redundant storage (data replicated to paired region).

Data HA architecture covers three data categories:

  • Relational (Azure SQL): Zone-redundant configuration, active geo-replication, auto-failover groups
  • Semi-structured (Cosmos DB): Multi-region writes, automatic failover, consistency vs availability tradeoffs
  • Unstructured (Storage): ZRS, GRS, GZRS, RA-GZRS for tiered durability and availability

Relational data HA

Azure SQL HA patterns

Azure SQL High Availability Options
OptionScopeFailoverRPOBest For
Zone-redundant configWithin-region (across zones)Automatic0 (synchronous)Data centre failure protection β€” standard HA
Active geo-replicationCross-regionManual~5 seconds (async)Read offloading + manual DR
Auto-failover groupsCross-regionAutomatic~5 seconds (async)Automatic regional DR with endpoint redirection

Auto-failover groups deep dive

Auto-failover groups are the recommended pattern for regional SQL DR:

FeatureDetail
Automatic failoverDetects region outage and fails over within the grace period
Grace periodConfigurable (default 1 hour) β€” prevents false positives
Read-write endpoint<group-name>.database.windows.net β€” always points to primary
Read-only endpoint<group-name>.secondary.database.windows.net β€” always points to secondary
Application impactConnection strings don’t change β€” endpoints redirect automatically

🏦 Elena’s SQL HA design:

  • Zone-redundant Business Critical for within-region HA (survives data centre failure)
  • Auto-failover group to the paired region (survives regional outage)
  • Grace period: 30 minutes β€” short enough for fast failover, long enough to avoid false triggers
  • Application uses the failover group endpoint β€” no connection string changes during failover
πŸ’‘ Exam tip: Auto-failover groups vs active geo-replication

Both replicate across regions, but auto-failover groups are preferred because:

  • Automatic failover (geo-replication requires manual or app-level failover)
  • Endpoint redirection (app connection strings don’t change)
  • Group multiple databases (failover all databases together, not one at a time)

Choose active geo-replication only when you need more than 1 secondary, or need secondaries in regions other than the paired region (failover groups can target any region, but paired regions are recommended for operational reasons).

Cosmos DB multi-region HA

Cosmos DB is built for global distribution:

ConfigurationWrite RegionsRead RegionsConsistencySLA
Single-region11Any99.99%
Multi-region (single write)11-30+Any99.999% (reads)
Multi-region (multi-write)2+All write regionsSession, Consistent Prefix, Eventual99.999% (reads + writes)

πŸš€ Marcus’s Cosmos DB HA: NovaSaaS operates globally:

  • Multi-region writes in 3 regions (US East, West Europe, Southeast Asia)
  • Session consistency β€” each user sees their own writes
  • Automatic failover enabled β€” if a region goes down, Cosmos DB promotes another region
  • Conflict resolution: Last-writer-wins for most containers, custom merge for shopping carts
ℹ️ Design decision: Multi-write tradeoffs

Multi-region writes give the highest availability (99.999% write SLA) but:

  • Limits consistency: Strong and Bounded Staleness are NOT available with multi-write
  • Requires conflict resolution: Concurrent writes to same item in different regions need a resolution policy
  • Higher cost: RU charges in every write region

Recommendation: Use multi-write for global apps where latency matters and eventual/session consistency is acceptable. Use single-write + multi-read for apps needing stronger consistency.

Storage HA

RedundancyWithin-Region HACross-Region DRRead from Secondary
LRSNo (single DC)NoNo
ZRSYes (3 zones)NoNo
GRSNo (single DC)Yes (paired region)No (failover required)
GZRSYes (3 zones)Yes (paired region)No (failover required)
RA-GRSNo (single DC)Yes (paired region)Yes (read-only secondary)
RA-GZRSYes (3 zones)Yes (paired region)Yes (read-only secondary)

πŸ—οΈ Priya’s storage HA: GlobalTech uses RA-GZRS for critical data:

  • Zone redundancy in primary region β€” survives data centre failure
  • Geo-redundancy β€” data replicated to paired region
  • Read-access secondary β€” applications can read from secondary endpoint for resilience

Knowledge check

Question

What's the advantage of SQL auto-failover groups over active geo-replication?

Click or press Enter to reveal answer

Answer

Three advantages: (1) Automatic failover detection and promotion, (2) Endpoint redirection β€” applications use a single connection string that always points to the active primary, (3) Group multiple databases β€” failover all databases together as a unit.

Click to flip back

Question

What consistency levels are available with Cosmos DB multi-region writes?

Click or press Enter to reveal answer

Answer

Session, Consistent Prefix, and Eventual. Strong and Bounded Staleness are NOT available with multi-write because synchronous cross-region consistency would negate the low-latency benefit of local writes.

Click to flip back

Question

What does RA-GZRS provide that GZRS doesn't?

Click or press Enter to reveal answer

Answer

Read access to the secondary region endpoint WITHOUT requiring a failover. RA-GZRS = Zone-redundant (primary) + Geo-redundant (secondary) + Read Access (secondary). Applications can read from the secondary for resilience even when the primary is healthy.

Click to flip back

Knowledge Check

🏦 Elena needs Azure SQL to automatically fail over to a secondary region if the primary region has an outage. The application should not need connection string changes. Multiple databases must fail over together. Which feature should she recommend?

Knowledge Check

πŸš€ NovaSaaS needs Cosmos DB to be writable from 3 regions simultaneously for low-latency global access. They can accept session-level consistency. What configuration should Marcus recommend?

🎬 Video coming soon


Domain 3 complete! You’ve designed recovery objectives, backup strategies, and high availability for compute and data.

Next up: Now let’s design the infrastructure itself β€” Compute Design: VMs & When to Use Them.

← Previous

High Availability for Compute

Next β†’

Compute Design: VMs & When to Use Them

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.