πŸ”’ Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided AZ-305 Domain 3
Domain 3 β€” Module 4 of 5 80%
17 of 30 overall

AZ-305 Study Guide

Domain 1: Design Identity, Governance, and Monitoring Solutions

  • Monitoring & Logging Design
  • Choosing Authentication Methods
  • Designing Identity Management
  • Authorizing Access to Resources
  • Secrets, Keys & Certificates
  • Governance at Scale
  • Compliance & Identity Governance

Domain 2: Design Data Storage Solutions

  • Relational Data: Choosing Your SQL Platform
  • Database Performance & Scalability
  • Database Security & Compliance
  • Cosmos DB & Semi-Structured Data
  • Blob, Data Lake & Azure Files
  • Data Integration & Analytics

Domain 3: Design Business Continuity Solutions

  • Recovery Objectives: RPO, RTO & SLA Free
  • Backup & Recovery for Compute Free
  • Backup for Databases & Unstructured Data Free
  • High Availability for Compute Free
  • High Availability for Data Free

Domain 4: Design Infrastructure Solutions

  • Compute Design: VMs & When to Use Them
  • Container Solutions: AKS, ACI & Container Apps
  • Serverless & Batch Processing
  • Messaging Architecture
  • Event-Driven Architecture
  • API Integration & Caching
  • App Configuration & Automated Deployment
  • Migration Strategy & Assessment
  • Executing Migrations
  • Network Connectivity: Internet & Hybrid
  • Network Security & Performance
  • Load Balancing & Routing

AZ-305 Study Guide

Domain 1: Design Identity, Governance, and Monitoring Solutions

  • Monitoring & Logging Design
  • Choosing Authentication Methods
  • Designing Identity Management
  • Authorizing Access to Resources
  • Secrets, Keys & Certificates
  • Governance at Scale
  • Compliance & Identity Governance

Domain 2: Design Data Storage Solutions

  • Relational Data: Choosing Your SQL Platform
  • Database Performance & Scalability
  • Database Security & Compliance
  • Cosmos DB & Semi-Structured Data
  • Blob, Data Lake & Azure Files
  • Data Integration & Analytics

Domain 3: Design Business Continuity Solutions

  • Recovery Objectives: RPO, RTO & SLA Free
  • Backup & Recovery for Compute Free
  • Backup for Databases & Unstructured Data Free
  • High Availability for Compute Free
  • High Availability for Data Free

Domain 4: Design Infrastructure Solutions

  • Compute Design: VMs & When to Use Them
  • Container Solutions: AKS, ACI & Container Apps
  • Serverless & Batch Processing
  • Messaging Architecture
  • Event-Driven Architecture
  • API Integration & Caching
  • App Configuration & Automated Deployment
  • Migration Strategy & Assessment
  • Executing Migrations
  • Network Connectivity: Internet & Hybrid
  • Network Security & Performance
  • Load Balancing & Routing
Domain 3: Design Business Continuity Solutions Free ⏱ ~18 min read

High Availability for Compute

Availability sets, availability zones, region pairs, and VMSS β€” design compute architectures that survive failures at every level, from a single rack to an entire region.

Designing compute high availability

β˜• Simple explanation

High availability means your application keeps running when things break. The question is: what level of failure can you survive?

Rack failure: Availability sets spread VMs across fault domains (different racks/power/network).

Data centre failure: Availability zones spread VMs across physically separate buildings.

Region failure: Multi-region deployment with load balancing/failover.

Each level costs more but protects against bigger disasters.

Compute HA architecture follows a layered protection model:

  • Availability sets: Fault domains (rack isolation) + update domains (rolling updates). SLA: 99.95%
  • Availability zones: Physically separate data centres within a region. SLA: 99.99%
  • Multi-region: Active-active or active-passive across region pairs. SLA: 99.99%+
  • Virtual Machine Scale Sets (VMSS): Auto-scaling groups with built-in HA

Compute HA comparison

Compute High Availability Options
FactorAvailability SetsAvailability ZonesMulti-Region
Protects againstRack/hardware failure, planned maintenanceData centre failureRegion-wide failure
SLA (VMs)99.95%99.99%99.99%+ (architecture dependent)
Latency between instancesSub-millisecond (same data centre)~2ms (same region, different DC)Variable (cross-region, 10-100+ ms)
Data replicationN/A β€” compute onlyNo inherent compute-state replication β€” depends on storage/data layer (ZRS disks, app-level replication)Asynchronous (cross-region, app/storage dependent)
CostNo extra cost (just placement)No extra cost for zonesDouble compute + storage + networking
ComplexityLowLow-mediumHigh
Best forLegacy apps, regions without zonesStandard production workloadsMission-critical, global applications
πŸ’‘ Exam tip: Availability Zones are the default answer for most scenarios

If the exam says β€œhigh availability” without mentioning regional DR, Availability Zones is almost always the correct answer. They provide 99.99% SLA with minimal complexity and no extra cost. Availability Sets (99.95%) are for regions that don’t support zones or legacy configurations. Multi-region is only needed when the scenario explicitly requires regional failure protection.

PaaS high availability

Not just VMs β€” PaaS services have their own HA patterns:

ServiceHA MechanismSLA
App ServiceZone-redundant deployment (3+ instances across zones)99.99%
Azure FunctionsZone-redundant (Premium/Dedicated plan)99.99%
AKSZone-redundant node pools + system pods99.99% (with zones)
Container AppsBuilt-in zone redundancy99.95%
Azure SQLZone-redundant configuration (Business Critical/Hyperscale)99.995%

πŸš€ Marcus’s HA architecture: NovaSaaS uses zone-redundant everything:

  • AKS with node pools spread across 3 zones
  • Azure SQL Business Critical with zone-redundant replicas
  • Azure Cache for Redis with zone redundancy
  • Result: any single data centre can fail without customer impact

Virtual Machine Scale Sets (VMSS)

VMSS provides auto-scaling with built-in HA:

FeatureDescription
Auto-scaleScale out (add VMs) and in (remove VMs) based on metrics or schedule
Zone spreadingAutomatically distributes VMs across availability zones
Rolling upgradesUpdate VMs in batches without downtime
OverprovisioningCreate extra VMs during scale-out, delete extras once target is healthy
Orchestration modesUniform (identical VMs) or Flexible (mixed VM sizes)

πŸ—οΈ Priya’s VMSS design: GlobalTech’s web tier:

  • Flexible orchestration with zone spreading across 3 availability zones
  • Auto-scale rules: scale out at 70% CPU, scale in at 30%, min 6 / max 30 instances
  • Health probes on the load balancer β€” unhealthy VMs auto-replaced
  • Rolling upgrade policy β€” update 20% at a time, wait for health confirmation

Multi-region HA patterns

PatternDescriptionRTOCost
Active-ActiveBoth regions serve traffic simultaneously~0 (seamless)Highest (full capacity both regions)
Active-Passive (hot standby)Secondary region running but not serving trafficMinutesHigh (idle compute in secondary)
Active-Passive (warm standby)Secondary has reduced capacity, scales up on failover10-30 minutesMedium (reduced compute in secondary)
Active-Passive (cold standby)Secondary deployed but VMs stoppedHoursLowest (storage only until failover)

🏦 Elena’s multi-region design: Trading platform uses Active-Active (both regions handle trades). Customer portal uses Active-Passive hot standby (secondary ready to take over in minutes).

Knowledge check

Question

What SLA do Availability Zones provide for VMs?

Click or press Enter to reveal answer

Answer

99.99% (about 4.38 minutes downtime per month). This is higher than Availability Sets (99.95%) because zones are physically separate data centres, protecting against entire data centre failures β€” not just rack failures.

Click to flip back

Question

What's the difference between Active-Active and Active-Passive multi-region?

Click or press Enter to reveal answer

Answer

Active-Active: both regions serve traffic simultaneously β€” zero RTO, highest cost. Active-Passive: secondary region is standby (hot/warm/cold) β€” RTO depends on readiness level. Active-Active requires stateless design or data synchronisation between regions.

Click to flip back

Question

When should you choose Application Gateway over Azure Load Balancer?

Click or press Enter to reveal answer

Answer

Application Gateway operates at Layer 7 (HTTP/HTTPS) β€” use when you need URL-based routing, SSL offloading, cookie-based session affinity, or WAF. Azure Load Balancer operates at Layer 4 (TCP/UDP) β€” use for non-HTTP protocols, ultra-low latency, or zone-redundant load balancing. For web apps, Application Gateway is usually the right choice.

Click to flip back

Knowledge Check

πŸ—οΈ GlobalTech needs their customer-facing web application to survive a data centre failure with 99.99% availability. They want automatic scaling during peak hours. The application is stateless. What should Priya recommend?

Knowledge Check

πŸš€ NovaSaaS needs their API tier to survive a full Azure region outage with automatic failover in under 60 seconds. The APIs are stateless and deployed to two regions. Users connect via custom domain with HTTPS. Which load balancing solution should Marcus use?

🎬 Video coming soon


Next up: Compute is highly available β€” now let’s do the same for data β€” High Availability for Data.

← Previous

Backup for Databases & Unstructured Data

Next β†’

High Availability for Data

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.