🔒 Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided AZ-900 Domain 1
Domain 1 — Module 4 of 7 57%
4 of 26 overall

AZ-900 Study Guide

Domain 1: Describe Cloud Concepts

  • What is Cloud Computing? Free
  • Cloud Models: Public, Private, and Hybrid Free
  • Cloud Pricing: Consumption, Serverless, and Pay-as-You-Go Free
  • High Availability and Scalability Free
  • Reliability, Security, and Manageability Free
  • IaaS, PaaS, and SaaS Explained Free
  • Choosing the Right Cloud Service Free

Domain 2: Describe Azure Architecture and Services

  • Azure Regions, Zones, and Datacenters
  • Resources, Resource Groups, and Subscriptions
  • Azure Virtual Machines
  • Azure Compute: Containers, Functions, and App Service
  • Azure Networking: VNets, Subnets, and Peering
  • Connecting to Azure: VPNs, ExpressRoute, and DNS
  • Azure Storage Services
  • Data Migration: Moving to Azure
  • Microsoft Entra ID: Your Identity Hub
  • Authentication and External Identities
  • Azure Security: RBAC, Zero Trust, and Defender

Domain 3: Describe Azure Management and Governance

  • What Affects Your Azure Bill
  • Pricing Calculators: TCO and Azure Pricing
  • Cost Management and Tags
  • Azure Governance: Purview, Policy, and Locks
  • The Azure Portal and Cloud Shell
  • Infrastructure as Code: ARM, Bicep, and Arc
  • Azure Advisor and Service Health
  • Azure Monitor: Logs, Alerts, and Insights

AZ-900 Study Guide

Domain 1: Describe Cloud Concepts

  • What is Cloud Computing? Free
  • Cloud Models: Public, Private, and Hybrid Free
  • Cloud Pricing: Consumption, Serverless, and Pay-as-You-Go Free
  • High Availability and Scalability Free
  • Reliability, Security, and Manageability Free
  • IaaS, PaaS, and SaaS Explained Free
  • Choosing the Right Cloud Service Free

Domain 2: Describe Azure Architecture and Services

  • Azure Regions, Zones, and Datacenters
  • Resources, Resource Groups, and Subscriptions
  • Azure Virtual Machines
  • Azure Compute: Containers, Functions, and App Service
  • Azure Networking: VNets, Subnets, and Peering
  • Connecting to Azure: VPNs, ExpressRoute, and DNS
  • Azure Storage Services
  • Data Migration: Moving to Azure
  • Microsoft Entra ID: Your Identity Hub
  • Authentication and External Identities
  • Azure Security: RBAC, Zero Trust, and Defender

Domain 3: Describe Azure Management and Governance

  • What Affects Your Azure Bill
  • Pricing Calculators: TCO and Azure Pricing
  • Cost Management and Tags
  • Azure Governance: Purview, Policy, and Locks
  • The Azure Portal and Cloud Shell
  • Infrastructure as Code: ARM, Bicep, and Arc
  • Azure Advisor and Service Health
  • Azure Monitor: Logs, Alerts, and Insights
Domain 1: Describe Cloud Concepts Free ⏱ ~11 min read

High Availability and Scalability

Two of the most important cloud benefits — keeping your apps running when things fail, and growing your resources to match demand. Here's how Azure delivers both.

Why do availability and scalability matter?

☕ Simple explanation

Imagine your favourite coffee shop.

High availability = the shop is open every day, even if one barista calls in sick. There’s always someone to make your coffee because they have backup staff.

Scalability = during the morning rush, the shop opens extra registers and calls in more baristas. When it’s quiet in the afternoon, they close the extra registers. They match capacity to demand.

In cloud computing, your apps need to be “always open” (available) and able to “call in more baristas” (scale) when traffic spikes.

High availability (HA) refers to a system’s ability to remain operational and accessible, even when components fail. Cloud providers achieve HA through redundancy — running multiple copies of your application across different servers, racks, and even data centres. If one component fails, traffic is routed to a healthy copy.

Scalability is the ability to adjust resources to meet demand. In cloud computing, scalability comes in two forms: vertical scaling (making a single resource bigger) and horizontal scaling (adding more instances of a resource). Both can happen rapidly in the cloud, whereas on-premises scaling requires purchasing and installing new hardware.

High availability — keeping things running

When Peak Roasters launches their online ordering system, they can’t afford downtime. Every minute the ordering page is down, they lose sales.

High availability means: even if a server crashes, the app keeps running because it’s deployed across multiple servers (or even multiple data centres).

How Azure delivers high availability

MechanismWhat It DoesExample
RedundancyMultiple copies of your app across servers3 VMs behind a load balancer
Load balancingDistributes traffic across healthy instancesAzure Load Balancer
Availability zonesSeparate physical locations within a regionZone 1, Zone 2, Zone 3
Region pairsAzure matches regions for disaster recoveryAustralia East + Australia Southeast
Auto-restartFailed VMs automatically restart on healthy hardwareAzure fabric controller

SLAs — measuring availability

Azure measures availability using Service Level Agreements (SLAs) — guarantees expressed as uptime percentages:

SLADowntime Per YearDowntime Per Month
99%3.65 days7.3 hours
99.9%8.76 hours43.8 minutes
99.95%4.38 hours21.9 minutes
99.99%52.6 minutes4.38 minutes

Key exam concept: Higher SLAs require more redundancy, which costs more. A single VM might offer 99.9% SLA. Two VMs in an availability set might offer 99.95%. Two VMs across availability zones might offer 99.99%.

💡 Exam tip: The nines matter

The exam may ask about SLA percentages and what they translate to in real downtime. Key numbers to remember:

  • 99.9% (three nines) = about 8.76 hours of downtime per year
  • 99.99% (four nines) = about 52.6 minutes of downtime per year

When you combine services, the combined SLA is lower than the individual SLAs. If Service A has 99.9% and Service B has 99.9%, together they offer 99.9% x 99.9% = 99.8% uptime.

Scalability — matching resources to demand

Scalability means you can add (or remove) resources based on demand. There are two types:

Vertical vs horizontal scaling
FeatureVertical Scaling (Scale Up/Down)Horizontal Scaling (Scale Out/In)
What changesSize of a single resourceNumber of resource instances
ExampleUpgrade a VM from 2 CPU/4 GB to 8 CPU/32 GBGo from 1 VM to 5 VMs behind a load balancer
AnalogyReplacing a small truck with a bigger truckAdding more trucks to the fleet
LimitMax hardware size of the machineVirtually unlimited
DowntimeUsually requires a restartNo downtime — new instances are added live
Best forDatabases, single-instance appsWeb apps, APIs, stateless services

Scaling in action: Summit Construction

Summit Construction’s project portal normally handles 50 users. But during quarterly reviews, 500 project managers log in simultaneously.

Without cloud: They’d need to buy servers capable of handling 500 users — even though they only need that capacity 4 times a year. Those servers sit idle 95% of the time.

With Azure: The portal runs on 2 VMs normally. During quarterly reviews, it automatically scales out to 10 VMs. After the review, it scales back to 2. They only pay for the extra VMs during those peak periods.

Elasticity — automatic scaling

Elasticity is a specific type of scalability where resources automatically increase and decrease based on demand — without human intervention.

ConceptDefinition
ScalabilityThe system can handle more load by adding resources
ElasticityThe system automatically adds and removes resources based on actual demand

Think of a rubber band: it stretches when pulled and snaps back when released. An elastic cloud system stretches with traffic spikes and contracts when traffic drops.

Azure services that provide elasticity:

  • Virtual Machine Scale Sets — automatically add/remove VMs based on CPU, memory, or custom metrics
  • Azure App Service — auto-scale web apps based on request count or schedule
  • Azure Functions — scale from zero to thousands of instances automatically
ℹ️ Real-world: Harbour Health during flu season

Harbour Health’s patient portal sees 10x normal traffic during flu season. With Azure’s elasticity:

  • Normal: 3 VMs, handling 500 concurrent users
  • Flu season peak: Auto-scales to 15 VMs, handling 5,000 concurrent users
  • After flu season: Automatically scales back to 3 VMs

Total extra cost: only the additional VMs during the 6-week peak period, not year-round.

🎬 Video walkthrough

🎬 Video coming soon

High Availability and Scalability — AZ-900

High Availability and Scalability — AZ-900

~9 min

Flashcards

Question

What is high availability in cloud computing?

Click or press Enter to reveal answer

Answer

The ability of a system to remain operational and accessible even when components fail. Achieved through redundancy, load balancing, availability zones, and region pairs.

Click to flip back

Question

What is the difference between vertical scaling and horizontal scaling?

Click or press Enter to reveal answer

Answer

Vertical scaling (scale up) = making a single resource bigger (more CPU, RAM). Horizontal scaling (scale out) = adding more instances of a resource. Horizontal is preferred for cloud apps because it's virtually unlimited and requires no downtime.

Click to flip back

Question

What is the difference between scalability and elasticity?

Click or press Enter to reveal answer

Answer

Scalability means the system CAN handle more load. Elasticity means the system AUTOMATICALLY adjusts resources based on demand — scaling up during peaks and down during quiet periods.

Click to flip back

Question

If Service A has a 99.9% SLA and Service B has a 99.9% SLA, what is the combined SLA?

Click or press Enter to reveal answer

Answer

99.8% — calculated by multiplying: 0.999 x 0.999 = 0.998. The combined SLA is always LOWER than the individual SLAs.

Click to flip back

Knowledge Check

Knowledge Check

Summit Construction's project portal normally serves 50 users but gets 500 users during quarterly reviews. Which scaling approach is MOST appropriate?

Knowledge Check

An Azure VM has a 99.9% SLA. What does this mean in practical terms?

Knowledge Check

Which cloud characteristic allows resources to automatically increase and decrease based on demand without manual intervention?


Next up: More cloud benefits — reliability, predictability, security, governance, and manageability in the cloud.

← Previous

Cloud Pricing: Consumption, Serverless, and Pay-as-You-Go

Next →

Reliability, Security, and Manageability

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.