🔒 Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided DP-900 Domain 1
Domain 1 — Module 1 of 7 14%
1 of 27 overall

DP-900 Study Guide

Domain 1: Core Data Concepts

  • Your First Look at Data Free
  • Data File Formats: CSV, JSON, Parquet & More Free
  • Databases: Relational vs Non-Relational Free
  • Transactional Workloads: Keeping Data Consistent Free
  • Analytical Workloads: Finding the Insights Free
  • Data Roles: DBA, Engineer & Analyst Free
  • The Azure Data Landscape Free

Domain 2: Relational Data on Azure

  • Relational Data: Tables, Keys & Relationships
  • Normalization: Why Duplicate Data is Bad
  • SQL Basics: SELECT, INSERT, UPDATE, DELETE
  • Database Objects: Views, Indexes & More
  • Azure SQL: Your Database in the Cloud
  • Open-Source Databases on Azure
  • Choosing the Right Azure Database

Domain 3: Non-Relational Data on Azure

  • Azure Blob Storage: Files in the Cloud
  • Azure Files & Table Storage
  • Azure Cosmos DB: The Global Database
  • Cosmos DB APIs: SQL, MongoDB & More
  • Choosing Non-Relational Storage

Domain 4: Analytics on Azure

  • Data Ingestion & Processing
  • Analytical Data Stores: Data Lakes, Warehouses & Lakehouses
  • Microsoft Fabric & Azure Databricks
  • Batch vs Streaming: Two Speeds of Data
  • Real-Time Analytics on Azure
  • Power BI: See Your Data
  • Data Models in Power BI
  • Choosing the Right Visualization

DP-900 Study Guide

Domain 1: Core Data Concepts

  • Your First Look at Data Free
  • Data File Formats: CSV, JSON, Parquet & More Free
  • Databases: Relational vs Non-Relational Free
  • Transactional Workloads: Keeping Data Consistent Free
  • Analytical Workloads: Finding the Insights Free
  • Data Roles: DBA, Engineer & Analyst Free
  • The Azure Data Landscape Free

Domain 2: Relational Data on Azure

  • Relational Data: Tables, Keys & Relationships
  • Normalization: Why Duplicate Data is Bad
  • SQL Basics: SELECT, INSERT, UPDATE, DELETE
  • Database Objects: Views, Indexes & More
  • Azure SQL: Your Database in the Cloud
  • Open-Source Databases on Azure
  • Choosing the Right Azure Database

Domain 3: Non-Relational Data on Azure

  • Azure Blob Storage: Files in the Cloud
  • Azure Files & Table Storage
  • Azure Cosmos DB: The Global Database
  • Cosmos DB APIs: SQL, MongoDB & More
  • Choosing Non-Relational Storage

Domain 4: Analytics on Azure

  • Data Ingestion & Processing
  • Analytical Data Stores: Data Lakes, Warehouses & Lakehouses
  • Microsoft Fabric & Azure Databricks
  • Batch vs Streaming: Two Speeds of Data
  • Real-Time Analytics on Azure
  • Power BI: See Your Data
  • Data Models in Power BI
  • Choosing the Right Visualization
Domain 1: Core Data Concepts Free ⏱ ~10 min read

Your First Look at Data

Data comes in three flavours — structured, semi-structured, and unstructured. Understanding the difference is the first step to working with data in the cloud.

What is data?

☕ Simple explanation

Data is just information written down so a computer can use it.

Think of a supermarket. The price sticker on a can of beans is data. The photo on the label is data. The handwritten note from your flatmate saying “we need more beans” is also data.

All three are information — but they’re organised very differently. That difference matters because it changes how computers store, search, and analyse them.

Data is a collection of facts, observations, or measurements recorded in a way that can be processed by computer systems. In cloud computing, how data is structured determines which storage services, query languages, and processing tools you use.

Data falls into three broad categories based on its level of organisation: structured (rigid schema, rows and columns), semi-structured (flexible schema, self-describing tags), and unstructured (no predefined schema). Each category has different storage requirements and query capabilities.

The three types of data

Structured data

Structured data fits neatly into rows and columns — like a spreadsheet or a database table. Every row follows the same pattern, and every column has a defined type (text, number, date).

Aisha’s example: The campus food ordering app stores orders in a table:

OrderIDStudentItemPriceDate
1001AishaChicken wrap8.502026-04-20
1002LiamCoffee4.002026-04-20

Every order has the same fields. You can sort by price, filter by date, or count orders per student — because the structure is predictable.

Key features:

  • Fixed schema (the column names and types are defined in advance)
  • Stored in relational databases or tabular files (CSV, Excel)
  • Easy to query using languages like SQL
  • Works best when every record has the same fields

Semi-structured data

Semi-structured data has some organisation, but it’s flexible. Not every record needs the same fields. The data describes itself using tags, keys, or markers.

Aisha’s example: Her app also stores user profiles, but some students fill in more details than others:

// Student 1 — minimal profile
"name": "Aisha", "email": "aisha@uni.ac.nz"

// Student 2 — detailed profile
"name": "Liam", "email": "liam@uni.ac.nz",
"phone": "021-555-0199", "dietary": "vegetarian"

Liam has extra fields that Aisha doesn’t. That’s fine — the data is flexible. Each record carries its own labels (keys like “name”, “dietary”), so the system knows what each value means.

Key features:

  • Flexible schema — records can have different fields
  • Self-describing — tags or keys label each value
  • Common formats: JSON, XML, YAML
  • Stored in document databases (like Azure Cosmos DB) or NoSQL stores

Unstructured data

Unstructured data has no predefined format. It’s the photos, videos, audio files, PDFs, and emails that don’t fit into rows or columns.

Aisha’s example: Students upload photos of their meals for reviews. Each photo is a blob of pixels — no columns, no keys, no rows. A computer can store it and display it, but it can’t “query” the photo the way it queries a table.

Key features:

  • No schema — no rows, columns, or tags
  • Examples: images, videos, audio, PDFs, Word documents
  • Stored in file storage or blob storage (like Azure Blob Storage)
  • Requires specialised tools (AI, search engines) to analyse content
Structured vs semi-structured vs unstructured data
FeatureStructuredSemi-StructuredUnstructured
SchemaFixed (defined upfront)Flexible (self-describing)None
FormatTables (rows & columns)JSON, XML, YAMLImages, video, audio, PDFs
Queryable?Yes — SQL queriesYes — with the right toolsNot directly — needs AI or search
ExampleOrder table in a databaseUser profile in JSONPhoto uploaded by a customer
Azure storageAzure SQL DatabaseAzure Cosmos DBAzure Blob Storage
💡 Exam tip: how the exam tests this

The exam loves giving you a scenario and asking “what type of data is this?” Look for these clues:

  • Fixed columns, every row the same → Structured
  • Tags or keys, flexible fields → Semi-structured
  • Files with no internal organisation → Unstructured

A tricky pattern: “an email” can be semi-structured (it has To, From, Subject fields) OR unstructured (the body text). Similarly, log files can be semi-structured (if they have timestamps and severity levels in a consistent format) or unstructured (if they’re free-form text). The exam usually clarifies by asking about a specific aspect.

Why does it matter?

The type of data determines:

  1. Where you store it — relational database? Document database? Blob storage?
  2. How you query it — SQL? API calls? AI-powered search?
  3. How you process it — direct queries? ETL pipelines? Machine learning?

Choosing the wrong storage for your data type leads to wasted money, slow performance, or lost information. The rest of this course teaches you how to make those choices on Azure.

ℹ️ Real-world mix: most systems use all three

Real applications almost always combine all three types. Take Priya’s FreshMart grocery chain:

  • Structured: Sales transactions (item, price, quantity, date, store)
  • Semi-structured: Product metadata from suppliers (JSON files with varying attributes per category)
  • Unstructured: Security camera footage, customer feedback emails, scanned invoices

A complete data solution handles all three — that’s why Azure offers different services for each.

Meet the characters

Throughout this course, you’ll follow four people working with data:

CharacterWho They AreData Scenarios
📈 Priya SharmaSenior Data Analyst at FreshMart Groceries (50 stores)Analysing sales trends, building dashboards, Power BI reports
💼 Tom ChenBusiness Ops Manager at Pacific Freight (logistics, 200 drivers)Tracking deliveries, querying databases, optimising routes
🎓 Aisha MohammedCS student at Auckland University (building a campus food app)Learning data basics, choosing storage, first database decisions
🏢 Jake BennettSolo DBA at CloudPulse (SaaS startup, 15 developers)Managing production databases, scaling storage, choosing Azure services

Flashcards

Question

What are the three types of data?

Click or press Enter to reveal answer

Answer

Structured (rows & columns, fixed schema), semi-structured (flexible schema, self-describing tags like JSON), and unstructured (no schema — images, videos, PDFs).

Click to flip back

Question

What makes semi-structured data 'self-describing'?

Click or press Enter to reveal answer

Answer

Each value is paired with a label (key or tag), so the data itself explains what each field means — like JSON keys or XML tags. No separate schema definition is needed.

Click to flip back

Question

Give one example each of structured, semi-structured, and unstructured data.

Click or press Enter to reveal answer

Answer

Structured: a sales table in a database. Semi-structured: a JSON product catalogue. Unstructured: a customer photo uploaded to a website.

Click to flip back

Knowledge check

Knowledge Check

Aisha's food ordering app stores meal reviews as free-text comments with no fixed format. What type of data is this?

Knowledge Check

Tom's logistics company receives shipment data from partners in JSON format. Each partner includes different fields — some include weight, others include dimensions. What type of data is this?

Knowledge Check

Which of the following is the BEST description of structured data?

🎬 Video coming soon

Next up: Data File Formats: CSV, JSON, Parquet & More — learn how data is physically stored in files.

Next →

Data File Formats: CSV, JSON, Parquet & More

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.