🔒 Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided PL-300 Domain 1
Domain 1 — Module 3 of 7 43%
3 of 26 overall

PL-300 Study Guide

Domain 1: Prepare the Data

  • Connecting to Data Sources Free
  • Storage Modes: Import, DirectQuery & DirectLake Free
  • Data Profiling & Quality Free
  • Power Query Transforms Free
  • Combining Data: Merge, Append & Queries
  • Fact Tables, Dimension Tables & Keys
  • Data Loading & Query Optimisation

Domain 2: Model the Data

  • Star Schema & Relationships
  • Date Tables & Table Properties
  • Columns vs Measures: When to Use Which
  • DAX Fundamentals Free
  • CALCULATE & Filter Context
  • Time Intelligence & Calculation Groups
  • Model Performance Optimisation

Domain 3: Visualize and Analyze the Data

  • Choosing & Formatting Visuals
  • Slicers, Filters & Interactions
  • Bookmarks, Tooltips & Navigation
  • Copilot in Power BI Reports
  • Report Pages & Paginated Reports
  • Mobile, Accessibility & Personalisation
  • AI Visuals & Pattern Discovery

Domain 4: Manage and Secure Power BI

  • Workspaces & Distribution
  • Dashboards, Alerts & Subscriptions
  • Gateways & Scheduled Refresh
  • Row-Level Security & Access Control
  • Sensitivity Labels & Governance

PL-300 Study Guide

Domain 1: Prepare the Data

  • Connecting to Data Sources Free
  • Storage Modes: Import, DirectQuery & DirectLake Free
  • Data Profiling & Quality Free
  • Power Query Transforms Free
  • Combining Data: Merge, Append & Queries
  • Fact Tables, Dimension Tables & Keys
  • Data Loading & Query Optimisation

Domain 2: Model the Data

  • Star Schema & Relationships
  • Date Tables & Table Properties
  • Columns vs Measures: When to Use Which
  • DAX Fundamentals Free
  • CALCULATE & Filter Context
  • Time Intelligence & Calculation Groups
  • Model Performance Optimisation

Domain 3: Visualize and Analyze the Data

  • Choosing & Formatting Visuals
  • Slicers, Filters & Interactions
  • Bookmarks, Tooltips & Navigation
  • Copilot in Power BI Reports
  • Report Pages & Paginated Reports
  • Mobile, Accessibility & Personalisation
  • AI Visuals & Pattern Discovery

Domain 4: Manage and Secure Power BI

  • Workspaces & Distribution
  • Dashboards, Alerts & Subscriptions
  • Gateways & Scheduled Refresh
  • Row-Level Security & Access Control
  • Sensitivity Labels & Governance
Domain 1: Prepare the Data Free ⏱ ~11 min read

Data Profiling & Quality

Learn to evaluate data quality using Power Query's profiling tools — column distribution, column quality, column profile — and how to resolve nulls, errors, and inconsistencies.

Trust your data before you build on it

☕ Simple explanation

Imagine buying a used car. Before you drive it, you check the engine, tyres, and mileage. You don’t just trust the seller — you verify.

Data profiling is the same idea. Before you build dashboards and DAX measures on your data, you check it for problems: missing values, duplicates, wrong data types, unexpected patterns. Power Query gives you three built-in tools to do this.

Data profiling in Power Query Editor examines your data to identify quality issues before they corrupt your model. Power Query provides three profiling views — Column Quality, Column Distribution, and Column Profile — accessible from the View tab.

By default, profiling analyses only the first 1,000 rows. For a full picture, change this to “Column profiling based on entire dataset” in the status bar. This is critical — a column might look clean in the first 1,000 rows but have errors deeper in.

The three profiling views

Power Query Editor offers three profiling tools under View tab. Each reveals different problems:

Use all three views together for a complete picture of your data quality
ToolWhat It ShowsWhat to Look For
Column QualityPercentage of Valid, Error, and Empty values per columnAny column with errors or high empty percentage needs attention
Column DistributionCount of distinct values and unique values per columnA primary key should have 100% unique values; low distinct count may indicate a category column
Column ProfileDetailed statistics: min, max, average, standard deviation, value distribution histogramOutliers, unexpected ranges, skewed distributions
💡 Exam tip: first 1,000 rows vs entire dataset

This is a classic exam trick. By default, Power Query profiles only the first 1,000 rows. If your dataset has 500,000 rows, you might miss errors that appear after row 1,000.

Always change to “Column profiling based on entire dataset” (click the link in the Power Query Editor status bar) when profiling production data. The exam will test whether you know this default and how to change it.

Spotting and fixing data quality issues

Nadia Okafor at Prism Agency (📊) loads campaign data from 5 different advertising platforms. Each platform formats data differently. Here’s what she finds using profiling:

Nulls and empty values

Column Quality shows that the CampaignEndDate column has 15% empty values. These are campaigns that are still running — they don’t have an end date yet.

Fix options:

  • Replace nulls with a meaningful value (e.g., “Ongoing”) if the column is text
  • Fill down/up to inherit values from adjacent rows
  • Remove rows if null records aren’t needed for analysis
  • Keep nulls if “no value” is meaningful (e.g., active campaigns)

Data type mismatches

The Spend column was imported as Text because one platform formats currency as “$1,234.56”. Power Query can’t do math on text.

Fix: Remove the dollar sign using Replace Values, then change the column type to Decimal Number. If some rows still fail conversion, use Replace Errors to handle them.

Inconsistent values

The Platform column has “Facebook”, “facebook”, “FACEBOOK”, and “Meta Ads” — all meaning the same thing.

Fix: Use Replace Values to standardise, or use Conditional Column to map variations to a single value.

Import errors

Riley at Coastal Fresh (🛒) gets an error importing an Excel file because a column has mixed data types (some cells are numbers, others are text). Power BI imports it as “Error” for the mismatched rows.

Fix options:

  • Change the column type in Power Query — rows that can’t convert become Error values (then use Replace Errors to handle them)
  • Fix the source data in Excel
  • Use Replace Errors to substitute a default value
Real-world: Dr. Ethan's patient data audit

Dr. Ethan Moss at Bayview Medical Centre (🏥) profiles a patient records extract before building a dashboard. Column Distribution reveals that PatientID has 50,000 distinct values but only 49,800 unique values — meaning 200 IDs appear more than once. This signals duplicate records that would inflate patient counts. He uses Remove Duplicates on the PatientID column to fix it.

Common data quality actions in Power Query

ProblemPower Query FixWhere to Find It
Null valuesReplace Values, Fill Down/Up, Remove RowsRight-click column → Replace Values
Wrong data typeChange TypeColumn header icon or Transform tab
DuplicatesRemove DuplicatesRight-click column → Remove Duplicates
Extra whitespaceTrimTransform → Format → Trim
Mixed caseCapitalize Each Word, UPPER, lowerTransform → Format
Errors in columnReplace Errors, Remove ErrorsRight-click column
Inconsistent valuesReplace Values, Conditional ColumnRight-click or Add Column tab

Knowledge check

Question

By default, how many rows does Power Query profile?

Click or press Enter to reveal answer

Answer

Only the first 1,000 rows. Change to 'Column profiling based on entire dataset' in the status bar for a complete picture.

Click to flip back

Question

What does Column Distribution show?

Click or press Enter to reveal answer

Answer

The count of distinct values (total different values) and unique values (values that appear exactly once) per column. Useful for identifying primary keys and categories.

Click to flip back

Question

What's the difference between Column Quality and Column Profile?

Click or press Enter to reveal answer

Answer

Column Quality shows the percentage of Valid, Error, and Empty values. Column Profile shows detailed statistics: min, max, average, standard deviation, and a value distribution histogram.

Click to flip back

Knowledge Check

Nadia profiles a campaign dataset and notices that Column Quality shows 5% errors in the 'Spend' column. The errors occur because some values contain a dollar sign ('$1,234'). What's the best fix?

Knowledge Check

Which Power Query profiling view would help Riley identify that her 'ProductID' column has duplicate values?

🎬 Video coming soon

Next up: Power Query Transforms — reshape your data with columns, groups, pivots, and more.

← Previous

Storage Modes: Import, DirectQuery & DirectLake

Next →

Power Query Transforms

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.