🔒 Guided

Pre-launch preview. Authorised access only.

Incorrect code

Guided by A Guide to Cloud
Explore AB-900 AI-901
Guided AI-901 Domain 1
Domain 1 — Module 10 of 11 91%
10 of 26 overall

AI-901 Study Guide

Domain 1: AI Concepts and Capabilities

  • What is AI? Your First 10 Minutes Free
  • Responsible AI: The Six Principles Free
  • How Generative AI Actually Works Free
  • Choosing the Right AI Model Free
  • Deploying AI Models: Options & Settings
  • AI Workloads at a Glance
  • Text Analysis: Keywords, Entities & Sentiment
  • Speech: Recognition & Synthesis
  • Computer Vision: Seeing the World
  • Image Generation: Creating with AI
  • Information Extraction: From Chaos to Structure

Domain 2: Implement AI Solutions Using Foundry

  • Prompting Fundamentals: System & User Prompts
  • Microsoft Foundry: Your AI Command Center Free
  • Building a Chat App with the Foundry SDK
  • Agents in Foundry: Create & Test
  • Building an Agent Client App
  • Building a Text Analysis App
  • Multimodal: Responding to Speech
  • Azure Speech in Foundry Tools
  • Visual Prompts: Images as Input
  • Generating Images with AI
  • Building a Vision App
  • Content Understanding: Documents & Forms
  • Multimodal Extraction: Images, Audio & Video
  • Building an Extraction App
  • Exam Prep: Putting It All Together

AI-901 Study Guide

Domain 1: AI Concepts and Capabilities

  • What is AI? Your First 10 Minutes Free
  • Responsible AI: The Six Principles Free
  • How Generative AI Actually Works Free
  • Choosing the Right AI Model Free
  • Deploying AI Models: Options & Settings
  • AI Workloads at a Glance
  • Text Analysis: Keywords, Entities & Sentiment
  • Speech: Recognition & Synthesis
  • Computer Vision: Seeing the World
  • Image Generation: Creating with AI
  • Information Extraction: From Chaos to Structure

Domain 2: Implement AI Solutions Using Foundry

  • Prompting Fundamentals: System & User Prompts
  • Microsoft Foundry: Your AI Command Center Free
  • Building a Chat App with the Foundry SDK
  • Agents in Foundry: Create & Test
  • Building an Agent Client App
  • Building a Text Analysis App
  • Multimodal: Responding to Speech
  • Azure Speech in Foundry Tools
  • Visual Prompts: Images as Input
  • Generating Images with AI
  • Building a Vision App
  • Content Understanding: Documents & Forms
  • Multimodal Extraction: Images, Audio & Video
  • Building an Extraction App
  • Exam Prep: Putting It All Together
Domain 1: AI Concepts and Capabilities Premium ⏱ ~10 min read

Image Generation: Creating with AI

AI doesn't just understand images — it can create entirely new ones from text descriptions. Learn how image generation works, when to use it, and what GPT-image can do.

How does AI create images?

☕ Simple explanation

Image generation AI works like a sculptor starting from random noise.

Imagine you start with TV static — pure random dots. The AI model gradually refines that noise, step by step, until a clear image emerges that matches your text description. It’s like watching a photo develop in a darkroom, except the AI is guided by your words.

You type “a fluffy orange cat wearing a tiny top hat, watercolour style” → the model generates a completely new image that’s never existed before.

Modern image generation primarily uses diffusion models. During training, the model learns to add noise to images step by step until they become pure static, then learns to reverse that process — starting from noise and progressively denoising to create a clear image.

At inference time, the model starts with random noise and uses the text prompt as guidance to shape the denoising process toward the desired image. GPT-image-1.5 (available through Azure OpenAI in Foundry) is the primary image generation model in the Azure ecosystem.

Computer vision vs image generation

Understanding images vs creating images
FeatureComputer VisionImage Generation
DirectionImage → UnderstandingText → Image
InputAn existing imageA text description (prompt)
OutputText (labels, descriptions, extracted text)A new image
Example'This image contains a cat and a dog'Creates a new image of 'a cat and a dog playing in a park'
Azure serviceAzure AI Vision (Foundry Tools)GPT-image-1.5 via Azure OpenAI

GPT-image-1.5: Azure’s image generation model

GPT-image-1.5 is OpenAI’s latest image generation model, available in Microsoft Foundry.

Note: The previous model (DALL-E 3) was retired on March 4, 2026. GPT-image-1.5 is the current GA replacement with improved capabilities.

Key capabilities:

  • Generate images from text prompts
  • Edit existing images with text instructions
  • Specify image size and quality settings
  • Generate multiple images per request

Use cases:

ScenarioPrompt Example
Marketing”Professional product photo of a smartwatch on a white background”
Concept art”Futuristic city skyline at sunset, cyberpunk style”
Education”Diagram showing how a solar panel converts sunlight to electricity”
Prototyping”Mobile app mockup for a fitness tracker, dark mode, minimal design”

GreenLeaf scenario: GreenLeaf uses GPT-image to generate visuals for their sustainability reports — illustrations of farming practices, infographics about crop yields, and concept images for new products — all without hiring a designer.

Responsible AI in image generation

Image generation has unique responsible AI considerations:

ConsiderationHow Azure Handles It
Harmful contentContent filters block violent, sexual, or harmful image generation
DeepfakesAzure AI embeds C2PA provenance metadata in generated images to prove AI origin
BiasModels are tested for demographic bias in generated faces and scenarios
CopyrightUsers should not generate images of copyrighted characters or trademarks
TransparencyGenerated images should be labelled as AI-created
ℹ️ What is C2PA?

C2PA (Coalition for Content Provenance and Authenticity) is an open standard for embedding provenance metadata into digital content. When GPT-image generates an image, it includes C2PA metadata that records the content’s origin — that it was AI-generated, by which service, and when.

This supports the transparency responsible AI principle — tools that read C2PA metadata can verify whether content is AI-generated, even if the creator doesn’t visually label it.

🎬 Video walkthrough

🎬 Video coming soon

Image Generation — AI-901 Module 10

Image Generation — AI-901 Module 10

~10 min

Flashcards

Question

How do diffusion models generate images?

Click or press Enter to reveal answer

Answer

They start from random noise and progressively denoise it, guided by the text prompt, until a clear image emerges. During training, the model learned to reverse the process of adding noise to images.

Click to flip back

Question

What is GPT-image-1.5?

Click or press Enter to reveal answer

Answer

OpenAI's image generation model, available in Microsoft Foundry through Azure OpenAI. It creates new images from text prompts and supports various sizes and styles.

Click to flip back

Question

What is C2PA metadata in AI-generated images?

Click or press Enter to reveal answer

Answer

An open provenance standard (Coalition for Content Provenance and Authenticity) embedded in AI-generated images that records their origin — proving they were created by AI. Supports the transparency principle.

Click to flip back

Knowledge Check

Knowledge Check

GreenLeaf wants to create professional illustrations for their annual sustainability report without hiring a graphic designer. Which AI capability should they use?

Knowledge Check

A news organisation uses GPT-image-1.5 to generate images for articles. Which responsible AI concern is most critical for this use case?


Next up: Information Extraction — how AI pulls structured data from chaotic documents, images, audio, and video.

← Previous

Computer Vision: Seeing the World

Next →

Information Extraction: From Chaos to Structure

Guided

I learn, I simplify, I share.

A Guide to Cloud YouTube Feedback

© 2026 Sutheesh. All rights reserved.

Guided is an independent study resource and is not affiliated with, endorsed by, or officially connected to Microsoft. Microsoft, Azure, and related trademarks are property of Microsoft Corporation. Always verify information against Microsoft Learn.