Foundry Services: Your AI Toolkit
Microsoft Foundry is a platform of platforms. Learn which service handles what — from generative tasks to grounding, vector search, agent workflows, and multimodal processing.
What is Microsoft Foundry?
Microsoft Foundry is like a workshop with every AI tool you could need — all in one building.
Need to deploy a language model? There’s a workbench for that. Need to search through company documents? There’s a tool for that. Need to build an AI agent that books meetings? There’s a framework for that. Need to extract data from invoices? There’s a machine for that.
Instead of buying tools from 10 different shops, Foundry puts everything under one roof — one portal, one set of SDKs, one billing system.
The Foundry service map
The exam tests whether you know which Foundry service handles which job. Here’s the complete map:
| Task Category | Foundry Service | What It Does |
|---|---|---|
| Generative tasks | Model deployments (OpenAI, Llama, Phi, etc.) | Text generation, chat, code, summarisation |
| Grounding | Azure AI Search + Foundry IQ | Ground model responses in your data |
| Vector search | Azure AI Search (vector index) | Semantic similarity search using embeddings |
| Agent workflows | Foundry Agent Service | Build agents with tools, memory, and orchestration |
| Multimodal processing | Multimodal models + Content Understanding | Process text, images, audio, video together |
| Document extraction | Content Understanding | OCR, layout analysis, structured field extraction |
| Speech | Azure AI Speech | Speech-to-text, text-to-speech, translation |
| Translation | Azure Translator | Text and document translation |
| Evaluation | Foundry Evaluation | Measure quality, safety, groundedness |
Foundry architecture: Projects and resources
Every AI solution in Foundry starts with a Foundry Project. Think of it as your workspace:
| Component | What It Contains |
|---|---|
| Foundry Project | Models, agents, data connections, evaluations, deployments |
| Model deployments | Hosted models you can call via API (pay-per-token or provisioned) |
| Connected resources | Azure AI Search indexes, storage accounts, Key Vault |
| Agent definitions | Agent configurations with tools, instructions, and knowledge |
Classic vs New Foundry architecture
If you’ve used Azure AI Studio before, you’ll remember the hub-and-project model. The “classic” architecture had hubs as parent containers for projects.
The new Foundry architecture (2026) simplifies this — projects are standalone resources. No more hubs. This makes access control, lifecycle management, and governance simpler.
The exam focuses on the new architecture. If a question mentions “hubs,” it’s likely testing whether you know they’re deprecated.
Grounding and Foundry IQ
Grounding means anchoring AI responses in your actual data instead of letting the model hallucinate. Two key services:
| Feature | Azure AI Search | Foundry IQ |
|---|---|---|
| What it is | Full search platform with indexing, ranking, and retrieval | Foundry-native knowledge layer for agents |
| Search types | Keyword, semantic, vector, hybrid | Integrated with agent memory and tools |
| Best for | RAG applications, custom search pipelines | Quick agent knowledge setup without custom indexing |
| Setup effort | Medium — configure index, chunking, embeddings | Low — upload docs, Foundry handles the rest |
| Flexibility | High — full control over indexing and retrieval | Lower — abstracted for simplicity |
Exam tip: When to pick Search vs Foundry IQ
The exam will present scenarios where you need to ground an AI response. Key decision:
- Need full control over indexing, chunking, hybrid search, enrichment? → Azure AI Search
- Quick setup for an agent that needs to search company docs? → Foundry IQ
- Both are correct for RAG — the question is about complexity and control requirements
Agent Service
The Foundry Agent Service is where you build AI agents — systems that don’t just chat, but take actions:
| Agent Feature | What It Provides |
|---|---|
| Model selection | Choose which LLM powers the agent’s reasoning |
| Instructions | System prompt defining the agent’s role, rules, and personality |
| Tools | Functions the agent can call (APIs, search, code execution) |
| Knowledge | Data sources the agent can query (Foundry IQ, Search indexes) |
| Memory | Conversation history and per-thread state |
| Orchestration | Multi-agent coordination with approval controls |
Real-world example: Atlas Financial's compliance agent
Atlas Financial builds a compliance review agent using three Foundry services:
- Agent Service — defines the agent’s role (“Review loan applications against regulatory guidelines”)
- Azure AI Search — indexes 50,000 pages of regulations for grounding
- GPT-4o deployment — provides the reasoning capability
The agent receives a loan application, searches relevant regulations, and produces a compliance assessment with citations. Three services working together.
Content Understanding
Content Understanding is Foundry’s newer multimodal extraction service for processing documents, images, and video. It coexists with Document Intelligence, which remains available for document-specific tasks:
| Capability | What It Does |
|---|---|
| OCR | Reads printed and handwritten text from images and PDFs |
| Layout analysis | Understands document structure (tables, headings, sections) |
| Field extraction | Pulls specific fields (invoice number, date, total) |
| Single-task mode | Optimised pipeline for one extraction type |
| Pro mode | Multi-capability pipeline combining OCR, layout, and extraction |
Exam tip: Content Understanding vs Document Intelligence
These are separate, coexisting services — Content Understanding did NOT replace Document Intelligence. The exam may test when to use which:
- Content Understanding — multimodal extraction (documents + images + audio + video), custom schemas, Markdown output for LLM consumption
- Document Intelligence — document-specific extraction with pre-built models (invoices, receipts, tax forms), high-accuracy structured output
If the scenario involves multimodal content or feeding output to an LLM/agent → Content Understanding. If it’s purely document extraction with pre-built analysers → either service works, but Document Intelligence has more mature pre-built models.
Key terms
Knowledge check
NeuralMed wants to build a patient-facing chatbot that answers questions using their medical knowledge base of 10,000 articles. The team needs full control over how articles are chunked, embedded, and ranked. Which grounding service should they use?
Kai needs to quickly prototype an AI agent that can answer questions about the company's shipping policies. He has 20 PDF documents and wants the fastest setup possible. Which approach should he use?
Which of the following is a Foundry Tool (managed service), NOT a model you deploy?
🎬 Video coming soon