Challenges of Generative AI: Fabrications, Bias & Reliability
Generative AI is powerful but not perfect. Leaders need to understand hallucinations, bias, reliability limitations, and how to mitigate these risks before deploying AI at scale.
Why generative AI isn’t magic
Generative AI is like a very confident intern — brilliant, fast, but sometimes wrong and never admits it.
It doesn’t actually “know” things. It predicts what text should come next based on patterns it learned during training. Most of the time, this produces great results. But sometimes it confidently states things that are completely wrong — and it does it with the same tone as when it’s right.
As a leader, you don’t need to fear these challenges — you need to understand them so you can deploy AI responsibly and set the right expectations.
The three big challenges
1. Fabrications (hallucinations)
The AI generates content that sounds correct but is factually wrong. It doesn’t “lie” — it’s predicting likely text sequences, and sometimes those predictions land on fiction.
| What Happens | Real-World Example | Risk Level |
|---|---|---|
| Invents facts | AI claims a regulation exists that doesn’t | High — legal or compliance exposure |
| Creates fake sources | Cites a study or URL that doesn’t exist | Medium — credibility damage |
| Mixes real and fake | Accurate summary with one invented statistic | High — hard to detect, easy to trust |
| Contradicts itself | Says “yes” in paragraph 1, “no” in paragraph 3 | Low — usually caught by careful readers |
Mitigation strategies:
- Grounding with RAG — connect the model to verified data sources
- Human-in-the-loop — always review AI output before publishing or acting
- Confidence thresholds — configure systems to flag low-confidence outputs
- Citations — require the AI to cite sources so they can be verified
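To make the grounding idea concrete, here is a minimal sketch of the RAG pattern: retrieve verified passages first, then build a prompt that restricts the model to those sources and demands citations. The keyword retriever and document format are illustrative assumptions, not a production retrieval system.

```python
# Sketch of grounding (RAG): retrieve verified passages, then instruct
# the model to answer ONLY from them and cite a source id per claim.
# The keyword retriever below is a toy stand-in for a real vector search.

def retrieve(query, documents, top_k=2):
    """Rank documents by simple word overlap with the query (toy retriever)."""
    q_words = set(query.lower().split())
    scored = sorted(
        documents,
        key=lambda d: len(q_words & set(d["text"].lower().split())),
        reverse=True,
    )
    return scored[:top_k]

def build_grounded_prompt(query, documents):
    """Assemble a prompt that restricts the model to retrieved sources."""
    passages = retrieve(query, documents)
    sources = "\n".join(f"[{d['id']}] {d['text']}" for d in passages)
    return (
        "Answer using ONLY the sources below. Cite the source id for every "
        "claim. If the sources do not contain the answer, say so.\n\n"
        f"Sources:\n{sources}\n\nQuestion: {query}"
    )

docs = [
    {"id": "policy-7", "text": "Refunds are available within 30 days of purchase."},
    {"id": "policy-9", "text": "Warranty claims require proof of purchase."},
]
prompt = build_grounded_prompt("What is the refund window?", docs)
```

The prompt itself does the risk reduction: by forbidding answers outside the retrieved sources and requiring ids, fabrications become easier to spot during human review because any uncited claim is immediately suspect.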
2. Bias
AI models learn from data that reflects human society — including its biases. The output can perpetuate or amplify unfair patterns.
| Bias Type | How It Shows Up | Example |
|---|---|---|
| Training data bias | Historical patterns in the data skew the model's default outputs | AI writing job descriptions uses gendered language |
| Representation bias | Certain groups are underrepresented in training data | AI performs poorly on accents from underrepresented regions |
| Confirmation bias | Model reinforces the perspective of the prompt | Ask “why is X a problem?” and AI generates only negative content |
| Selection bias | Data used for RAG or fine-tuning has gaps | Customer service AI trained only on English queries fails for multilingual users |
Mitigation strategies:
- Diverse, representative datasets — for fine-tuning and RAG
- Regular bias audits — test outputs across different demographics
- Content filters — block harmful or inappropriate output
- Transparency — tell users when they’re interacting with AI
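A regular bias audit can start very simply. The sketch below flags gender-coded words in an AI-drafted job description so a human reviewer can rebalance it; the word lists are illustrative assumptions, and a real audit would use a vetted lexicon and test across many demographic dimensions.

```python
# Sketch of a lightweight bias audit for AI-drafted job descriptions.
# The coded-word sets are illustrative, not a vetted research lexicon.

MASCULINE_CODED = {"aggressive", "dominant", "rockstar", "ninja", "competitive"}
FEMININE_CODED = {"supportive", "nurturing", "collaborative", "interpersonal"}

def audit_job_description(text):
    """Flag gender-coded words so a reviewer can rebalance the draft."""
    words = {w.strip(".,;:!?").lower() for w in text.split()}
    return {
        "masculine_coded": sorted(words & MASCULINE_CODED),
        "feminine_coded": sorted(words & FEMININE_CODED),
    }

draft = "We need an aggressive, competitive rockstar to dominate the market."
report = audit_job_description(draft)
# report["masculine_coded"] → ["aggressive", "competitive", "rockstar"]
```

Running a check like this over every generated draft, and tracking the flag rate over time, turns a one-off audit into the kind of ongoing monitoring the mitigation list calls for.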
3. Reliability
Generative AI is probabilistic, not deterministic. The same prompt can produce different outputs each time.
| Reliability Challenge | Business Impact | Mitigation |
|---|---|---|
| Non-deterministic output | Hard to guarantee consistent quality | Use low temperature settings, structured prompts, and output validation |
| Prompt sensitivity | Small wording changes produce very different results | Develop and test standard prompts for critical workflows |
| Context limitations | Long documents may be summarised inconsistently | Break large inputs into chunks, use structured prompts |
| Model degradation | Performance can change when models are updated | Pin model versions for critical applications, test after updates |
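The chunking mitigation in the table can be sketched in a few lines: split a long document into overlapping word windows so each piece fits within the model's context limit, with the overlap preserving continuity across chunk boundaries. The sizes here are illustrative, not tuned values.

```python
# Sketch of chunking a long document for a context-limited model:
# overlapping word windows so no sentence is cut off without context.
# chunk_size and overlap are illustrative; real values depend on the model.

def chunk_text(text, chunk_size=200, overlap=20):
    """Split text into overlapping word chunks of at most chunk_size words."""
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break
    return chunks

doc = " ".join(f"word{i}" for i in range(500))
chunks = chunk_text(doc)
# 500 words with step 180 → three chunks starting at words 0, 180, and 360.
```

Each chunk is then summarised separately and the partial summaries combined, which gives more consistent results on long documents than a single oversized prompt.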
Scenario: Dr. Patel's board briefing on AI risks
Dr. Anisha Patel (board advisor) prepares a risk briefing for a healthcare company evaluating generative AI:
Red flags she raises:
- Fabrications in medical AI could lead to patient harm and regulatory violations
- Bias in diagnostic AI could mean unequal treatment across patient demographics
- Reliability issues in automated clinical notes could create legal liability
Her recommendation: Deploy AI with human-in-the-loop validation for all clinical applications. Use AI for administrative tasks (scheduling, summarisation) where hallucination risk is lower before expanding to clinical decision support.
Other challenges leaders face
Beyond the “big three,” business leaders should also be aware of:
| Challenge | What It Means | Business Impact |
|---|---|---|
| Data privacy | AI may process sensitive data and leak it in outputs | Must enforce data boundaries and access controls |
| Intellectual property | AI trained on copyrighted material raises IP questions | Legal risk when generating content that resembles copyrighted works |
| Over-reliance | Users trust AI too much and stop thinking critically | Quality degrades as humans skip verification |
| Shadow AI | Employees use unapproved AI tools | Data leaks, compliance violations, security gaps |
| Explainability | AI can’t explain WHY it generated a specific output | Hard to audit or defend AI-driven decisions |
Exam tip: Fabrications vs hallucinations
Microsoft uses the term “fabrications” in the official exam skills outline (not “hallucinations”). Both terms refer to the same thing — AI generating false content confidently. Use “fabrications” in your exam answers to match Microsoft’s terminology.
Turning challenges into governance opportunities
Smart leaders don’t avoid AI because of these challenges — they build governance frameworks to manage them:
- Establish acceptable use policies — define where AI can and can’t be used
- Require human review for high-stakes outputs (legal, medical, financial)
- Implement content filters — Microsoft’s content safety tools block harmful outputs
- Monitor and audit — track AI usage patterns and output quality
- Train users — help them understand AI limitations and verify outputs
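The human-review rule above can be enforced mechanically. Below is a minimal sketch of a review gate that routes AI drafts to a person whenever the content category is high-stakes or the system's confidence score falls below a threshold; the categories and the confidence field are illustrative assumptions, not a real API.

```python
# Sketch of a human-in-the-loop gate: high-stakes categories always go
# to a reviewer; everything else is gated on a confidence threshold.
# Category names and the confidence score are illustrative assumptions.

HIGH_STAKES = {"legal", "medical", "financial"}

def needs_human_review(category, confidence, threshold=0.8):
    """Require review for high-stakes categories or low-confidence output."""
    return category in HIGH_STAKES or confidence < threshold

needs_human_review("medical", 0.95)   # True: always reviewed
needs_human_review("marketing", 0.6)  # True: low confidence
needs_human_review("marketing", 0.9)  # False: auto-approved path
```

The key design choice is that the high-stakes check comes first: no confidence score, however high, lets legal, medical, or financial output skip the reviewer.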
Knowledge check
Elena's consulting firm deploys Copilot for drafting client proposals. A consultant submits a proposal that cites a regulation that doesn't actually exist. What type of AI challenge is this?
Tomás notices that the AI writing job descriptions at PacificSteel consistently uses masculine language. What type of challenge is this, and what should he do?
Dr. Patel recommends that a healthcare company deploy AI for administrative tasks before clinical applications. What is the primary reason for this phased approach?
Next up: When Generative AI Creates Real Business Value — scalability, automation, and the scenarios where AI delivers transformative impact.