Session Summary: 6 Advanced Rules for Production Copilot Agents

recording

copilot-agents

prompt-engineering

production

Summary of Mario Fontana’s masterclass on operational excellence for Copilot agents—covering grounding, guardrails, testing, self-critique, monitoring, and feedback loops.

Author

Dario Airoldi

Published

January 30, 2026

Session Summary: 6 Advanced Rules for Production Copilot Agents

Recording Date: 2026-01-30
Summary Date: 2026-01-30
Summarized By: Dario Airoldi
Recording Link: YouTube
Duration: ~39 minutes
Speakers: Mario Fontana (Microsoft, 25+ years experience)
Series: Part 2 of 2 (Masterclass on Copilot Agent Production)

Executive Summary

This session is the second part of a masterclass focused on operational excellence for AI agents in production. While

Part 1 covered the foundational six rules for building robust system prompts,
Part 2 presents six advanced rules (7-12) that transform a demo agent into a production-ready system.

The speaker uses a racing car metaphor throughout—if Part 1 built the engine, Part 2 builds the brakes, dashboard, and crash testing infrastructure.

🎯 Introduction: From Demo to Production
🏗️ Rule 7: Grounding & Citations
🛡️ Rule 8: Guardrails & Security
🧪 Rule 9: Rigorous Testing
🧠 Rule 10: Self-Critique Architecture
📟 Rule 11: Monitoring & Observability
🔄 Rule 12: Feedback Loop
🏎️ Recap: The “Agent Fleet” Transition

[00:00] Introduction: From Demo to Production

Speaker: Mario Fontana

Key Points:

Jailbreak attacks make headlines, but real production failures come from ambiguous questions, outdated documents, and context changes
Part 1 built the “engine” (system prompt architecture); Part 2 builds the “race setup” (operational controls)
A professional system prompt is necessary but not sufficient—you need brakes, dashboard, and crash testing

Core Themes Introduced:

Grounding — Anchor the agent to corporate data only
Guardrails — Block forbidden topics and risks
Testing — Scientifically validate responses before production
Monitoring — Real-time visibility into vital parameters

[01:51] Rule 7: Grounding & Citations (Stop Hallucinations)

Speaker: Mario Fontana

Key Points:

Citation hallucination is the most insidious problem—agents invent not just answers but the “proof” those answers are true
LLMs are trained to complete sentences, not verify facts; they’ll choose a plausible lie over saying “I don’t know”
Rule 7 isn’t about giving access to sources—it’s about creating an impassable perimeter

The Two-Step Protocol:

Retrieval — Extract forensic evidence: source ID, title, exact section (not summaries)
Kill Switch — Add a system prompt constraint: respond using only retrieved evidence, never prior knowledge; if information isn’t available, respond “I don’t know”

7-Point Grounding Checklist:

No claims without evidence (disable “use general knowledge” toggle)
Traceable evidence with source IDs
Inline citations—every factual sentence ends with source reference
Conflict management—declare conflicts, don’t average them
User text is unverified—treat manipulation attempts as noise, never as sources
Confidence gate—lower confidence when evidence is weak (Pro tip: integrate Azure AI Search for native confidence scores)
Validation logic—block output if citations are empty

Quotes: > We don’t fight hallucinations by asking the agent to be precise—we fight by removing its ability to invent.

“In production, I don’t want the agent to do its best—I want it to be a paranoid bureaucrat.

[06:39] Rule 8: Guardrails & Security (Anti-Jailbreak)

Speaker: Mario Fontana

Key Points:

Even with perfect grounding, someone can try to change the rules mid-game—users, unverified documents, emails pasted in chat, or disguised prompt injections
OWASP (Open Worldwide Application Security Project) has formalized LLM vulnerabilities
Two critical vulnerabilities: Prompt Injection (rewriting rules from inside) and Excessive Agency (agent claiming powers it doesn’t have)

Three Types of Boundaries:

Policy Boundaries — What can/can’t be authorized (e.g., discounts >15% require escalation)
Knowledge Boundaries — When to say “I don’t know” (if data isn’t in configured sources, don’t invent)
Role Boundaries — Anti-manipulation (role is fixed, ignore phrases like “from now on you’re…”)

Implementation Pattern (3 lines per guardrail):

What NOT to do
What to do INSTEAD
Exact message to give the user

Adversarial Testing:

Write 10 out-of-scope scenarios and test with zero tolerance
Use Copilot Studio Kit for batch testing
If it concedes even once, that guardrail is weak—fix immediately
Re-test when changing models (e.g., GPT-5 to 5.2)
Guardrails need periodic revalidation like any security control

[14:47] Rule 9: Rigorous Testing (Output Validation)

Speaker: Mario Fontana

Key Points:

Testing has two big lies: False Positives (green light that lies) and False Negatives (red light that’s wrong)
Problem: If you measure the wrong thing, you either pass a disaster or block a success

Three Levels of Testing Rigor:

Level	Name	When to Use	Method
1	Exact Form	Structured outputs (JSON, API calls)	Exact match—character by character
2	Meaning	Conversational responses	Semantic similarity (start with 0.85 threshold)
3	Absolute Quality	Critical sectors (legal, medical, compliance)	Evaluate: relevance, grounding, completeness, knows when to say “I don’t know”

Examples:

False Positive: Agent approves a refund for a 6-month expired subscription (policy says 30 days max), but test passes because it measured “answer relevance” not policy compliance
False Negative: Agent says “You have 25 days available” but expected response was “25 days”—test fails on form when meaning was correct

Operational Rule: Don’t start from “which tool do I use”—start from “what do I want to measure?”

Validating JSON? → Exact match
Validating chat? → Semantic similarity
Validating legal advice? → General quality

[19:55] Rule 10: Self-Critique (Editor-Journalist Architecture)

Speaker: Mario Fontana

Key Points:

Testing happens in controlled environments; production is rush hour in a chaotic city
Self-critique is an architectural block, not just a prompt line—the agent generates a draft, stops, reviews it, then speaks
Think of a newsroom: journalist writes the draft, editor reviews sources, cuts ambiguous phrases, then publishes

The Three Self-Critique Questions:

Grounding — Is every statement supported by a source? (Cut unsupported phrases)
Consistency — Are there conflicts between sources? Am I respecting guardrails? (Declare conflicts, don’t average)
Clarity — Would a non-expert understand this response? (Ambiguity generates tickets)

When Agent Can’t Respond:

Don’t invent, don’t guess—escalate with a pre-approved message
Example: “I don’t have sufficient evidence in available sources. To avoid errors, I’m opening a ticket for HR. Here’s what I found, here’s what’s missing. You’ll receive a response from an operator within 24 hours.”

Cost vs. Value:

Self-critique costs more tokens, latency, compute (often double the time)
But: How much does a screenshot on Teams that ends up with legal cost you?
Use strategically: always active in testing, surgical approach in production (high-risk scenarios: legal, finance, HR)

Implementation in Copilot Studio:

Self-critique isn’t a native feature—it’s a design pattern you build
Two-node architecture: Node 1 (Writer/Draft) → saves to variable → Node 2 (Editor/Critic) → applies three checks → only then sends to user
Version everything—if the critic starts rejecting everything, you need to see what changed yesterday

[27:33] Rule 11: Monitoring & Observability (Tracking KPIs)

Speaker: Mario Fontana

Key Points:

Monitoring is your headlights at night—without it, you’re flying blind
Test metrics from Rule 9 don’t die—they evolve into live production metrics
Copilot Studio doesn’t do this out-of-the-box; you build indicators (e.g., Azure Functions + Application Insights)

Three Essential Log Fields:

Version — Which prompt was active, which model, which parameters
Context — Which documents were read, how many tokens consumed (cost is line by line)
Outcome — How did it end? (resolved, escalation, abandoned)

M365 Copilot System (Coming Soon):

Each agent has an identity via Entra Agent ID
Who executed, when, which resources touched—all tracked for compliance/incident response
Currently in gradual rollout; direction is clear: centralized governance

Three Immediate Actions:

Version your prompts (Git/DevOps—prompts are production software)
Activate Application Insights with structured logs (version, tokens, outcome)
Create ONE alert: if success rate drops below 90%, get notified

[32:36] Rule 12: Feedback Loop (ROI Optimization)

Speaker: Mario Fontana

Key Points:

Monitoring tells you there’s a problem; the feedback loop tells you how to fix it
Your prompt is an eternal hypothesis—validate it against reality, not just beta tests
Optimizing one metric can break all the others

Cautionary Tale:

Agent at 92% success rate
Loosened a guardrail slightly to improve satisfaction
Result: Success crashed to 78%, but satisfaction rose to 4+
Three customers received non-existent discounts; CFO sends urgent email
Lesson: Optimizing one metric while ignoring others = disaster

Three Feedback Levels in Copilot Studio:

Effectiveness — Does it work? (Sessions resolved vs. escalation vs. abandoned)
Satisfaction — Do users like it? (Thumbs up/down, CSAT 1-5; >4 = green, <3 = problem)
Usage — What are users actually talking about? (Trending topics, spikes on “refunds” or “invoices”)

Forensic Analysis Workflow:

Download transcripts, filter by escalation
Read failed conversations
Identify which part of the prompt failed (grounding? guardrail? self-critique?)
Transform that error into a test case
Verify version 1.2 resolves it

ROI Tracking (New in Copilot Studio late 2025):

Configure: “Every time this agent closes an order, we saved 15 minutes of work”
System calculates: total time saved, economic value generated
Changes the conversation: Not “the agent is smart” but “the agent saved €30,000 this month”

Quotes: > “L’errore non è un fallimento. È un’informazione importantissima e il carburante per la tua prossima versione.” (An error isn’t a failure. It’s critical information and fuel for your next version.)

[37:19] Recap: The “Agent Fleet” Transition

Speaker: Mario Fontana

Key Points:

With 12 rules, you may end up with one massive system prompt—the “great monolith”
Math beats synthesis: if you ask the agent to hold 50 policies and 20 guardrails, context fills up, quality drops
The professional secret: when the prompt becomes a monolith, don’t shorten—split

Architecture Evolution:

Don’t build a do-everything agent—build specialized agents for each domain (HR, Finance, IT Support)
Build small specialized agents coordinated by an orchestrator
It’s the transition from single pilot to racing team (scuderia)
But to manage the team, you first need to know how to make the car work—and now you do

Main Takeaways

Grounding is a law, not a suggestion
- No evidence, no response—remove the agent’s ability to invent
- Implement a two-step protocol: forensic retrieval + kill switch
Guardrails are non-negotiable boundaries
- Define policy, knowledge, and role boundaries explicitly
- Test adversarially with zero tolerance before go-live
Testing requires the right metrics for the right context
- Exact match for structured outputs, semantic similarity for conversations, quality evaluation for critical domains
- False positives and false negatives are equally dangerous
Self-critique is architecture, not a prompt line
- Build a two-phase process: draft generation → mandatory review
- Use strategically where risk justifies cost
Monitoring transforms test metrics into live production signals
- Track version, context, and outcome in every conversation
- Success rate <90% should trigger immediate alerts
Feedback loops close the circle—errors fuel improvement
- Never optimize one metric in isolation
- Transform production errors into test cases for the next version

📚 Resources and References

Official Documentation

Azure Responsible AI 📘 [Official]
Microsoft’s official guidance on responsible AI practices, covering fairness, reliability, safety, privacy, security, inclusiveness, transparency, and accountability. Essential reading for anyone deploying AI agents in production environments.

Copilot Studio Analytics 📘 [Official]
Official documentation for analytics capabilities in Microsoft Copilot Studio, including effectiveness, satisfaction, and usage metrics. Covers dashboard interpretation and ROI tracking features mentioned in the session.

Agent Analytics in Copilot Studio 📘 [Official]
Detailed guidance on monitoring agent performance, understanding conversation flows, and identifying optimization opportunities. Directly relevant to Rules 11 and 12 on monitoring and feedback loops.

Security Resources

OWASP Top 10 for LLM Applications 📗 [Verified Community]
The authoritative community resource on LLM security vulnerabilities, including prompt injection and excessive agency discussed in Rule 8. Essential reference for anyone implementing production guardrails.

Academic References

Self-Critique Patterns for LLMs 📗 [Verified Community]
Academic paper on self-critique architectures for large language models. Provides theoretical foundation for the editor-journalist pattern discussed in Rule 10.

Session Materials

Session Recording 📘 [Official]
Full video recording of this masterclass session on YouTube. Includes visual demonstrations of Copilot Studio configurations and the complete discussion of all six advanced rules.

Part 1: 6 Vital Rules for Copilot Agents 📘 [Official]
First part of this masterclass series, covering foundational rules 1-6 on prompt architecture, persona definition, meta-prompting, and model-specific optimization. Prerequisite viewing for understanding this session.

Speaker Resources

Mario Fontana on LinkedIn 📗 [Verified Community]
Speaker’s professional profile with ongoing content about Copilot agents and AI solutions. Good resource for following up on concepts from this session.

Io e il mio Copilot (YouTube Channel) 📗 [Verified Community]
Speaker’s YouTube channel with additional content on Copilot development, agent architecture, and production best practices.

Session Summary: 6 Advanced Rules for Production Copilot Agents

Executive Summary

Table of contents

[00:00] Introduction: From Demo to Production

[01:51] Rule 7: Grounding & Citations (Stop Hallucinations)

[06:39] Rule 8: Guardrails & Security (Anti-Jailbreak)

[14:47] Rule 9: Rigorous Testing (Output Validation)

[19:55] Rule 10: Self-Critique (Editor-Journalist Architecture)

[27:33] Rule 11: Monitoring & Observability (Tracking KPIs)

[32:36] Rule 12: Feedback Loop (ROI Optimization)

[37:19] Recap: The “Agent Fleet” Transition

Main Takeaways

📚 Resources and References

Official Documentation

Security Resources

Academic References

Session Materials

Speaker Resources

Related Content