Learning Hub
  • Home
  • News
    • Latest Updates
    • VS Code v1.107 Release Summary
    • VS Code v1.107 Release Analysis
  • Events
    • Build Conference 2025
    • Ignite Conference 2025

    • All event content available via sidebar →
  • Architectures
    • Podcast and RSS Feed Analysis
    • Analyzing Atom and RSS Specifications
    • Additional Metadata for RSS and Atom Feeds
    • C# Reference Classes for Reading Feeds
  • Azure
    • Azure Best Practices

    • Security & Identity
    • Azure AD External Users Explained
    • External Users Types & Invitation

    • Data Services
    • Table Storage Access
    • Blob Storage Access Options
    • Blob Storage Limitations
    • CosmosDB Access

    • Messaging & Events
    • EventHub Access

    • Developer Learning Path
    • Azure Apps Q&A
    • Azure Web App Limitations
    • Azure Apps CLI Commands
    • Azure Web App CLI Guide
    • Azure Functions Q&A
    • Azure Functions Limitations
    • Azure Key Vault Q&A
    • Azure Key Vault Limitations
    • Azure Key Vault HowTo
    • Azure App Configuration Q&A
    • Azure App Configuration Exercise
    • Azure Monitor Q&A
    • Azure Container App
    • Azure Container App Environments
    • Azure Container App CLI HowTo
    • Azure Containers Q&A
  • Markdown & Docs
    • Quarto Documentation
    • Using Quarto (Overview)
    • Quarto Architecture: How It Works
    • Quarto: Monolithic vs. Modular Deployment
    • Quarto: Split Navigation Build
    • Quarto.yml Document Structure
    • Quarto Theming and Styling
    • Quarto-Specific Markdown Features
    • Optimizing Quarto Build & Deploy
    • How Quarto Sidebar Works
    • Quarto Navigation Workflow
    • Sidebar Page Transition Optimization
    • Troubleshooting Quarto Sites
    • Deploying Quarto to GitHub Pages
    • Deploying Quarto to Azure Storage

    • MkDocs Documentation
    • Using MkDocs (Overview)
    • MkDocs Architecture: How It Works
    • MkDocs: Monolithic vs. Modular Deployment
  • Technologies
    • GitHub & Version Control
    • Git Command Line
    • GitHub CLI Commands

    • Prompt Engineering
    • GitHub Copilot & Markdown
    • Naming & Organizing Prompts
    • Structuring Prompt Files

    • API Testing & Tools
    • HTTP/REST API Testing Comparison
    • Using .http Files (VSCode)

    • Programming & Hardware
    • C# DateTime Formats
    • yq YAML Processor
    • Surface Laptop 6
  • How-To Guides
    • Configure .prompt Files in VS
    • No-IP DDNS Setup
  • Ideas & Projects
    • IQPilot - AI Quality Tool
    • LearnHub Overview

    • DIY Battery Packs
    • DIY E-Bike Projects
  • Culture & Travel
    • Paris Cultural Journey 2025
    • San Francisco & Seattle 2026
  1. Events
  2. Build Conference 2025
  3. BRK - Breakout Sessions
  4. BRK141 RAG for enterprise agents with Azure AI Search
  5. Knowledge Retrieval: RAG for Enterprise Agents with Azure AI Search
  • Home
    • Learning Hub Concept
    • Using Learning Hub for Learning Technologies
  • News & Updates
    • 20251224 vscode v1.107 Release
      • Recording Summary: VS Code v1.107 Release Live Stream
      • Session Analysis: VS Code v1.107 Release Live Stream
    • 20260111 6 VITAL Rules for Production-Ready Copilot Agents
      • 6 VITAL Rules for Production-Ready Copilot Agents
  • Events
    • Build Conference 2025
      • Microsoft Build 2025 - Session Articles
      • BRK - Breakout Sessions
        • BRK101 Dotnet app modernization
          • AI-Powered .NET App Modernization: GitHub Copilot & Azure Migration Tools
          • Executive Summary
        • BRK103 Microsoft Developers Use AI
          • How Microsoft Developers Use AI in Real-World Coding
          • How Microsoft Developers Use AI in Real-World Coding
        • BRK104 Building the Next Generation of Apps with AI and .NET
          • Building the Next Generation of Apps with AI and .NET
          • SUMMARY
        • BRK106 Elevating Development with .NET Aspire AI
          • Elevating Development with .NET Aspire: AI, Cloud, and Beyond
        • BRK114 CS 14 Language Features and Beyond
          • Appendix A: CS Version History and Features
          • C# 14 Language Features and Beyond: A Demo-Filled Tour
          • AI Summary
        • BRK119 Debug Like a Pro - Improve Your Efficiency
          • Debug Like a Pro: Improve Your Efficiency with Visual Studio & Copilot
          • Debug Like a Pro: Improve Your Efficiency with Visual Studio & Copilot
        • BRK122 Supercharge Your Git workflow with VS Code
          • Passkey Authentication Information
          • Inside Observability with .NET 10
          • The Future of Web Development with ASP.NET Core & Blazor
          • Sample
            • Microsoft Open Source Code of Conduct
            • Build25 BRK122 - The future of web development with ASP.NET Core & Blazor
            • Security
            • Support
            • ModernWebApi
              • ModernWebApi
        • BRK123 Build AI Apps with Microsoft Graph Data
          • AI Infused Mobile & Desktop App Development with .NET MAUI
          • SUMMARY
        • BRK127 Unleash developer potential with AI and Dev Box
          • Unleash Developer Potential with AI and Dev Box
          • Unleash Developer Potential with AI and Dev Box
        • BRK141 RAG for enterprise agents with Azure AI Search
          • Knowledge Retrieval: RAG for Enterprise Agents with Azure AI Search
        • BRK155 Azure AI Foundry - app and Agent Factory
          • Azure AI Foundry: The AI App and Agent Factory
        • BRK163 Create agents for 365 Copilot with 365 Agents SDK
          • Create Agents for Microsoft 365 Copilot with Microsoft 365 Agents SDK
          • Create Agents for Microsoft 365 Copilot with Microsoft 365 Agents SDK
        • BRK165 Building agents for Microsoft 365 Copilot
          • Building Agents for Microsoft 365 Copilot: From No-Code to Pro-Code
          • Building Agents for Microsoft 365 Copilot: From No-Code to Pro-Code
        • BRK176 agent solutions with Copilot Studio and M365 Agents SDK
          • Architecting Multi-Agent Solutions with Copilot Studio and M365 Agents SDK
          • Architecting Multi-Agent Solutions with Copilot Studio and M365 Agents SDK
          • Architecting Multi-Agent Solutions with Copilot Studio and M365 Agents SDK
        • BRK195 Inside Azure innovations with Mark Russinovich
          • Inside Azure Innovations with Mark Russinovich
          • Inside Azure Innovations with Mark Russinovich
        • BRK199 Accelerate Modernization
          • Accelerate Modernization at Scale: From Legacy to Cloud-Native with AI
          • Accelerate Modernization at Scale: From Legacy to Cloud-Native with AI
        • BRK204 Wharts new in Microsoft Databases
          • README.Sonnet4
        • BRK204 Whats new in Microsoft Databases
          • What’s New in Microsoft Databases: Empowering AI-Driven App Development
          • What’s New in Microsoft Databases: Empowering AI-Driven App Development
        • BRK223 An overview of Windows AI Foundry
          • An Overview of Windows AI Foundry: Local AI Development and Deployment
          • An Overview of Windows AI Foundry: Local AI Development and Deployment
        • BRK224 Integrate AI using Windows AI APIs
          • SUMMARY
        • BRK225 Bring your own model to Windows using Windows ML
          • Bring Your Own Model to Windows using Windows ML
        • BRK226 Boost Development Productivity
          • Boost Development Productivity: Windows Latest Tools and Tips
          • Boost Development Productivity: Windows Latest Tools and Tips
        • BRK229 Introducing Copilot Solutions
          • BRK229: Introducing Copilot Solutions
        • BRK229 Unlock agents for your apps using MCP on Windows
          • Unlock Agentic Interactions for Your Apps Using MCP on Windows
          • Unlock Agentic Interactions for Your Apps Using MCP on Windows
      • DEM - Demonstrations
        • DEM508 Streamlining Application Testing with .NET Aspire and Playwright
          • Streamlining Application Testing with .NET Aspire and Playwright
        • DEM509 Essential AI Prompts for Developers
          • Essential AI Prompts for Developers: 4 Strategies to Make AI Work Super Hard for You
          • Essential AI Prompts for Developers: 4 Strategies to Make AI Work Super Hard for You
        • DEM515 Write better C# code
          • Write Better C# Code: Underutilized Features for Modern Development (Deep Analysis)
          • Executive Summary
        • DEM517 Build, Deploy, & Use Your First MCP Server
          • Build, Deploy, & Use Your First Model Context Protocol (MCP) Server
        • DEM518 dotnet run app
          • .NET 10 Preview: Run C# Files Directly with dotnet run app.cs
          • .NET 10 Preview: Run C# Files Directly with dotnet run app.cs
        • DEM519 Agent mode for serious developers
          • Agent Mode for “Serious” Developers: Autonomous Coding with VS Code
          • Agent Mode for “Serious” Developers: Autonomous Coding with VS Code
        • DEM520 Local AI Development with Foundry Local and .NET Aspire
          • Working Qwen2.5 Sample with Foundry Local
          • Foundry Local Samples
          • DEM520: Local AI Development with Foundry Local and .NET Aspire
        • DEM524 Running Large Language Models on your local machine
          • The Power Within: Running Large Language Models on Your Local Machine
        • DEM571 Extending your application with powertoys
          • Extending Applications with PowerToys Command Palette
          • Extending Applications with PowerToys Command Palette
        • DEM581 Transforming Microsoft Learn with AI
          • AI-Powered Learning: Transforming Microsoft Learn with Intelligent Assistance
          • Overview
      • ODF - On-Demand Features
        • ODFP957 Sentry and Copilot Integration for AI Debugging
          • ODFP957: Sentry + GitHub Copilot Integration for AI-Powered Debugging
          • Session Goal
      • STUDIO - Studio Sessions
        • STUDIO14 Agents AI and Azure AI Foundry
          • STUDIO14: Agents, AI, and Azure AI Foundry - A Deep Dive into Enterprise AI Platforms
          • STUDIO14: Agents, AI, and Azure AI Foundry - A Deep Dive into Enterprise AI Platforms
          • Agents, AI, and Azure AI Foundry: A Deep Dive into Enterprise AI Platforms
    • Ignite Conference 2025
      • 03. Migrate and modernize
        • BRK431 - Cloud Native Innovations with Mark Russinovich
          • Summary
  • Technologies
    • Authentication
    • Azure
      • 00. Azure Naming conventions
        • Azure Naming Conventions
      • 01. Azure Apps
        • Q. What are Deployment slots ? how can I use them?
        • Azure Web Apps Limitations
        • 03. cli commands
        • Azure Web App CLI Commands Guide
      • 01. Azure Functions
        • 📑 Table of Contents
        • ⚠️ Azure Functions Limitations
      • 02. Azure Key Vault
        • 📑 Table of Contents
        • Azure Key Vault Limitations
        • 03. HowTo.Create and retrieve secrets from Azure Key Vault
      • 03. Azure App Configuration
        • 📑 Table of Contents
        • Azure App Configuration with Key Vault Integration 🚀
      • 04. Monitor App Performance
        • 📑 Table of Contents
      • 05. Azure Containers
        • Azure Container Apps Overview
        • Azure Container App Environments Overview
        • 03. HowTo. Use az cli to create ACI
        • HowTo: Use Azure CLI to Create Azure Container Apps
        • Q. Compare and contrast ‘Azure Container Apps’ and ‘Azure Web Containerized Apps’?
      • 08. EventHub Access options
        • README
    • Data
      • 01. TableStorageAccess options
        • Azure Table Storage Access with C#
      • 02. CosmosDB Access options
        • 🌐 Azure CosmosDB Access Options (with C#)
        • 🎯 CosmosDB Partitioning Strategies
      • 03. BlobStorage Access options
        • Azure Blob Storage Access Approaches with C#
        • ⚠️ Azure Blob Storage Limitations
    • Programming Languages
      • 20250827 what is yq overview
        • Appendix B: Advanced YAML to JSON Conversion Techniques
        • Appendix A: The yq Tool - Complete Guide
        • How to Convert YAML to JSON - Complete Guide
      • Lang01 CSharp
        • C# Date and Time Data Types: A Comprehensive Guide
    • GitHub
      • 01.01 Cli
        • 20250118 Github Cli
          • 🚀 CLI Commands to Interact with GitHub Repositories
        • 20250709 Manage GitRepo from commandline
          • HowTo: Manage your Git repo from the command line
      • 01.Repositories
        • 20250825 Github repositories limitations
          • Overcoming GitHub Repository Limitations
      • 02.Actions
        • 20251018 ISSUE Github action fails with Artifact storage quota has been hit
          • ISSUE: Github action fails with ‘Artifact storage quota has been hit’
          • Quick Start Guide - Fixing Artifact Storage Quota Issue
          • Summary - GitHub Actions Artifact Storage Quota Issue
          • FINAL SOLUTION: No Artifacts Approach
          • WSL Error Fix - Updated Solution
    • Prompt Engineering
      • How GitHub Copilot Uses Markdown and Prompt Folders within Your Repo
      • How to Name and Organize Prompt Files in Your GitHub Repository
      • How to Structure Content for GitHub Copilot Prompt Files
      • How to Structure Content for GitHub Copilot Agent Files
      • How to Structure Content for GitHub Copilot Instruction Files
      • How to Create a Prompt Orchestrating Multiple Agents
      • Multi-Agent Orchestration Plan V2 for Prompt and Agent File Creation
      • Prompts and Markdown Structure for a Documentation Site
    • HTTP Client
      • 20250711 Use http files for easy and repeatable test
        • README
      • 20250713 Use http files for easy and repeatable test
        • 🔍 HTTP/REST API Testing Tools - Comprehensive Comparison
        • Using HTTP Files for API Testing (Visual Studio)
        • README
    • Markdown Compilers
      • Quarto
        • Using Quarto
        • How Quarto Works
        • Monolithic vs. Modular Deployment
        • Split Navigation Build from Content Rendering
        • _quarto.yml Document Structure
        • Quarto-Specific Markdown Features
        • Quarto Theming and Styling
        • How does quarto site layout works
        • Navbars Navigation Workflow
        • Sidebar Page Transition Optimization
        • Optimizing Quarto Build and Deploy Performance
        • Troubleshooting Quarto Sites
        • Deploying a Quarto Site to GitHub Pages
        • Deploying a Quarto Site to Azure Storage Accounts
        • Quarto Documentation Series
      • MkDocs
        • Using MkDocs
        • MkDocs Architecture - How MkDocs Works
        • MkDocs Architecture - Monolithic vs. Modular Deployment
    • Feed Architectures & Protocols
      • 20251005 Feeds architectures and protocols
        • Podcast and RSS Feed: Information Gathering and Analysis
        • Analyzing Atom and RSS Specifications
        • Additional Metadata for RSS and Atom Feeds
        • C# Reference Classes for Reading RSS and Atom Feeds
    • Diginsight
      • 20250817 Query Cost Metrics with Diginsight
        • CosmosDB Query Cost Metrics with Diginsight
    • Hardware
  • How-To Guides
    • HowTo: Configure .prompt Files for Markdown Preview in Visual Studio
    • 20251013 HowTo Expose My Computer with No-IP DDNS
      • 🌐 HowTo: Expose My Computer with No-IP DDNS
  • Issues & Solutions
    • 20250709 fatal error cloning a repo with Visual Studio
      • PROBLEM DESCRIPTION
    • 20251107 FeedId cannot contain forward slash
      • ISSUE: ‘title’ - 20251028
    • YYYYMMDD sample
      • README

  • Ideas & Projects
    • IQPilot - AI Content Quality Tool
      • IQPilot
        • IQPilot Overview: AI-Assisted Content Development Tool
        • IQPilot Getting Started Guide
        • IQPilot Implementation Details
      • LearnHub
        • 01. Learning Hub Overview
          • Learning Hub Concept
          • Using Learning Hub for Learning Technologies

  • Other Technologies
    • DIY Li-Ion Battery Packs
    • DIY E-Bike Projects

  • Culture & Travel
    • Paris Cultural Journey 2025
    • San Francisco & Seattle 2026

On this page

  • Knowledge Retrieval: RAG for Enterprise Agents with Azure AI Search
    • Executive Summary
    • Key Topics Covered
      • ?? 1. The Evolution Beyond Traditional RAG
      • ?? 2. Agentic Retrieval: The Next Generation Query Pipeline
      • ?? 3. Multi-Modal Document Processing Revolution
      • ?? 4. Enterprise-Grade Security and Access Control
      • ?? 5. Data Integration and Pipeline Innovation
      • ?? 6. Model Context Protocol and Developer Experience
    • Technical Architecture Deep Dive
      • Knowledge Agent Abstraction
      • Multi-Modal Processing Pipeline
      • Security Architecture
    • Performance Metrics and Validation
      • Agentic Retrieval Effectiveness
      • Multi-Modal Processing Benefits
    • Live Demonstration Results
      • Agentic Retrieval in Action
      • Multi-Modal Document Processing
      • Security Enforcement
      • Agent-Driven Development
    • Session Highlights
    • Implementation Guide
      • Getting Started with Agentic Retrieval
      • Best Practices for Enterprise Deployment
    • Advanced Applications
      • Enterprise RAG Architecture
      • Development Acceleration
      • Security and Compliance
    • Resources and Links
      • Official Documentation
      • Sample Code and Demos
      • Security and Compliance
    • About the Speaker
  • Edit this page
  • Report an issue
  1. Events
  2. Build Conference 2025
  3. BRK - Breakout Sessions
  4. BRK141 RAG for enterprise agents with Azure AI Search
  5. Knowledge Retrieval: RAG for Enterprise Agents with Azure AI Search

Knowledge Retrieval: RAG for Enterprise Agents with Azure AI Search

Session Date: May 20, 2025
Duration: 1 hour
Venue: Build 2025 Conference - BRK141
Speaker: Pablo Castro (CVP & Distinguished Engineer, AI Platform, Microsoft)
Link: [Microsoft Build 2025 Session BRK141]

Azure AI Search RAG Evolution

Executive Summary

This technical deep-dive session reveals the evolution from traditional RAG to purpose-built enterprise AI systems powered by Azure AI Search. Pablo Castro demonstrates how knowledge retrieval has transformed from simple vector search to sophisticated agentic retrieval systems capable of multi-hop reasoning, complex query decomposition, and enterprise-grade security. The session showcases three major innovation areas: agentic retrieval with 40% improvement in answer relevance, multi-modal document processing with AI-powered extraction, and native Entra ID-based access control for secure enterprise deployment.


Key Topics Covered

?? 1. The Evolution Beyond Traditional RAG

The Historical Context: Two Years of RAG Progress

Pablo’s Opening Insight: > “About two years ago, we started to talk about RAG… it was built out of the tools available in the room. There was this clever observation of in-context learning… we managed to put these things together.”

Traditional RAG Architecture:

  • In-context learning - Language models with concatenated retrieved content
  • Available tools approach - Using existing search stacks not designed for AI
  • Vector search adoption - Addressing vocabulary gaps and semantic understanding
  • Industry success - Two years of progress with first-wave applications

The Transition to Purpose-Built Systems

From Adoption to Innovation:

  • Historical approach - “Whatever we adopted to do the job”
  • Current evolution - “Things that we purposely built and constructed”
  • Strategic goal - Make tasks easier, better, and faster to solution
  • Enterprise focus - Real-world agents solving real-world problems

?? 2. Agentic Retrieval: The Next Generation Query Pipeline

The Limitation of Single-Shot Search

Traditional Search Constraints:

  • Linear results - Top-K results from single query execution
  • Simple fact-seeking works well: “What are the security updates for KB article #123456?”
  • Complex queries fail - Multi-part questions with typos, references, and context

The “Doesn’t Work” Scenario Example:

Complex Query: "What does KB [typo] article fix security issue mentioned earlier 
regarding the vulnerability we discussed?"

Challenges:

- Multiple moving parts in single question
- Typos requiring correction
- Cross-references requiring resolution
- Conversational context dependency

Agentic Retrieval Architecture

Revolutionary Approach:

Traditional: User Query ? Single Search ? Top-K Results ? LLM
Agentic: Chat History + Context ? Query Planning ? Parallel Execution ? Merge Results ? LLM

Intelligent Query Processing:

  • Query planning - LLM-powered understanding of information needs
  • Query decomposition - Breaking complex queries into retrievable components
  • Typo correction - Context-aware error resolution
  • Parallel execution - Multiple search operations simultaneously
  • Result merging - Intelligent combination of diverse information sources

Live Demonstration: Outdoor Gear Analysis

Complex Query Scenario:

User: "What are examples of popular tents?"
System: "TrailMaster and SkyView are two popular choices."
User: "Which one fits more people?"

Agentic Processing:

  1. Context analysis - Understanding previous conversation about tent models
  2. Query branching - Two separate searches:
    • “TrailMaster tent maximum capacity”
    • “SkyView tent maximum capacity”
  3. Parallel execution - Both queries processed simultaneously
  4. Result synthesis - Combined capacity information in comparative format

Performance Metrics and Validation

Quantified Improvements:

  • 40% increase in answer relevance for complex queries
  • 30% increase in result rate for difficult question scenarios
  • Multi-dataset validation - Support, MIML (Multi-Industry Multi-Language)
  • Groundedness preservation - No regression in hallucination prevention

Evaluation Framework:

  • Content relevance - Retrieved information matches query intent
  • Answer relevance - LLM response addresses actual question
  • Groundedness - Response based on retrieved data, not hallucinated

?? 3. Multi-Modal Document Processing Revolution

Beyond Simple Text Chunking

Traditional RAG Data Problems:

  • 500-token chunking - Arbitrary text segmentation ignoring document structure
  • Text-only processing - Missing critical visual information
  • Layout ignorance - Losing contextual relationships and hierarchies
  • Manual extraction - Developer responsibility for complex document parsing

Advanced Document Intelligence

Real-World Document Challenges:

  • Complex schematics - Arrows, text boxes, and diagram relationships
  • Multi-modal content - Text and images requiring coordinated understanding
  • Layout significance - Document structure conveying meaning
  • Addressable components - Individual elements for citation and reference

AI-Powered Processing Pipeline

End-to-End Document Processing:

  1. AI Document Intelligence - Automatic layout and structure extraction
  2. Image verbalization - LLM-powered image description generation
  3. Multi-modal embeddings - Coordinated text and visual understanding
  4. Component addressability - Individual images and sections for citation
  5. Sophisticated chunking - Structure-aware content segmentation

Live Demo: Azure Search Documentation Processing

Multi-Modal RAG Implementation:

  • Data source - PDF documentation with diagrams and text
  • Processing options - Simple extraction vs. full AI Document Intelligence
  • Image handling - Verbalization vs. embedding vs. hybrid approaches
  • Knowledge Store - Addressable component storage for application integration

Generated Application Results:

  • Automatic image extraction - Individual diagrams made addressable
  • Visual grounding - Images shown alongside text responses
  • Citation support - References to specific document sections
  • Multi-format support - Text, images, and layout information combined

?? 4. Enterprise-Grade Security and Access Control

The Security Challenge of Intelligent Retrieval

The Problem: > “An interesting effect of all this super-smart retrieval systems and the copilots we build on top of them is that they’ll find everything.”

Enterprise Requirements:

  • Access control propagation - Document permissions must flow through AI systems
  • Identity integration - Enterprise identity systems must control AI access
  • Group membership - Dynamic group changes affecting document visibility
  • Audit trails - Compliance and security monitoring requirements

Native Entra ID Integration

Announcement: Document-Level Access Control

  • Automatic group expansion - Dynamic membership resolution
  • User-scoped indexing - Search results filtered by user permissions
  • RBAC integration - Azure role-based access control support
  • Zero manual configuration - Automatic security policy enforcement

Implementation Architecture:

Document ACLs ? Azure AI Search ? User Token ? Filtered Results
??? User IDs field
??? Groups field  
??? RBAC roles field

Live Security Demo

Access Control in Action:

  • Document creation - Three documents with different permission sets
  • User authentication - Application identity with user token delegation
  • Filtered results - Only accessible documents returned
  • Dynamic enforcement - Real-time permission checking

Advanced Security: Sensitivity Labels

Microsoft Purview Integration (Private Preview):

  • Document encryption - Automatic handling of encrypted documents
  • Policy enforcement - Sensitivity label policy compliance
  • Organizational protection - Document classification and handling rules
  • End-to-end security - From document creation to AI response

?? 5. Data Integration and Pipeline Innovation

Logic Apps Integration for External Data

Challenge: Enterprise data exists beyond Azure ecosystem Solution: Azure AI Search + Logic Apps partnership

OneDrive for Business Example:

  1. Azure Portal wizard - Simplified integration setup
  2. Logic Apps workflow - Automated data ingestion and processing
  3. Change tracking - Continuous synchronization with source systems
  4. Vectorization pipeline - Automatic embedding generation and indexing

Comprehensive Data Source Support

Native Azure Integration:

  • Blob Storage - Document and file processing
  • OneLake - Microsoft Fabric data lake integration
  • Azure SQL Database - Structured data indexing
  • Cosmos DB - NoSQL document processing

External Source Integration:

  • OneDrive/SharePoint - Microsoft 365 document systems
  • Third-party systems - Through Logic Apps connectors
  • Custom APIs - Extensible integration patterns

Azure Data Lake Storage Gen2 ACL Propagation

Automatic Permission Flow:

  • Source ACL detection - Native ADLS Gen2 access control lists
  • Permission propagation - Automatic transfer to search index
  • User/group mapping - Entra ID integration throughout pipeline
  • Zero-touch security - No manual permission configuration required

?? 6. Model Context Protocol and Developer Experience

MCP Integration for Agent Development

Azure MCP Server Capabilities:

  • Resource management - Azure resource groups and services
  • Service integration - SQL, Azure Monitor, Cosmos DB, Azure AI Search
  • Developer-focused - Tools for development workflows
  • Self-describing APIs - Enhanced agent interaction capabilities

Agent-Driven Development Demo

Intelligent Application Generation: Query: “Build me a Next.js app for outdoor gear using my Azure Search index”

Automated Results:

  • Schema analysis - Automatic index structure discovery
  • Data sampling - Understanding actual content patterns
  • UI generation - Complete application with faceted navigation
  • Branding decisions - Logos and visual design choices
  • Search integration - Functional search interface with filtering

Developer Productivity Gains:

  • Zero manual UI development - Complete application from description
  • Facet exploitation - Automatic use of searchable/filterable fields
  • Metadata utilization - Index descriptions driving application behavior
  • API integration - Functional search capabilities without manual coding

Technical Architecture Deep Dive

Knowledge Agent Abstraction

Up-Leveled API Design:

# Traditional Search API
search_client.search(
    query="tents",
    search_fields=["title", "description"],
    vector=embedding,
    top_k=10
)

# Knowledge Agent API  
knowledge_agent.retrieve(
    chat_history=[...],
    context="outdoor gear shopping",
    model="gpt-4o"
)

Architectural Benefits:

  • Higher abstraction - Chat history instead of field specifications
  • Automatic optimization - AI-driven query planning and execution
  • Policy integration - Built-in access control and security
  • Extensible foundation - Room for continuous innovation

Multi-Modal Processing Pipeline

Document Intelligence Integration:

PDF Input ? Layout Extraction ? Image Identification ? 
Verbalization (GPT-4o) ? Text Indexing ? Vector Generation ? 
Component Storage ? Addressable References

Processing Options:

  • Simple extraction - Basic text and image separation
  • Full AI Document Intelligence - Complete layout and structure analysis
  • Image verbalization - LLM-powered image description
  • Hybrid approaches - Combined embedding and description strategies

Security Architecture

Document-Level Access Control:

{
  "id": "doc1",
  "content": "Sensitive financial data...",
  "users": ["user1@contoso.com"],
  "groups": ["finance-team", "executives"],
  "roles": ["Storage Blob Data Reader"]
}

Query-Time Enforcement:

  • Token delegation - Application identity with user context
  • Dynamic filtering - Real-time permission checking
  • Group expansion - Automatic membership resolution
  • RBAC integration - Azure role-based access control

Performance Metrics and Validation

Agentic Retrieval Effectiveness

Complex Query Performance:

  • 40% improvement in answer relevance for difficult questions
  • 30% increase in successful result rate
  • Multi-industry validation - Finance, manufacturing, multiple sectors
  • Multi-language testing - Global deployment readiness

Query Type Analysis:

  • Simple queries - Maintained existing performance levels
  • Multi-hop questions - Significant improvement in accuracy
  • Complex scenarios - Material quality gains across all datasets
  • Groundedness - No regression in hallucination prevention

Multi-Modal Processing Benefits

Document Coverage Expansion:

  • Visual information - Previously inaccessible diagram content
  • Layout understanding - Structural relationships preserved
  • Component addressability - Individual elements citeable
  • Application-ready - Immediate integration with existing systems

Live Demonstration Results

Agentic Retrieval in Action

Complex Query Processing:

Input: "Which tent fits more people?" (with context about TrailMaster vs SkyView)
Processing:
??? Query Planning: Understand comparative capacity question
??? Context Analysis: Previous discussion about specific tent models  
??? Query Decomposition: Two separate capacity searches
??? Parallel Execution: TrailMaster capacity + SkyView capacity
??? Result Synthesis: Comparative capacity information

Output: Structured comparison with capacity details and citations

Multi-Modal Document Processing

Azure Search Documentation Analysis:

  • Input: Technical PDFs with diagrams and text
  • Processing: AI Document Intelligence + Image Verbalization
  • Output: Searchable text + addressable images + layout information
  • Application: Functional Q&A with visual grounding and citations

Security Enforcement

Access Control Validation:

Document Set:
??? Doc 1: User has direct access ? Visible
??? Doc 2: User lacks permission ? Hidden
??? Doc 3: User via group membership ? Visible

Search Results: Only documents 1 and 3 returned

Agent-Driven Development

Application Generation:

  • Input: “Build outdoor gear app with my search index”
  • Analysis: Automatic schema discovery + data sampling
  • Output: Complete Next.js application with search, facets, branding
  • Result: Functional e-commerce interface without manual development

Session Highlights

“We started with whatever elements we had in the room and we’re transitioning to a purpose-built system.” - Pablo Castro

“The reality is that if you think about how RAG was constructed at that point… it was built out of the tools available in the room.” - Pablo Castro

“What we are announcing today is what we call agentic retrieval… applying the same methods we use to create agents out there in our own search engine.” - Pablo Castro

“An interesting effect of all this super-smart retrieval systems and the copilots we build on top of them is that they’ll find everything. So it becomes super important that you have proper access control policies.” - Pablo Castro

“I’m highly incompetent in anything that has to do with UI, so I would’ve never been able to do something like this.” - Pablo Castro (on agent-generated application)


Implementation Guide

Getting Started with Agentic Retrieval

1. Knowledge Agent Creation

# Create Knowledge Agent
knowledge_agent = search_client.create_knowledge_agent(
    name="product_agent",
    data_sources=["contoso_products_index"],
    model="gpt-4o",
    policy={
        "relevance_threshold": 0.7,
        "max_queries": 5
    }
)

# Use Agent for Retrieval
response = knowledge_agent.retrieve(
    chat_history=[
        {"role": "user", "content": "What are popular tents?"},
        {"role": "assistant", "content": "TrailMaster and SkyView..."},
        {"role": "user", "content": "Which fits more people?"}
    ]
)

2. Multi-Modal Document Processing

**Portal Setup Process:**
1. Import and Vectorize Data wizard
2. Select data source (Blob Storage, OneLake, etc.)
3. Enable AI Document Intelligence
4. Configure image verbalization with GPT-4o
5. Set up embedding model for vectorization
6. Enable Knowledge Store for component addressability
7. Configure Semantic Ranker for quality improvement

3. Access Control Implementation

# Document with Access Control
document = {
    "id": "secure_doc_1",
    "content": "Confidential business information...",
    "users": ["alice@contoso.com"],
    "groups": ["finance_team", "management"],
    "roles": ["Storage Blob Data Reader"]
}

# Query with User Context
results = search_client.search(
    query="quarterly results",
    user_token=user_access_token  # Automatic permission filtering
)

Best Practices for Enterprise Deployment

Data Pipeline Optimization

  • Use AI Document Intelligence for complex multi-modal documents
  • Enable image verbalization when visual content contains critical information
  • Configure Knowledge Store for application integration and citation support
  • Set up incremental indexing for continuous data synchronization

Security Configuration

  • Map existing ACLs from source systems to search index fields
  • Use managed identity for secure service-to-service communication
  • Enable Entra ID integration for automatic group expansion
  • Test permission propagation with representative user scenarios

Performance Tuning

  • Use Semantic Ranker for improved result quality
  • Configure relevance thresholds based on application requirements
  • Monitor agentic retrieval costs vs. traditional search approaches
  • Implement caching strategies for frequently accessed content

Advanced Applications

Enterprise RAG Architecture

Comprehensive Implementation Stack:

Application Layer: Copilot/Agent Interface
??? Agentic Retrieval: Complex query processing
??? Multi-Modal Processing: Document + visual understanding  
??? Access Control: Entra ID integration
??? Data Integration: Logic Apps + native connectors
??? Foundation: Azure AI Search with semantic ranking

Development Acceleration

MCP-Enabled Workflows:

  • Automatic application generation from search index schemas
  • Schema discovery and data pattern recognition
  • UI component optimization based on facetable fields
  • Citation and reference system integration

Security and Compliance

Enterprise-Grade Features:

  • Document-level access control with dynamic group membership
  • Sensitivity label support for classified content
  • Audit trail integration for compliance reporting
  • RBAC policy enforcement across AI-powered systems

Resources and Links

Official Documentation

  • Agentic Retrieval Announcement - Complete technical overview and implementation guide
  • Agentic Retrieval Evaluation Results - Detailed performance metrics and methodology
  • Multi-Modal Updates - Document processing and AI integration features
  • What’s New in Azure AI Search - Comprehensive feature updates and roadmap

Sample Code and Demos

  • Agentic Retrieval Demo - Working implementation of knowledge agents
  • Multi-Modal Demo - Document processing with visual content
  • Azure MCP Repository - Model Context Protocol server for Azure services
  • Foundry MCP Server - AI Foundry integration examples

Security and Compliance

  • Private Preview: Sensitivity Labels - Enterprise document classification support

About the Speaker

Pablo Castro
CVP & Distinguished Engineer, AI Platform
Microsoft
Corporate Vice President leading Azure AI Search team with focus on state-of-the-art information understanding and retrieval systems. Expert in information retrieval, machine learning, distributed systems, and database systems. Co-founder of Lagash Systems (acquired by Mercado Libre).


This session reveals the transformation of enterprise AI from simple RAG implementations to sophisticated, purpose-built knowledge retrieval systems that understand context, process multi-modal content, and enforce enterprise security policies. Pablo Castro demonstrates how Azure AI Search has evolved beyond traditional search into an intelligent knowledge platform that powers the next generation of enterprise AI applications.

Unleash Developer Potential with AI and Dev Box
Azure AI Foundry: The AI App and Agent Factory
  • Edit this page
  • Report an issue