RAG Risks: Why Retrieval-Augmented LLMs Are Not Safer with Sebastian Gehrmann
EPISODE 732 | MAY 21, 2025
About this Episode
Today, we're joined by Sebastian Gehrmann, head of responsible AI in the Office of the CTO at Bloomberg, to discuss AI safety in retrieval-augmented generation (RAG) systems and generative AI in high-stakes domains like financial services. We explore how RAG, contrary to some expectations, can inadvertently degrade model safety. We cover examples of unsafe outputs that can emerge from these systems, different approaches to evaluating these safety risks, and the potential reasons behind this counterintuitive behavior. Shifting to the application of generative AI in financial services, Sebastian outlines a domain-specific safety taxonomy designed for the industry's unique needs. We also explore the critical role of governance and regulatory frameworks in addressing these concerns, how prompt engineering can bolster safety, Bloomberg’s multi-layered mitigation strategies, and areas where further work is needed to improve AI safety within specialized domains.
About the Guest
Sebastian Gehrmann
Bloomberg
Resources
- RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
- Understanding and Mitigating Risks of Generative AI in Financial Services
- Bloomberg AI Researchers Mitigate Risks of “Unsafe” RAG LLMs and GenAI in Finance
- Bloomberg’s Responsible AI Research: Mitigating Risky RAGs & GenAI in Finance
- Bloomberg
- Bloomberg Terminal
- Bloomberg Launches AI-Powered Earnings Call Summaries
- Bloomberg Accelerates Financial Analysis with Gen AI Document Insights
- Bloomberg Launches Gen AI Summarization for News Content
- OWASP Top 10 for Large Language Model Applications
- Introducing v0.5 of the AI Safety Benchmark from MLCommons
- Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
- ShieldGemma: Generative AI Content Moderation Based on Gemma
- BloombergGPT – an LLM for Finance with David Rosenberg - #639
- Information Extraction from Natural Document Formats with David Rosenberg - #126

