Intelligent content that gives practitioners, innovators and leaders an inside look at the present and future of ML & AI technologies.

Podcast

Latest

Is RAG Dead? Lessons from Building AI for Tax Law

EPISODE 769 | June 9, 2026

As context windows grow into the millions of tokens, many AI practitioners are questioning whether retrieval-augmented generation (RAG) is still necessary. If modern models can ingest entire libraries of documents, why bother with retrieval at all? In this episode, Alex Bowcut, Head of Engineering at Sphere, explains why the answer depends on the application. Sphere uses AI to automate global tax compliance, an environment where getting the answer right isn’t enough: every conclusion must be backed by the correct legal citation, and every decision must withstand expert review. We explore how Sphere built TRAM (Tax Review and Assessment Model), a production AI system that combines retrieval, reasoning models, legal review workflows, reinforcement learning, and deterministic systems to help tax experts move nearly two orders of magnitude faster while maintaining accuracy. Along the way, we discuss why RAG remains critical in high-stakes domains and how Sphere processes legal and regulatory documents from jurisdictions around the world. We also cover retrieval architectures, semantic chunking, dense versus sparse retrieval, expert feedback loops, and the challenges of building AI systems that people can actually trust.

Recent

Guest: Jure Leskovec

Relational Foundation Models for Enterprise Data

EPISODE 768 | May 21, 2026

Guest: Scott Clark

How to Find the Agent Failures Your Evals Miss

EPISODE 767 | May 7, 2026

Guest: Philip Kiely

How to Engineer AI Inference Systems

EPISODE 766 | April 30, 2026

Retrieval-augmented generation promised to bring ChatGPT’s magic to enterprise data. But while organizations rushed to build chatbots, they often struggled to deliver real business value. This comprehensive guide reveals RAG’s full potential beyond conversational interfaces.

Download RAG: Beyond the Chatbot now.

MORE REPORTS

Google Cloud Next ’26: Delivering the Agentic Control Plane

Snapdragon Summit 2025: Performance Gains Meet Ecosystem Momentum

Google Cloud Next ’25: Debts Paid, Stakes Raised

ML Platforms for the Generative AI Era

Community

The TWIML Community is a global network of machine learning, deep learning and AI practitioners and enthusiasts.

We organize ongoing educational programs, including study groups for several popular ML/AI courses such as fast.ai Deep Learning, Machine Learning and NLP, Stanford CS224N, DeepLearning.AI, and more. We also host several special interest groups focused on topics like Swift for TensorFlow and competing in Kaggle competitions.


Work With Us

TWIML creates and curates intelligent content that helps makers build better experiences for their users, and gives executives an inside look at the real-world application of intelligence technologies. We also build and support communities of innovators who are as excited about these technologies as we are. We advise a variety of leading organizations as well, helping to craft strategies for taking advantage of the vast opportunities created by ML and AI.
