Building Voice AI Agents That Don’t Suck with Kwindla Kramer

EPISODE 739

JULY 15, 2025

Watch

Facebook

About this Episode

In this episode, Kwindla Kramer, co-founder and CEO of Daily and creator of the open source Pipecat framework, joins us to discuss the architecture and challenges of building real-time, production-ready conversational voice AI. Kwin breaks down the full stack for voice agents—from the models and APIs to the critical orchestration layer that manages the complexities of multi-turn conversations. We explore why many production systems favor a modular, multi-model approach over the end-to-end models demonstrated by large AI labs, and how this impacts everything from latency and cost to observability and evaluation. Kwin also digs into the core challenges of interruption handling, turn-taking, and creating truly natural conversational dynamics, and how to overcome them. We discuss use cases, thoughts on where the technology is headed, the move toward hybrid edge-cloud pipelines, and the exciting future of real-time video avatars, and much more.

About the Guest

Kwindla Kramer

Daily; Pipecat

Connect with Kwindla

Building Voice AI Agents That Don’t Suck with Kwindla Kramer

About this Episode

About the Guest

Kwindla Kramer

Resources

Related Topics

Building Voice AI Agents That Don’t Suck with Kwindla Kramer

About this Episode

About the Guest

Kwindla Kramer

Resources

Related Topics

Related Episodes