Building AI Voice Agents with Scott Stephenson
EPISODE 707
|
OCTOBER
28,
2024
Watch
Follow
Share
About this Episode
Today, we're joined by Scott Stephenson, co-founder and CEO of Deepgram to discuss voice AI agents. We explore the importance of perception, understanding, and interaction and how these key components work together in building intelligent AI voice agents. We discuss the role of multimodal LLMs as well as speech-to-text and text-to-speech models in building AI voice agents, and dig into the benefits and limitations of text-based approaches to voice interactions. We dig into what’s required to deliver real-time voice interactions and the promise of closed-loop, continuously improving, federated learning agents. Finally, Scott shares practical applications of AI voice agents at Deepgram and provides an overview of their newly released agent toolkit.
About the Guest
Scott Stephenson
Deepgram
Resources
- Deepgram's Groundbreaking Voice Agent API Brings AI to Life
- Deepgram's Voice Agent API
- Case Study: NASA uses Deepgram to power the next generation of space tech
- https://deepgram.com/
- Wav2vec: State-of-the-art speech recognition through self-supervision
- From Particle Physics to Audio AI with Scott Stephenson - #19
