Intelligent content that gives practitioners, innovators and leaders an inside look at the present and future of ML & AI technologies.

LATEST
EPISODE 753 | October 28, 2025
In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the technical challenges of deploying these models, which are powerful but computationally expensive due to their iterative sampling process. Hung details his team's work on SwiftBrush and SwiftEdit, which enable high-quality text-to-image generation and editing in a single inference step. He explains their novel distillation framework, where a multi-step teacher model guides the training of an efficient, single-step student model. We explore the architecture and training, including the use of a secondary 'coach' network that aligns the student's denoising function with the teacher's, allowing the model to bypass the iterative process entirely. Finally, we discuss how these efficiency breakthroughs pave the way for personalized on-device agents and the challenges of running reasoning models with techniques like inference-time scaling under a fixed compute budget.
RECENT
EPISODE 752 | October 21, 2025
EPISODE 751 | October 14, 2025
EPISODE 750 | October 7, 2025

INSIGHTS

LATEST REPORT

Retrieval-augmented generation promised to bring ChatGPT’s magic to enterprise data. But while organizations rushed to build chatbots, they often struggled to deliver real business value. This comprehensive guide reveals RAG’s full potential beyond conversational interfaces.

Community

The TWIML Community is a global network of machine learning, deep learning and AI practitioners and enthusiasts.

We organize ongoing educational programs, including study groups for several popular ML/AI courses such as Fast.ai Deep Learning, Machine Learning and NLP, Stanford CS224N, Deeplearning.ai, and more. We also host several special interest groups focused on topics like Swift for TensorFlow and competing in Kaggle competitions.


Work with Us

TWIML creates and curates intelligent content that helps makers build better experiences for their users, and gives executives an inside look at the real-world application of intelligence technologies. We also build and support communities of innovators who are as excited about these technologies as we are. We advise a variety of leading organizations as well, helping to craft strategies for taking advantage of the vast opportunities created by ML and AI.