Stable Diffusion and LLMs at the Edge with Jilei Hou
EPISODE 633
|
JUNE
12,
2023
Watch
Follow
Share
About this Episode
Today we’re joined by Jilei Hou, a VP of Engineering at Qualcomm Technologies. In our conversation with Jilei, we focus on the emergence of generative AI, and how they've worked towards providing these models for use on edge devices. We explore how the distribution of models on devices can help amortize large models' costs while improving reliability and performance and the challenges of running machine learning workloads on devices, including model size and inference latency. Finally, Jilei we explore how these emerging technologies fit into the existing AI Model Efficiency Toolkit (AIMET) framework.
About the Guest
Jilei Hou
Qualcomm
Thanks to our sponsor Qualcomm AI Research
Qualcomm AI Research is dedicated to advancing AI to make its core capabilities — perception, reasoning, and action — ubiquitous across devices. Their work makes it possible for billions of users around the world to have AI-enhanced experiences on devices powered by Qualcomm Technologies. To learn more about what Qualcomm Technologies is up to on the research front, visit twimlai.com/qualcomm.
Resources
- Blog: World’s first on-device demonstration of Stable Diffusion on an Android phone | Qualcomm
- Paper: Up or Down? Adaptive Rounding for Post-Training Quantization
- Kahneman, D - Thinking, Fast and Slow
- Consciousness and COVID-19 with Yoshua Bengio - #361
- Full-Stack AI Systems Development with Murali Akula - #563
