More Language, Less Labeling with Kate Saenko
EPISODE 580
|
JUNE
27,
2022
Watch
Follow
Share
About this Episode
Today we continue our CVPR series joined by Kate Saenko, an associate professor at Boston University and a consulting professor for the MIT-IBM Watson AI Lab. In our conversation with Kate, we explore her research in multimodal learning, which she spoke about at the Multimodal Learning and Applications Workshop, one of a whopping 6 workshops she spoke at. We discuss the emergence of multimodal learning, the current research frontier, and Kate’s thoughts on the inherent bias in LLMs and how to deal with it. We also talk through some of the challenges that come up when building out applications, including the cost of labeling, and some of the methods she’s had success with. Finally, we discuss Kate’s perspective on the monopolizing of compute resources for “foundational” models, and her paper Unsupervised Domain Generalization by learning a Bridge Across Domains.
About the Guest
Kate Saenko
Boston University
Resources
- Paper: Unsupervised Domain Generalization by learning a Bridge Across Domains
- Paper: ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes
- Paper: Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
- Paper: Many-to-many Splatting for Efficient Video Frame Interpolation
- Paper: MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision
- Multimodal Learning and Applications Workshop
- Workshop On Media Forensics
