Today we’re joined by Cha Zhang, a Partner Engineering Manager at Microsoft Cloud & AI.
Subscribe: iTunes / Google Play / Spotify / RSS
Cha’s work at MSFT is focused on exploring ways that new technologies can be applied to optical character recognition, or OCR, pushing the boundaries of what has been seen as an otherwise ‘solved’ problem. In our conversation with Cha, we explore some of the traditional challenges of doing OCR in the wild, and what are the ways in which deep learning algorithms are being applied to transform these solutions.
We also discuss the difficulties of using an end to end pipeline for OCR work, if there is a semi-supervised framing that could be used for OCR, the role of techniques like neural architecture search, how advances in NLP could influence the advancement of OCR problems, and much more.
Thanks to our Sponsor!
Thanks to Microsoft for their support for the show, and their sponsorship of this series of episodes highlighting just a few of the fundamental innovations behind Azure Cognitive Services! Cognitive Services is a portfolio of domain-specific capabilities that brings AI within reach of every developer—without requiring machine-learning expertise. All it takes is an API call to embed the ability to see, hear, speak, search, understand, and accelerate decision-making into your apps. Visit aka.ms/cognitive to learn how customers like Volkswagen, Uber, and the BBC have used Azure Cognitive Services to embed services like real-time translation, facial recognition, and natural language understanding to create robust and intelligent user experiences in their apps. While you’re there, you can take advantage of the $200 credit to start building your own intelligent applications when you open an Azure Free Account.
Connect with Cha!
Resources
- Microsoft Form Recognizer
- Paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Paper: LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Join Forces!
- Join us at TWIMLfest: A Virtual AI Festival!
- Check out our TWIML Presents: series page!
- Register for the TWIML Newsletter
- Check out the official TWIMLcon:AI Platform video packages here!
- Download our latest eBook, The Definitive Guide to AI Platforms!
“More On That Later” by Lee Rosevere licensed under CC By 4.0
Leave a Reply