Session

Model Quality as a Reliability Problem

Perspective

Model quality and performance is typically thought of as the model developer’s problem rather than a concern of ML production engineers. But changes in model quality represent the only truly end-to-end test of ML infrastructure, reliably identifying subtle problems in feature storage, metadata, model configuration, training and serving. ML production engineers generally avoid directly measuring or responding to changes in model quality as an operational or reliability concern, but that needs to change. In this talk, I elaborate on and support this perspective based on our experiences at Google, explore some of the technical and cultural transformations we’ve had to make in order to close the model quality loop, and share some of the results our teams have seen through this work.

Session Speakers

Todd Underwood

Senior Director, ML SRE

Google

Oops, please Login or Create Account to view On Demand

The good news is that it's both easy and free to register and get access.

Account Login

Create Account

First Name(Required)

Last Name(Required)

Email(Required)

Company/Organization

Job Title/Role

Password

Enter Password Confirm Password

Newsletter Consent(Required)

I would like to receive TWIML updates via email

Terms and Privacy Consent

I agree to the TWIML Terms and Privacy Policy.

This field is hidden when viewing the form

conftag

Comments

This field is for validation purposes and should be left unchanged.

4140137313