Intelligent content that gives practitioners, innovators and leaders an inside look at the present and future of ML & AI technologies.

Latest
Play Video
EPISODE 704  |  
October 7, 2024
Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to discuss his recent works, AI Agents That Matter and AI Snake Oil. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the AI Snake Oil book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares various use cases of failed applications of AI, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. Additionally, we also touched on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks.
Discover

Now available On Demand!  Our premier event of the year where we discuss the platforms, tools, technologies, and practices necessary to enable and scale enterprise machine learning and AI. 

TWIMLcon On Demand

EVENTS

TWIML hosts a variety of events throughout the year to educate and inspire our listeners and community members. Take a look at our upcoming events below and click the events link to find out more about past events.

TWIMLcon: AI Platforms is happening now!

This virtual conference will once again bring to light the platforms, tools, technologies, and practices necessary to enable and scale enterprise machine learning and AI.

Registration is FREE and it’s not too late to join us. Visit the TWIMLcon: AI Platforms 2022 event page to check out our exciting line up of speakers and sessions, and to register for the event.

Solutions

Our conversations with hundreds of ML/AI practitioners and teams have demonstrated that effective tools and platforms are the key to delivering ML and AI at scale—allowing teams to innovate more quickly and consistently.

The TWIML Solutions guide helps you identify technologies and solutions that can help your organization deliver models into production more quickly and efficiently.

Latest Research

Long before starting the TWIML podcast, I worked at the intersection of the two technology shifts that ultimately enabled modern artificial intelligence: cloud computing and big data. AWS was the clear leader in cloud even back then, so I jumped at the opportunity to attend the company’s first re:Invent conference way back in 2012.

Pachyderm provides the ability to modularize, orchestrate, and scale the steps of your ML pipeline within a language-agnostic platform — with the added ability to trace the lineage and versioning of both code and data.

A recent tweet from Soft Linden illustrated the importance of strong responsible AI, governance and testing frameworks for organizations deploying public-facing machine learning applications.

Following a search for “had a seizure now what”, the tweet showed that Google’s “featured snippet” highlighted actions that a University of Utah healthcare site explicitly advised readers NOT to take.

We’re proud to announce the new TWIML Solutions Guide, a directory of machine learning tools and platform technologies for data scientists, ML engineers and other AI practitioners and leaders. The Guide aims to help them explore and compare open source and commercial offerings for building, delivering, and improving their ML and AI projects. This post explains why we think the guide is important and highlights some of its key features.

In order to help enterprise machine learning, data science, and AI innovators understand how model-driven enterprises are successfully scaling machine learning, we have conducted numerous interviews on the topic.

In this post, we present three representative ML platforms: Airbnb’s Bighead, Facebook’s FBLearner, and LinkedIn’s Pro-ML. Each of these platforms was developed in response to the unique situation, challenges, and considerations faced by its creator.

Explore Solutions

Build better models faster by using state-of-the-art hyperparameter optimization and supervised early stopping tools. Focus on adding business value to your data pipeline while Comet automates the rest.

Dataiku Data Science Studio is the collaborative data science software platform for teams of data scientists, data analysts, and engineers to explore, prototype, build, and deliver their own data products more efficiently.

Run:ai Atlas is a compute orchestration platform that speeds up data science initiatives by pooling all available GPU resources and then dynamically allocating resources as you need them. One-click execution of experiments, no code changes required by the user, and most importantly, no more waiting around to access GPUs. Atlas automates provisioning of multiple GPU or fractions of GPU across teams, users, clusters and nodes, and IT gains control and visibility over the full AI infrastructure stack through comprehensive, easy-to-use dashboards.

SigOpt is a model development platform that makes it easy to track runs, visualize training, and scale hyperparameter optimization for any type of model built with any library on any infrastructure

Introducing the first enterprise-ready feature store for machine learning. Built by the creators of Uber Michelangelo, Tecton provides the first enterprise-ready feature store that manages the complete lifecycle of features — from engineering new features to serving them online for real-time predictions.

Experiment tracking, Datasetset tracking, Dataset visualization

Community

The TWIML Community is a global network of machine learning, deep learning and AI practitioners and enthusiasts.

We organize ongoing educational programs including study groups for several popular ML/AI courses such as Fast.ai Deep Learning, Machine learning and NLP, Stanford CS224N, Deeplearning.ai and more. We also host several special interest groups focused on topics like Swift for Tensorflow, and competing in Kaggle competitions.

TWIML Community

Work with Us

TWIML creates and curates intelligent content that helps makers build better experiences for their users, and gives executives an inside look at the real-world application of intelligence technologies. We also build and support communities of innovators who are as excited about these technologies as we are. We advise a variety of leading organizations as well, helping to craft strategies for taking advantage of the vast opportunities created by ML and AI.