The TWIML AI Podcast with Sam Charrington

LATEST

Play Video

Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain

EPISODE 694 |

July 23, 2024

Today, we're joined by Hamel Husain, founder of Parlance Labs, to discuss the ins and outs of building real-world products using large language models (LLMs). We kick things off discussing novel applications of LLMs and how to think about modern AI user experiences. We then dig into the key challenge faced by LLM developers—how to iterate from a snazzy demo or proof-of-concept to a working LLM-based application. We discuss the pros, cons, and role of fine-tuning LLMs and dig into when to use this technique. We cover the fine-tuning process, common pitfalls in evaluation—such as relying too heavily on generic tools and missing the nuances of specific use cases, open-source LLM fine-tuning tools like Axolotl, the use of LoRA adapters, and more. Hamel also shares insights on model optimization and inference frameworks and how developers should approach these tools. Finally, we dig into how to use systematic evaluation techniques to guide the improvement of your LLM application, the importance of data generation and curation, and the parallels to traditional software engineering practices.

RECENT

TWIML_COVER_800x800

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI

EPISODE 693 |

July 16, 2024

TWIML_COVER_800x800

Decoding Animal Behavior to Train Robots with EgoPet

EPISODE 692 |

July 9, 2024

TWIML_COVER_800x800

How Microsoft Scales Testing and Safety for Generative AI

EPISODE 691 |

July 1, 2024

FOLLOW

Join our list for notifications and early access to events

Episodes Series Topics Playlists

694

TWIML_COVER_800x800

Building Real-World LLM Products with Fine-Tuning and More

693

TWIML_COVER_800x800

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI

692

TWIML_COVER_800x800

Decoding Animal Behavior to Train Robots with EgoPet

691

TWIML_COVER_800x800

How Microsoft Scales Testing and Safety for Generative AI

690

TWIML_COVER_800x800

Long Context Language Models and their Biological Applications

689

TWIML_COVER_800x800

Accelerating Sustainability with AI

688

TWIML_COVER_800x800

Gen AI at the Edge: Qualcomm AI Research at CVPR 2024

687

TWIML_COVER_800x800

Energy Star Ratings for AI Models

686

TWIML_COVER_800x800

Language Understanding and LLMs

685

twiml-abdul-fatir-ansari-chronos-learning-the-language-of-time-series-sq

Chronos: Learning the Language of Time Series

684

twiml-joel-hestness-powering-ai-with-the-worlds-largest-computer-chip-sq

Powering AI with the World’s Largest Computer Chip

683

twiml-laurent-boinot-AI-for-power-and-energy-sq

AI for Power & Energy

682

twiml-controlling-aza-jalalvand-fusion-reactor-instability-with-deep-reinforcement-learning-sq

Controlling Fusion Reactor Instability with Deep Reinforcement Learning

681

twiml-kirk-marple-graphrag-knowledge-graph-for-ai-applications-sq

GraphRAG: Knowledge Graphs for AI Applications

680

twiml-Alex-Havrilla-teaching-llms-to-reason-with-rl-sq

Teaching Large Language Models to Reason with Reinforcement Learning

679

twiml-peter-hase-localizing-and-editing-knowledge-in-llms-sq

Localizing and Editing Knowledge in LLMs

678

twiml-jonas-geiping-coercing-llms-to-do-and-reveal-almost-anything-sq

Coercing LLMs to Do and Reveal (Almost) Anything

677

twiml-mido-assran-v-jepa-ai-reasoning-non-generative-architecture-sq

V-JEPA, AI Reasoning from a Non-Generative Architecture

676

twiml-sherry-yang-video-as-a-universal-interface-for-ai-reasoning-sq

Video as a Universal Interface for AI Reasoning

675

twiml-sayash-kapoor-assessing-the-risks-of-open-ai-models-sq

Assessing the Risks of Open AI Models

TWIML_AI_Trends_2024_800x800_B

TWIML_NeurIPS_2023_800x800_B

twiml_aws2023_800x800_A

AWS re:Invent 2023

TWIML_ICML2023_800x800_A

twiml-cvpr-2023-sq-a

twiml-iclr-2023-sq-a

TWIML_AI_Trends_2023_2023_800x800_B

TWIML_NeurIPS_2022_800x800_B

TWIML_AWS_reInvent_2022_800X800_B_200P

AWS re:Invent 2022

Cover: TWIML Presents: AWS re:Mars 2022

AWS re:Mars 2022

Cover: TWIML Presents: ICML 2022

Cover: TWIML Presents: CVPR 2022

Cover: TWIML Presents: Data-Centric AI

Data-Centric AI

twiml-iclr-2022-sq-a

Cover: TWIML Presents: AI Rewind 2021

Cover: TWIML Presents: NeurIPS 2021

Cover: TWIML Presents: AWS re:Invent 2021

AWS re:Invent 2021

Cover: TWIML Presents: SigOpt AI & HPC Summit

SigOpt AI & HPC Summit

Cover: TWIML Presents: ICML 2021

TWIML Presents: AI in the Natural Sciences

AI in the Natural Sciences

TWIML Presents: AI Infrastructure & MLOps

AI Infrastructure & MLOps

TWIML Presents: Applied AI

TWIML_Topic_Icons_800x800_19

Artificial General Intelligence (AGI)

TWIML Presents: Autonomous Vehicles

Autonomous Vehicles

TWIML Presents: Causality

TWIML Presents: Computer Vision

Computer Vision

TWIML Presents: Data Centric AI

Data-Centric AI

generativeai_800x800

TWIML Presents: Healthcare

llms_800x800

Large Language Models (LLMs)

TWIML Presents: Neuroscience

TWIML Presents: NLP

TWIML Presents: Privacy & Security

Privacy & Security

TWIML Presents: Quantum Machine Learning

Quantum Machine Learning

TWIML Presents: Reinforcement Learning

Reinforcement Learning

TWIML Presents: Responsible AI

TWIML Presents: Robotics

TWIML Presents: Trends

Cover: TWIML Presents: Celebrating 500 Episodes

Celebrating 500 Episodes

Cover: TWIML Presents: Celebrating 500 Episodes

Geometry in Machine Learning

Cover: TWIML Presents: Programming Intelligence

Programming Intelligence

Cover: TWIML Presents: #AmplifyBlackSTEM

#AmplifyBlackSTEM

Cover: TWIML Presents: Fairness, Bias, and Ethics in AI

Fairness, Bias, and Ethics in AI

New to Podcast

© 2021 CloudPulse Strategies / All rights reserved