New Talk: Building Olmo 3 Think cover art

New Talk: Building Olmo 3 Think

New Talk: Building Olmo 3 Think

Listen for free

View show details

About this listen

It’s finally here! The public (and most complete) version of my talk covering every stage of the process to build Olmo 3 Think (slides are available). I’ve been giving this, improving it, and getting great feedback at other venues such as The Conference on Language Modeling (COLM) & The PyTorch Conference.This involves changes and new considerations of every angle of the stack, from pretraining, evaluation, and of course post-training.

Most of the talk focuses on reinforcement learning infrastructure and evaluating reasoning models, with quick comments on every training stage. I hope you enjoy it, and let us know what to improve in the future!

Chapters

* 00:00:00 Introduction

* 00:06:30 Pretraining Architecture

* 00:09:25 Midtraining Data

* 00:11:08 Long-context Necessity

* 00:13:04 Building SFT Data

* 00:20:05 Reasoning DPO Surprises

* 00:24:47 Scaling RL

* 00:41:05 Evaluation Overview

* 00:48:50 Evaluation Reflections

* 01:00:25 Conclusions

Here’s the YouTube link:



This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.interconnects.ai/subscribe
No reviews yet
In the spirit of reconciliation, Audible acknowledges the Traditional Custodians of country throughout Australia and their connections to land, sea and community. We pay our respect to their elders past and present and extend that respect to all Aboriginal and Torres Strait Islander peoples today.