Episodes

  • Why ML Needs a New Programming Language with Chris Lattner
    Sep 3 2025

    Chris Lattner is the creator of LLVM and led the development of the Swift language at Apple. With Mojo, he’s taking another big swing: How do you make the process of getting the full power out of modern GPUs productive and fun? In this episode, Ron and Chris discuss how to design a language that’s easy to use while still providing the level of control required to write state of the art kernels. A key idea is to ask programmers to fully reckon with the details of the hardware, but making that work manageable and shareable via a form of type-safe metaprogramming. The aim is to support both specialization to the computation in question as well as to the hardware platform. “Somebody has to do this work,” Chris says, “if we ever want to get to an ecosystem where one vendor doesn’t control everything.”

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • Democratizing AI compute (an 11-part series)
    • Modular AI
    • Mojo
    • MLIR
    • Swift
    Show More Show Less
    1 hr and 13 mins
  • The Thermodynamics of Trading with Daniel Pontecorvo
    Jul 25 2025

    Daniel Pontecorvo runs the “physical engineering” team at Jane Street. This group blends architecture, mechanical engineering, electrical engineering, and construction management to build functional physical spaces. In this episode, Ron and Dan go deep on the challenge of heat exchange in a datacenter, especially in the face of increasingly dense power demands—and the analogous problem of keeping traders cool at their desks. Along the way they discuss the way ML is changing the physical constraints of computing; the benefits of having physical engineering expertise in-house; the importance of monitoring; and whether you really need Apollo-style CO2 scrubbers to ensure your office gets fresh air.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • ASHRAE (American Society of Heating, Refrigerating and Air-Conditioning Engineers)
    • Some research on CO2’s effects on human performance, which motivated us to look into CO2 Scrubbers
    • The Open Compute Project
    • Rail-Optimized and Rail-only network topologies.
    • Immersion cooling, where you submerge a machine in a dielectric fluid!
    Show More Show Less
    59 mins
  • Building Tools for Traders with Ian Henry
    May 28 2025

    Ian Henry started his career at Warby Parker and Trello, building consumer apps for millions of users. Now he writes high-performance tools for a small set of experts on Jane Street’s options desk. In this episode, Ron and Ian explore what it’s like writing code at a company that has been “on its own parallel universe software adventure for the last twenty years.” Along the way, they go on a tour of Ian’s whimsical and sophisticated side projects—like Bauble, a playground for rendering trippy 3D shapes using signed distance functions—that have gone on to inform his work: writing typesafe frontend code for users who measure time in microseconds and prefer their UIs to be “six pixels high.”

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • Bauble studio
    • Janet for Mortals, by Ian Henry
    • What if writing tests was a joyful experience?
    Show More Show Less
    1 hr and 20 mins
  • Finding Signal in the Noise with In Young Cho
    Mar 12 2025

    In Young Cho thought she was going to be a doctor but fell into a trading internship at Jane Street. Now she helps lead the research group’s efforts in machine learning. In this episode, In Young and Ron touch on the porous boundaries between trading, research, and software engineering, which require different sensibilities but are often blended in a single person. They discuss the tension between flexible research tools and robust production systems; the challenges of ML in a low-data, high-noise environment subject to frequent regime changes; and the shift from simple linear models to deep neural networks.

    You can find the transcript for this episode on our website.

    Show More Show Less
    1 hr
  • The Uncertain Art of Accelerating ML Models with Sylvain Gugger
    Oct 14 2024

    Sylvain Gugger is a former math teacher who fell into machine learning via a MOOC and became an expert in the low-level performance details of neural networks. He’s now on the ML infrastructure team at Jane Street, where he helps traders speed up their models. In this episode, Sylvain and Ron go deep on learning rate schedules; the subtle performance bugs PyTorch lets you write; how to keep a hungry GPU well-fed; and lots more, including the foremost importance of reproducibility in training runs. They also discuss some of the unique challenges of doing ML in the world of trading, like the unusual size and shape of market data and the need to do inference at shockingly low latencies.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • “Practical Deep Learning for Coders,” a FastAI MOOC by Jeremy Howard, and the book, of which Sylvain is a co-author.
    • The Stanford DAWNBench competition that Sylvain participated in.
    • HuggingFace, and the Accelerate library that Sylvain wrote there.
    • Some of the languages/systems for expression ML models that were discussed: PyTorch, TensorFlow, Jax, Mojo, and Triton
    • CUDA graphs and streams
    • Hogwild concurrency
    Show More Show Less
    1 hr and 6 mins
  • Solving Puzzles in Production with Liora Friedberg
    Oct 7 2024

    Liora Friedberg is a Production Engineer at Jane Street with a background in economics and computer science. In this episode, Liora and Ron discuss how production engineering blends high-stakes puzzle solving with thoughtful software engineering, as the people doing support build tools to make that support less necessary. They also discuss how Jane Street uses both tabletop simulation and hands-on exercises to train Production Engineers; what skills effective Production Engineers have in common; and how to create a culture where people aren’t blamed for making costly mistakes.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • More about production engineering at Jane Street, including how to apply.
    • Notes on Site reliability engineering in the wider world.
    • Alarm fatigue and desensitization.
    • Jane Street’s 1950’s era serialization-format of choice,
    • Some games that Streeters have used for training people to respond to incidents.
    Show More Show Less
    54 mins
  • From the Lab to the Trading Floor with Erin Murphy
    Jul 12 2024

    Erin Murphy is Jane Street’s first UX designer, and before that, she worked at NASA’s Jet Propulsion Laboratory building user interfaces for space missions. She’s also an illustrator with her own quarterly journal. In this episode, Erin and Ron discuss the challenge of doing user-centered design in an organization where experts are used to building tools for themselves. How do you bring a command-line interface to the web without making it worse for power users? They also discuss how beauty in design is more about utility than aesthetics; what Jane Street looks for in UX candidates; and how to help engineers discover what their users really want.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • Erin’s website that shows off her work.
    • Her quarterly journal of sketches and observations.
    • An article about Erin’s design work with NASA JPL.
    • A paper that among other things talks about the user study work that Erin did at JPL.
    • Jane Street’s current UX job opening.
    Show More Show Less
    1 hr and 4 mins
  • Performance Engineering on Hard Mode with Andrew Hunter
    Nov 28 2023

    Andrew Hunter makes code really, really fast. Before joining Jane Street, he worked for seven years at Google on multithreaded architecture, and was a tech lead for tcmalloc, Google’s world-class scalable malloc implementation. In this episode, Andrew and Ron discuss how, paradoxically, it can be easier to optimize systems at hyperscale because of the impact that even miniscule changes can have. Finding performance wins in trading systems—which operate at a smaller scale, but which have bursty, low-latency workloads—is often trickier. Andrew explains how he approaches the problem, including his favorite profiling techniques and tools for visualizing traces; the unique challenges of optimizing OCaml versus C++; and when you should and shouldn’t care about nanoseconds. They also touch on the joys of musical theater, and how to pass an interview when you’re sleep-deprived.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • “Profiling a warehouse-scale computer”
    • Magic-trace
    • OODA loop
    Show More Show Less
    56 mins