• o3 and o4-mini - they’re great, but easy to over-hype

  • Apr 16 2025
  • Length: 14 mins
  • Podcast


  • Summary

  • Critical analysis of the two most powerful new models behind ChatGPT, o3 and o4-mini. It covers not just the system cards, benchmarks, and my own tests, but also some you may not have seen before. Yes, they can whip up an amazing front-end in a few seconds, but you always have to ask what is in their data. Either way, they prove the gains from RL are just beginning…

    https://weave-docs.wandb.ai/?utm_source=sponsorship&utm_medium=simple_bench&utm_campaign=ai_explained

    AI Insiders ($9!): https://www.patreon.com/AIExplained


    Chapters:
    00:00 - o3 and o4-mini


    https://simple-bench.com/

    Plus, Teams and Pro, plus token count: https://x.com/btibor91/status/1912568994512662679

    System Card: https://openai.com/index/o3-o4-mini-system-card/

    Release Notes: https://openai.com/index/introducing-o3-and-o4-mini/

    https://deepmind.google/technologies/gemini/pro/

    https://x.com/DeryaTR_/status/1912558350794961168

    https://x.com/polynoamial/status/1912564068168450396

    API Pricing: https://openai.com/api/pricing/

    https://aider.chat/docs/leaderboards/


    Non-hype Newsletter: https://signaltonoise.beehiiv.com/


