AI Explained Official Podcast

By: Philip - Host of AI Explained YT
  • Summary

  • Covering the biggest news of the century - the arrival of smarter-than-human AI. From the author of Simple Bench, which reveals the remaining gap between LLM and human reasoning. Hype-free, and the British accent is a freebie bonus.

    © 2025 AI Explained Official Podcast
    Show More Show Less
activate_mytile_page_redirect_t1
Episodes
  • o3 breaks (some) records, but AI becomes pay-to-win
    Apr 25 2025

    A green card, o3 vs Gemini 2.5, 6 Benchmarks and a whole bunch of my thoughts on what on earth is happening in AI, from here to 2030. Plus, how AI is becoming pay-to-win, and why. Crazy times, 14 mins probably wasn’t enough.

    https://app.grayswan.ai/ai-explained

    AI Insiders ($9!): https://www.patreon.com/AIExplained

    Chapters:
    00:00 - Introduction
    00:33 - FictionLiveBench
    01:37 - PHYBench
    02:14 - SimpleBench
    02:54 - Virology Capabilities Test
    03:13 - Mathematics Performance
    04:29 - Vision Benchmarks
    05:43 - V* and how o3 works
    06:44 - Revenue and costs for you
    08:54 - Expensive RL and trade-offs
    09:40 - How to spend the OOMs
    13:27 - Gray Swan Arena

    Green Card: https://techcrunch.com/2025/04/25/an-openai-researcher-who-worked-on-gpt-4-5-had-their-green-card-denied/
    PHYBench: https://arxiv.org/pdf/2504.16074Virologytest: https://www.virologytest.ai/
    How o3 Vision Works: https://arxiv.org/pdf/2312.14135 https://x.com/sainingxie/status/1912570624523829573
    Visual puzzles: https://neulab.github.io/VisualPuzzles/
    Fiction Bench: https://x.com/ficlive/status/1912863028141244850
    https://geobench.org/
    https://simple-bench.com/
    AIME 2025: https://openai.com/index/introducing-o3-and-o4-mini/
    USAMO: https://x.com/mbalunovic/status/1914398518896193747
    NaturalBench: https://linzhiqiu.github.io/papers/naturalbench/
    Where’s Waldo: https://uk.pinterest.com/pin/492792384225896298/
    IMO and AlphaProof:https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/
    Crazy Revenue: https://www.theinformation.com/articles/openai-forecasts-revenue-topping-125-billion-2029-agents-new-products-gain?rc=sy0ihq
    Number of Users: https://www.theinformation.com/briefings/googles-gemini-user-numbers-revealed-court?rc=sy0ihq
    Subscriptions pay to win: https://www.forbes.com/sites/paulmonckton/2025/04/23/google-leak-reveals-new-gemini-ai-subscription-levels/
    GPU Trade-offs: https://x.com/sama/status/1915098951067554030
    RL Scale-up Amodei: https://www.darioamodei.com/post/on-deepseek-and-export-controls
    Log-linear Returns: https://x.com/bobmcgrewai/status/1895228291981943265
    2030 Scaling: https://epoch.ai/blog/can-ai-scaling-continue-through-2030
    Model Size: https://x.com/slow_developer/status/1874554473256997201
    Adam on AGI: https://x.com/TheRealAdamG/status/1913998366632968381
    Papers on Patreon: https://arxiv.org/pdf/2502.01839
    https://arxiv.org/pdf/2504.13837
    Chollet Quote: https://x.com/fchollet/status/1912934762580447447
    OpenSim: https://opensim.stanford.edu/


    Non-hype Newsletter: https://signaltonoise.beehiiv.com/

    Show More Show Less
    15 mins
  • o3 and o4-mini - they’re great, but easy to over-hype
    Apr 16 2025

    Critical analysis of the two most powerful new models behind ChatGPT, o3 and o4-mini. Not just the system cards, benchmarks, and my own tests, but some you may not have seen before. Yes, they can whip up amazing front-end in a few seconds, but you always have to ask what is in their data. Either way, they prove the gains from RL are just beginning…

    https://weave-docs.wandb.ai/?utm_source=sponsorship&utm_medium=simple_bench&utm_campaign=ai_explained

    AI Insiders ($9!): https://www.patreon.com/AIExplained


    Chapters:
    00:00 - o3 and o4-mini


    https://simple-bench.com/

    Plus, Teams and Pro, plus token count: https://x.com/btibor91/status/1912568994512662679

    System Card: https://openai.com/index/o3-o4-mini-system-card/

    Release Notes: https://openai.com/index/introducing-o3-and-o4-mini/

    https://deepmind.google/technologies/gemini/pro/

    https://x.com/DeryaTR_/status/1912558350794961168

    https://x.com/polynoamial/status/1912564068168450396

    API Pricing:https://openai.com/api/pricing/

    https://aider.chat/docs/leaderboards/


    Non-hype Newsletter: https://signaltonoise.beehiiv.com/

    Show More Show Less
    14 mins
  • ‘Speaking Dolphin’ to AI Data Dominance, 4.1 + Kling 2: 7 Developments Critically Analysed
    Apr 16 2025

    This pod won’t just be about the release of GPT 4.1 in the last 48 hours, o3 build-up, Kling 2.0, a sneak-peak at the next OpenAI model, or even the new Dolphin language tool. It will be about 7 such stories that contextualise where we are in AI and what is happening.

    https://www.emergentmind.com/


    Chapters:

    00:00 - Introduction

    00:30 - Kling 2.0

    01:35 - GPT 4.1

    05:25 - o3 Build-up

    07:37 - ‘Product Company’

    09:31 - Safe Superintelligence

    10:54 - DolphinGemma

    13:16 - Data Dominance?


    Kling 2.0: https://app.klingai.com/global/release-notes


    Dolphin Gemma: https://blog.google/technology/ai/dolphingemma/?s=09


    https://openai.com/index/gpt-4-1/


    OpenAI o3 Build-up The Information: https://www.theinformation.com/articles/openais-latest-breakthrough-ai-comes-new-ideas?rc=sy0ihq


    Physical reasoning: https://x.com/a_karvonen/status/1911839968990814503


    Fiction Live.bench: https://x.com/ficlive/status/1911853409847906626


    Altman Ted: https://www.youtube.com/watch?v=5MWT_doo68k


    https://simple-bench.com/try-yourself


    https://aider.chat/docs/leaderboards/


    4.5: https://www.youtube.com/watch?v=6nJZopACRuQ


    Geospatial reasoning: https://research.google/blog/geospatial-reasoning-unlocking-insights-with-generative-ai-and-multiple-foundation-models/


    Pioneers: https://x.com/OpenAIDevs/status/1910017976256119151

    Evals: https://www.youtube.com/watch?v=scsW6_2SPC4

    Anthropic Updates: https://www.bloomberg.com/news/articles/2025-04-15/anthropic-is-readying-a-voice-assistant-feature-to-rival-openai?srnd=phx-ai

    https://x.com/sethsaler/status/1912188383457059301


    https://techcrunch.com/2025/04/12/openai-co-founder-ilya-sutskevers-safe-superintelligence-reportedly-valued-at-32b/

    https://ai.meta.com/blog/llama-4-multimodal-intelligence/

    https://deepmind.google/technologies/gemini/pro/

    https://research.google/blog/accelerating-scientific-breakthroughs-with-an-ai-co-scientist/

    https://blog.google/products/google-cloud/ironwood-tpu-age-of-inference/

    OpenAI Documentary: https://www.patreon.com/posts/one-machine-to-121940490

    Show More Show Less
    20 mins

What listeners say about AI Explained Official Podcast

Average Customer Ratings

Reviews - Please select the tabs below to change the source of reviews.

In the spirit of reconciliation, Audible acknowledges the Traditional Custodians of country throughout Australia and their connections to land, sea and community. We pay our respect to their elders past and present and extend that respect to all Aboriginal and Torres Strait Islander peoples today.