Exploring Multimodal AI: Why Google’s Gemini and OpenAI’s GPT-4o Chose This Path | ChatCAT and the Future of Interspecies Communication | Episode 23 cover art

Exploring Multimodal AI: Why Google’s Gemini and OpenAI’s GPT-4o Chose This Path | ChatCAT and the Future of Interspecies Communication | Episode 23

Exploring Multimodal AI: Why Google’s Gemini and OpenAI’s GPT-4o Chose This Path | ChatCAT and the Future of Interspecies Communication | Episode 23

Listen for free

View show details

About this listen

The recent spring updates and demos by both Google (Gemini) and OpenAI (GPT-4o) feature prominently their multimodal capabilities. In this episode, we discuss the advantages of multimodal AI versus models focused on specific modalities such as language. Via the example of chatCAT, a hypothetical AI that helps owners understand their cats, we explore multimodal’s promise for a more holistic understanding Please enjoy this episode.

For more information, check out https://www.superprompt.fm There you can contact me and/or sign up for our newsletter.

No reviews yet
In the spirit of reconciliation, Audible acknowledges the Traditional Custodians of country throughout Australia and their connections to land, sea and community. We pay our respect to their elders past and present and extend that respect to all Aboriginal and Torres Strait Islander peoples today.