EP21 - Large Language Models and The Power of Scale cover art

EP21 - Large Language Models and The Power of Scale

EP21 - Large Language Models and The Power of Scale

Listen for free

View show details

About this listen

This episode moves from the Transformer architecture to the models that define our era: Large Language Models (LLMs). We explore how the simple act of "next-word prediction," when combined with internet-scale data and massive compute, leads to the surprising "emergent abilities" of models like GPT-4, and we break down the crucial training paradigm of pre-training and fine-tuning.
No reviews yet
In the spirit of reconciliation, Audible acknowledges the Traditional Custodians of country throughout Australia and their connections to land, sea and community. We pay our respect to their elders past and present and extend that respect to all Aboriginal and Torres Strait Islander peoples today.