EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

About this listen

This episode of "You Are A Helpful (Research) Assistant" is an AI-generated, human-curated discussion of vulnerabilities in refusal training for language models, focusing on how the past tense attack affects model behavior.
