AI Safety Breakthrough

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

AI Safety Breakthrough

By: AI SafeGuard

Listen for free

About this listen

The future of AI is in our hands. Join AI SafeGuard on "AI Safety Breakthrough" as we explore the frontiers of AI safety research and discuss how we can ensure a future where AI remains beneficial for everyone. We delve into the latest breakthroughs, uncover potential risks, and empower listeners to become informed participants in the conversation about AI's role in society. Subscribe now and become part of the solution!

Intro about the author

J, graduated from Carnegie Mellon University, School of Computer Science, 10+ years in Cybersecurity, Cyber Threat Intelligence, Risk, Compliance, privacy and AI Safety.

Economics

Leadership

Management & Leadership

Episodes View all

Navigating the New AI Security

Aug 13 2025

Welcome to Agentic AI Unlocked, your deep dive into the transformative world of Agentic AI—systems combining large language models with advanced reasoning and autonomous action. These intelligent agents promise to disrupt industries, yet introduce a fundamentally new threat surface. Risks like memory poisoning, tool misuse, prompt injection, and insider threats highlight the urgent need for robust security and real-time governance.
The OWASP GenAI Security Project aims to provide actionable insights into these challenges, helping organizations responsibly develop, deploy, and govern agentic AI. We advocate a proactive, defense-in-depth approach across the entire agent lifecycle.
Join us as we explore crucial safeguards like fine-grained access control, runtime monitoring, memory hygiene, and secure tool integration. We'll also cover the evolving ecosystem of agent frameworks, emerging protocols, and complex regulatory landscapes like ISO/IEC 42001, NIST AI RMF, and the EU AI Act.
Agentic AI offers immense promise alongside significant risks. This podcast equips you with the understanding and strategies for secure and responsible deployment. Let’s unlock the future of AI, securely.

Show More Show Less

25 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
DeepSeek: A Disruptive Force in AI

Feb 3 2025

This episode explores DeepSeek, a Chinese AI startup challenging the AI landscape with its free alternative to ChatGPT. We'll examine DeepSeek's innovative architecture, including Mixture-of-Experts (MoE) and Multi-head Latent Attention (MLA), which optimize efficiency. The discussion will highlight DeepSeek's use of reinforcement learning (RL) and its impact on reasoning capabilities, as well as how its open-source approach is democratizing AI access and innovation.
We will also discuss ethical concerns, the competitive advantages and disadvantages of US-based models, and how DeepSeek is impacting cost structures and proprietary models. Join us as we analyze DeepSeek’s influence on the AI industry and the future of AI development and international collaboration

Show More Show Less

10 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
VLSBench: A Visual Leakless Multimodal Safety Benchmark

Jan 26 2025

Are current AI safety benchmarks for multimodal models flawed? This podcast explores the groundbreaking research behind VLSBench, a new benchmark designed to address a critical flaw in existing safety evaluations: visual safety information leakage (VSIL)
We delve into how sensitive information in images is often unintentionally revealed in the accompanying text prompts, allowing models to identify unsafe content based on text alone, without truly understanding the visual risks This "leakage" leads to a false sense of security and a bias towards simple textual alignment methods.
Tune in to understand the critical need for leakless multimodal safety benchmarks and the importance of true multimodal alignment for responsible AI development. Learn how VLSBench is changing the way we evaluate AI safety and what it means for the future of AI.

Show More Show Less

20 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free

No reviews yet

AI Safety Breakthrough

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

AI Safety Breakthrough

About this listen

Navigating the New AI Security

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

DeepSeek: A Disruptive Force in AI

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

VLSBench: A Visual Leakless Multimodal Safety Benchmark

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed