• Kaizen at Digital Speed: Engineering the Agentic Enterprise Operating System
    Feb 16 2026

    In this episode of Memriq Inference Digest — Engineering Edition, we dive into the transformational role of engineers in the age of the agentic enterprise. Discover how continuous improvement at digital speed reshapes engineering from shipping code to building self-improving workflows powered by autonomous AI agents.

    In this episode:

    - Explore the shift from feature delivery to workflow orchestration in agentic systems

    - Understand the five technical pillars every agent engineer must master

    - Learn why operational literacy and governance are critical skills for engineers

    - Contrast 'tool-first' versus 'operating-system-first' engineering approaches

    - Get practical steps to prepare yourself for the future of agent-driven enterprises

    Key tools & technologies mentioned:

    - Autonomous AI agents

    - Workflow orchestration and architecture

    - Observability frameworks (logging, metrics, traces)

    - Evaluation and continuous testing harnesses

    - Governance models and policy gates
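The "policy gates" listed above can be pictured as a thin check sitting between an agent's proposed action and its execution. Below is a minimal sketch under assumed rules (an allow-list plus a spend cap); the action names and policy model are illustrative, not from the episode:

```python
# Illustrative policy gate: an agent's proposed action is checked
# against governance rules before it may execute. The allow-list and
# budget cap are assumptions made for this sketch.

ALLOWED_ACTIONS = {"read_ticket", "draft_reply", "update_crm"}
MAX_SPEND_USD = 50.0

def policy_gate(action: str, spend_usd: float = 0.0) -> bool:
    """Return True only if the action passes every governance check."""
    if action not in ALLOWED_ACTIONS:
        return False          # unknown or forbidden tool call
    if spend_usd > MAX_SPEND_USD:
        return False          # exceeds the per-action budget
    return True

print(policy_gate("draft_reply"))         # permitted action -> True
print(policy_gate("delete_database"))     # not on the allow-list -> False
print(policy_gate("update_crm", 120.0))   # over budget -> False
```

The point of the gate is that it is boring and auditable: the agent can be as clever as it likes, but every side effect passes through one deterministic checkpoint.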

    Timestamps:

    0:00 Introduction & episode overview

    2:30 Why agentification matters now

    5:15 The evolving role of engineers in the agentic enterprise

    8:45 The five technical pillars: workflow, integration, observability, evaluation, governance

    14:30 Engineering paths: tool-first vs operating-system-first

    17:00 Practical preparation roadmap for engineers

    19:30 Closing thoughts & next steps

    Resources:

    "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition

    This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    20 mins
  • Opus 4.6 Deep Dive: Memory, Reasoning & Multi-Agent AI Architectures
    Feb 9 2026

    Unlock the potential of Anthropic's Claude Opus 4.6, a breakthrough AI model designed for deep reasoning and multi-agent orchestration with a massive one million token context window. Discover how this update transforms agent stack design by introducing adaptive effort tuning, advanced memory management, and role discipline in multi-model pipelines.

    In this episode:

    - Explore Opus 4.6’s unique ‘effort’ parameter and its role in controlling deep reasoning workloads

    - Understand how Opus 4.6 integrates large context windows and subagent orchestration for complex workflows

    - Compare Opus 4.6 with OpenAI’s GPT-5.2 to weigh trade-offs in cost, multimodality, and reasoning depth

    - Learn practical deployment strategies and model role assignments for efficient multi-agent pipelines

    - Hear real-world success stories from enterprises leveraging Opus 4.6 in production

    - Review open challenges like cost governance, migration complexity, and multi-agent safety
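The "model role assignments" point above can be sketched as a router that sends heavy reasoning to a large model and routine steps to cheaper tiers. The tier names and the task taxonomy below are illustrative assumptions, not Anthropic's or OpenAI's API:

```python
# Illustrative role assignment for a multi-model pipeline: each task
# kind maps to a model tier. Tier names and task kinds are assumptions
# for this sketch, not real model identifiers.

ROLE_TABLE = {
    "deep_reasoning": "opus-class",   # long-context planning and synthesis
    "routine": "mid-tier",            # summaries, formatting, glue steps
    "bulk": "small-fast",             # high-volume classification/extraction
}

def assign_model(task_kind: str) -> str:
    """Map a task kind to a model role; fail loudly on unknown kinds."""
    if task_kind not in ROLE_TABLE:
        raise ValueError(f"no role defined for task kind: {task_kind!r}")
    return ROLE_TABLE[task_kind]

print(assign_model("deep_reasoning"))  # -> opus-class
```

Keeping the routing table explicit is what the episode calls role discipline: cost governance becomes a code review of one dictionary rather than an archaeology dig through prompts.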

    Key tools & technologies mentioned: Anthropic Claude Opus 4.6, OpenAI GPT-5.2, GitHub Copilot, Retrieval-Augmented Generation, Adaptive Thinking, Effort Parameter, Multi-Agent AI Pipelines

    Timestamps:

    [00:00] Introduction & Episode Overview

    [02:30] The 'Effort' Parameter & Overthinking Feature

    [06:00] Why Opus 4.6 Matters Now: Long Context & Reasoning Boost

    [09:30] Architecting Multi-Model Agent Pipelines

    [12:45] Head-to-Head: Opus 4.6 vs GPT-5.2

    [15:00] Under the Hood: Technical Innovations

    [17:30] Real-World Impact & Use Cases

    [19:45] Practical Tips & Open Challenges

    Resources:

    - "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition

    - This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    20 mins
  • Moltbook Uncovered: Lessons from the AI Social Network Experiment
    Feb 2 2026

    Explore Moltbook, the groundbreaking AI social network where autonomous agents debate, self-organize, and evolve their own culture — revealing critical insights for developers building agentic systems. In this episode, we unpack Moltbook’s architecture, emergent behaviors, and the leadership challenges posed by autonomous AI social dynamics.

    In this episode:

    - What makes Moltbook a unique multi-agent AI social network and why it matters now

    - The technical core: personality templates, interaction graphs, and reinforcement learning

    - Trade-offs between emergent social AI and traditional rule-based multi-agent systems

    - Real-world applications and the cost, governance, and risk considerations for leaders

    - Practical strategies and tooling advice for developers experimenting with agentic AI

    - Open challenges including unpredictability, bias, and evaluation in emergent AI cultures

    Key tools & technologies: Transformer-based large language models, multi-agent reinforcement learning frameworks, interaction graph data structures
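The "interaction graph data structures" mentioned above can be sketched as a directed multigraph of agent-to-agent exchanges. The shape below is a generic guess at the idea, not Moltbook's actual data model:

```python
# Generic interaction graph: directed edges count who addressed whom,
# so emergent structure (hubs, cliques) can be measured afterwards.
# This is an illustrative sketch, not Moltbook's implementation.

from collections import defaultdict

class InteractionGraph:
    def __init__(self):
        self.edges = defaultdict(int)   # (speaker, listener) -> count

    def record(self, speaker: str, listener: str) -> None:
        self.edges[(speaker, listener)] += 1

    def out_degree(self, agent: str) -> int:
        """Total messages an agent has sent: a crude activity signal."""
        return sum(n for (s, _), n in self.edges.items() if s == agent)

g = InteractionGraph()
g.record("agent_a", "agent_b")
g.record("agent_a", "agent_c")
g.record("agent_b", "agent_a")
print(g.out_degree("agent_a"))  # -> 2
```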

    Timestamps:

    00:00 - Introduction to Moltbook and agentic AI social networks

    03:30 - The AI social drama and emergent behaviors in Moltbook

    08:15 - Technical deep dive: architecture and agent design

    12:00 - Payoff metrics and emergent cultures

    14:30 - Leadership reality checks and governance implications

    17:00 - Practical applications and tech battle scenario

    19:30 - Open problems and final insights

    Resources:

    - "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition

    - This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    27 mins
  • Agent-Driven UI Testing: What Changes & Which Stacks Are Ready?
    Jan 26 2026

UI testing has long been a pain point for engineering teams—expensive to write, brittle, and hard to maintain. In this episode of Memriq Inference Digest - Engineering Edition, we explore how AI-powered agents are transforming end-to-end (E2E) click-through testing by automating test planning, generation, and repair, making UI testing more scalable and sustainable. We also compare how different technology stacks like React/Next.js and Flutter support these new agent-driven approaches.

    In this episode, we cover:

    - Why traditional E2E UI tests often fail to catch real user issues despite existing for years

    - How Playwright’s Planner, Generator, and Healer agents automate test lifecycle and maintenance

    - The impact of UI framework choices on agent-driven testing success, especially React/Next.js vs Flutter

    - Practical trade-offs between AI code-generation tools and runtime UI interaction agents

    - Real-world examples and engineering best practices to make UI tests robust and maintainable

    - Open challenges and the future direction of agent-driven UI testing
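The Healer role discussed above can be pictured as a retry loop: when a step fails because its locator broke, an agent proposes a fresh locator and the step is retried. The sketch below uses stubbed functions to show the loop shape only; `find_element` and `propose_locator` are stand-ins, not Playwright's actual agent API:

```python
# Generic sketch of a "healer" loop: a failing UI step gets its broken
# locator regenerated and is retried. The stubs below stand in for a
# real DOM query and a real LLM-backed Healer agent.

CURRENT_DOM = {"#submit-btn-v2"}   # pretend the page now uses a new id

def find_element(locator: str) -> bool:
    return locator in CURRENT_DOM

def propose_locator(old_locator: str) -> str:
    # Stub for the Healer; a real agent would inspect the live DOM.
    return "#submit-btn-v2"

def run_step(locator: str, max_heals: int = 1) -> str:
    for _ in range(max_heals + 1):
        if find_element(locator):
            return locator                  # step succeeded
        locator = propose_locator(locator)  # heal: regenerate and retry
    raise RuntimeError("step failed even after healing")

print(run_step("#submit-btn"))  # old locator heals to #submit-btn-v2
```

Capping `max_heals` matters: unbounded healing can silently paper over genuine regressions, which is one of the open challenges the episode raises.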

    Key tools and technologies discussed:

    - Playwright v1.56 AI agents (Planner, Generator, Healer)

    - React and Next.js web frameworks

    - Flutter’s flutter_test and integration_test frameworks

    - Vitest, Jest, MSW for test runners and mocking

    - AI coding assistants like Claude Code and GitHub Copilot


    Timestamps:

    0:00 – Introduction to Agent-Driven UI Testing

    3:30 – Why Traditional E2E Tests Often Fail

    6:45 – Playwright’s Planner, Generator & Healer Explained

    10:15 – Framework Readiness: React/Next.js vs Flutter

    13:00 – Comparing AI Code Gen and Agent-Driven Testing

    15:30 – Real-World Use Cases and Engineering Insights

    18:00 – Open Challenges & The Future of Agent-Driven Testing

    20:00 – Closing Thoughts and Book Recommendation


    Resources:

    - "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition

    - This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    21 mins
  • Belief States Uncovered: Internal Knowledge & Uncertainty in AI Agents
    Jan 19 2026

    Uncertainty is not just noise—it's the internal state that guides AI decision-making. In this episode of Memriq Inference Digest, we explore belief states, a foundational concept that enables AI systems to represent and reason about incomplete information effectively. From classical Bayesian filtering to cutting-edge neural planners like BetaZero, we unpack how belief states empower intelligent agents in real-world, uncertain environments.

    In this episode:

    - Understand the core concept of belief states and their role in AI under partial observability

    - Compare symbolic, probabilistic, and neural belief state representations and their trade-offs

    - Dive into practical implementations including Bayesian filtering, particle filters, and neural implicit beliefs

    - Explore integrating belief states with CoALA memory systems for conversational AI

    - Discuss real-world applications in robotics, autonomous vehicles, and dialogue systems

    - Highlight open challenges and research frontiers including scalability, calibration, and multi-agent belief reasoning

    Key tools/technologies mentioned:

    - Partially Observable Markov Decision Processes (POMDPs)

    - Bayesian filtering methods: Kalman filters, particle filters

    - Neural networks: RNNs, Transformers

    - Generative models: VAEs, GANs, diffusion models

    - BetaZero and Monte Carlo tree search

    - AGM belief revision framework

    - I-POMDPs for multi-agent settings

    - CoALA agentic memory architecture
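The Bayesian filtering methods listed above all share one core cycle: push the belief over hidden states through a transition model (predict), then reweight it by the measurement likelihood and renormalize (update). Kalman and particle filters are the Gaussian and sampled instances of this same loop. A minimal discrete version:

```python
# Minimal discrete Bayes filter over a belief state: a probability per
# hidden state, a predict step through a transition model, and an
# update step that reweights by the observation likelihood.

def predict(belief, transition):
    """transition[i][j] = P(next=j | current=i)."""
    n = len(belief)
    return [sum(belief[i] * transition[i][j] for i in range(n))
            for j in range(n)]

def update(belief, likelihood):
    """Reweight by P(observation | state) and renormalize."""
    posterior = [b * l for b, l in zip(belief, likelihood)]
    z = sum(posterior)
    return [p / z for p in posterior]

# Two hidden states, a sticky transition, an observation favoring state 1
belief = [0.5, 0.5]
T = [[0.9, 0.1], [0.1, 0.9]]
belief = predict(belief, T)          # stays [0.5, 0.5] by symmetry
belief = update(belief, [0.2, 0.8])  # posterior becomes [0.2, 0.8]
print(belief)
```

Everything else in the episode scales this loop up: particle filters sample the belief instead of enumerating it, and neural planners like BetaZero learn to act directly on a learned belief representation.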

    Resources:

    1. "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition
    2. This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    37 mins
  • Recursive Language Models: A Paradigm Shift for Agentic AI Scalability
    Jan 12 2026

    Discover how Recursive Language Models (RLMs) are fundamentally changing the way AI systems handle ultra-long contexts and complex reasoning. In this episode, we unpack why RLMs enable models to programmatically query massive corpora—two orders of magnitude larger than traditional transformers—delivering higher accuracy and cost efficiency for agentic AI applications.

    In this episode:

    - Explore the core architectural shift behind RLMs and how they externalize context via sandboxed Python environments

    - Compare RLMs against other long-context approaches like Gemini 1.5 Pro, Longformer, BigBird, and RAG

    - Dive into technical trade-offs including latency, cost variability, and verification overhead

    - Hear real-world use cases in legal discovery, codebase analysis, and research synthesis

    - Get practical tips on tooling with RLM official repo, Modal and Prime sandboxes, and hybrid workflows

    - Discuss open challenges and future research directions for optimizing RLM deployments
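The recursive querying idea above can be sketched as divide-and-conquer over a corpus too large for one context window: split until chunks fit a budget, answer per chunk, and combine. `call_llm` below is a stub standing in for a real model call, and the Python-side reduce replaces the model-driven synthesis a real RLM would perform:

```python
# Hedged sketch of recursive querying: split the corpus until chunks
# fit a context budget, answer per chunk, then combine the answers.
# call_llm is a toy stub, not the RLM repo's API.

def call_llm(prompt: str) -> str:
    # Stub model: reports whether the marker string appears in its prompt.
    return "found" if "NEEDLE" in prompt else "not found"

def recursive_query(corpus: str, question: str, chunk_size: int = 1000) -> str:
    if len(corpus) <= chunk_size:
        return call_llm(f"{question}\n---\n{corpus}")
    mid = len(corpus) // 2
    answers = [recursive_query(corpus[:mid], question, chunk_size),
               recursive_query(corpus[mid:], question, chunk_size)]
    # A real RLM would have the root model synthesize sub-answers;
    # we reduce in plain Python to keep the sketch runnable.
    return "found" if "found" in answers else "not found"

corpus = "x" * 2000 + " NEEDLE " + "x" * 100
print(recursive_query(corpus, "is the needle present?"))  # -> found
```

Note the naive midpoint split can cut an answer span in half; this is one reason the actual approach lets the model choose its own splits programmatically rather than chunking blindly.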

    Key tools and technologies mentioned:

    - Recursive Language Model (RLM) official GitHub repo

    - Modal and Prime sandboxed execution environments

    - GPT-5 and GPT-5-mini models

    - Gemini 1.5 Pro, Longformer, BigBird architectures

    - Retrieval-Augmented Generation (RAG)

    - Prime Intellect context folding

    - MemGPT, LLMLingua token compression

    Timestamps:

    00:00 - Introduction to Recursive Language Models and agentic AI

    03:15 - The paradigm shift: externalizing context and recursive querying

    07:30 - Benchmarks and performance comparisons with other long-context models

    11:00 - Under the hood: how RLMs orchestrate recursive sub-LLM calls

    14:20 - Real-world applications: legal, code, and research use cases

    16:45 - Technical trade-offs: latency, cost, and verification

    18:30 - Toolbox and best practices for engineers

    20:15 - Future directions and closing thoughts

    Resources:

    "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition

    This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    Stay tuned and keep pushing the boundaries of AI engineering with Memriq Inference Digest!

    21 mins
  • Evaluating Agentic AI: DeepEval, RAGAS & TruLens Frameworks Compared
    Jan 5 2026

    In this episode of Memriq Inference Digest - Engineering Edition, we explore the cutting-edge evaluation frameworks designed for agentic AI systems. Dive into the strengths and trade-offs of DeepEval, RAGAS, and TruLens as we unpack how they address multi-step agent evaluation challenges, production readiness, and integration with popular AI toolkits.

    In this episode:

    - Compare DeepEval’s extensive agent-specific metrics and pytest-native integration for development testing

    - Understand RAGAS’s knowledge graph-powered synthetic test generation that slashes test creation time by 90%

    - Discover TruLens’s production-grade observability with hallucination detection via the RAG Triad framework

    - Discuss hybrid evaluation strategies combining these frameworks across the AI lifecycle

    - Learn about real-world deployments in fintech, e-commerce, and enterprise conversational AI

    - Hear expert insights from Keith Bourne on calibration and industry trends

    Key tools & technologies mentioned:

    DeepEval, RAGAS, TruLens, LangChain, LlamaIndex, LangGraph, OpenTelemetry, Snowflake, Datadog, Cortex AI, DeepTeam
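Across all three frameworks the common shape is the same: wrap an LLM interaction in a test case, score it with a metric in [0, 1], and assert a threshold. The sketch below uses a toy keyword-overlap metric and made-up names (`EvalCase`, `relevancy_score`) to show that shape; real frameworks put an LLM judge behind the score:

```python
# Generic shape shared by DeepEval/RAGAS/TruLens-style checks: a test
# case, a metric score in [0, 1], and a threshold assertion. Names and
# the keyword-overlap metric are illustrative, not any real API.

from dataclasses import dataclass

@dataclass
class EvalCase:
    question: str
    answer: str

def relevancy_score(case: EvalCase) -> float:
    """Toy proxy metric: share of question words echoed in the answer."""
    q = set(case.question.lower().split())
    a = set(case.answer.lower().split())
    return len(q & a) / max(len(q), 1)

def check(case: EvalCase, threshold: float = 0.5) -> bool:
    return relevancy_score(case) >= threshold

case = EvalCase("what is retrieval augmented generation",
                "retrieval augmented generation grounds answers in documents")
print(check(case))  # 3 of 5 question words echoed -> 0.6 >= 0.5 -> True
```

The episode's calibration discussion is about exactly this threshold: an uncalibrated judge makes `check` pass or fail for reasons unrelated to quality.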

    Timestamps:

    00:00 - Introduction to agentic AI evaluation frameworks

    03:00 - Key metrics and evaluation challenges

    06:30 - Framework architectures and integration

    10:00 - Head-to-head comparison and use cases

    14:00 - Deep technical overview of each framework

    17:30 - Real-world deployments and best practices

    19:30 - Open problems and future directions

    Resources:

    1. "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition
    2. This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    20 mins
  • Model Context Protocol: The Universal AI Integration Standard Explained
    Dec 15 2025

    Discover how the Model Context Protocol (MCP) is revolutionizing AI systems integration by simplifying complex multi-tool interactions into a scalable, open standard. In this episode, we unpack MCP’s architecture, adoption by industry leaders, and its impact on engineering workflows.

    In this episode:

    - What MCP is and why it matters for AI/ML engineers and infrastructure teams

    - The M×N integration problem and how MCP reduces it to M+N

    - Core primitives: Tools, Resources, and Prompts, and their roles in MCP

    - Technical deep dive into JSON-RPC 2.0 messaging, transports, and security with OAuth 2.1 + PKCE

    - Comparison of MCP with OpenAI Function Calling, LangChain, and custom REST APIs

    - Real-world adoption, performance metrics, and engineering trade-offs

    - Open challenges including security, authentication, and operational complexity
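The M×N-to-M+N reduction in the bullets above is simple arithmetic worth making concrete: without a shared protocol, every client implements every tool's API; with one, each client speaks the protocol once and each tool exposes one server.

```python
# The integration-count arithmetic behind MCP's pitch: M clients and
# N tools need M*N point-to-point adapters without a shared protocol,
# but only M client implementations plus N servers with one.

def bespoke_integrations(m_clients: int, n_tools: int) -> int:
    """One adapter per (client, tool) pair."""
    return m_clients * n_tools

def mcp_integrations(m_clients: int, n_tools: int) -> int:
    """One protocol implementation per client, one server per tool."""
    return m_clients + n_tools

# 10 clients and 50 tools: 500 adapters collapse to 60 endpoints
print(bespoke_integrations(10, 50), mcp_integrations(10, 50))  # -> 500 60
```

The quadratic-versus-linear gap is why adoption compounds: each new MCP server is immediately usable by every existing client, instead of triggering another row of bespoke adapters.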

    Key tools & technologies mentioned:

    - Model Context Protocol (MCP)

    - JSON-RPC 2.0

    - OAuth 2.1 with PKCE

    - FastMCP Python SDK, MCP TypeScript SDK

    - agentgateway by Solo.io

    - OpenAI Function Calling

    - LangChain

    Timestamps:

    00:00 — Introduction to MCP and episode overview

    02:30 — The M×N integration problem and MCP’s solution

    05:15 — Why MCP adoption is accelerating

    07:00 — MCP architecture and core primitives explained

    10:00 — Head-to-head comparison with alternatives

    12:30 — Under the hood: protocol mechanics and transports

    15:00 — Real-world impact and usage metrics

    17:30 — Challenges and security considerations

    19:00 — Closing thoughts and future outlook

    Resources:

    • "Unlocking Data with Generative AI and RAG" by Keith Bourne - Search for 'Keith Bourne' on Amazon and grab the 2nd edition
    • This podcast is brought to you by Memriq.ai - AI consultancy and content studio building tools and resources for AI practitioners.

    20 mins