Why validity beats scale when building multi‑step AI systems

The AI Fundamentalists • January 06, 2026 • Solo Episode

Guests

Dr. Sebastian (Seb) Benthall

Description

In this episode, Dr. Sebastian (Seb) Benthall joins us to discuss the paper he and Andrew wrote, “Validity Is What You Need,” on agentic AI that actually works in the real world. Our discussion connects systems engineering, mechanism design, and requirements to multi‑step AI that delivers enterprise impact and measurable outcomes.

- Defining agentic AI beyond LLM hype
- Limits of scale and the need for multi‑step control
- Tool use, compounding errors, and guardrails
- Systems...

Audio