03
Judgment Labs takes $32M from Lightspeed to fix agents in production
verified
Wednesday, May 13, 2026
Before your next agent release, write down three failure modes you've personally watched in trajectory logs in the last month — not "hallucination," specific ones (e.g., "agent treats order-level complaint as item-level," "agent rejects refund then closes ticket without escalation"). If you can't name three, you don't yet have the observability you need to deploy an agent in front of paying customers, regardless of what your eval scores say.
For Engineering / founder building agent products