Evaluation and Testing were the most frequently named constraints in our 2026 TMLS Steering Committee survey.
So our first committee meetings explored how we approach evals, specifically when dealing with uncertain metrics (specifically as they pertain to probabilistic agentic systems).
Here’s Part Tw...