All runs6.7s · 2 tools
37
score
Acme checkout API escalation
failedrun_acme_hero · escalation-agent@1.0.0 · 21d ago · source mock
“Investigate why Acme Corp's checkout API started failing this morning and draft a customer-safe update.”
Latency
6.7s
Cost
$0.12
Tokens
5,230
Tool calls
2
Execution signal path
5 steps · 2 missing
- Never called — Check deployment historyRequired before a customer-facing root-cause claim. This gap is the failure.Never called — Check runtime telemetryRequired before a customer-facing root-cause claim. This gap is the failure.
Ask Autopilot
Autonomous, evidence-backed diagnosis
This run failed its evals. Ask Autopilot to diagnose the root cause from the trace and evidence.
Final customer-facing response
Based on our investigation, the checkout API failures at Acme Corp were caused by last night's deployment. We are rolling back the release now and expect service to be fully restored shortly. We apologise for the disruption.
⚠ Contains 1 unsupported causal claim.
Eval scores
Groundedness31%
Required tool usage0%
Approval policy100%
Usefulness62%
Cost & Latency Doctor
Efficient — no waste detected.