All runs
90
score

Acme checkout API escalation

warning

run_cust_acme_warn_980 · escalation-agent@1.0.0 · 22d ago · source mock

Investigate why Acme Corp's checkout API started failing this morning and draft a customer-safe update.

Latency
18.4s
Cost
$0.27
Tokens
7,350
Tool calls
9
Execution signal path
3 steps · 2 missing
18.4s · 9 tools
  1. Never called — Check deployment history
    Required before a customer-facing root-cause claim. This gap is the failure.
    Never called — Check runtime telemetry
    Required before a customer-facing root-cause claim. This gap is the failure.
Ask Autopilot
Autonomous, evidence-backed diagnosis

This run failed its evals. Ask Autopilot to diagnose the root cause from the trace and evidence.

Final customer-facing response

Investigation complete with verified evidence. Note: run latency was elevated due to repeated tool calls.

Eval scores
Groundedness84%
Required tool usage100%
Approval policy100%
Usefulness76%
Cost & Latency Doctor
  • High latency
    Run took 18.4s (threshold 12s).
  • Excessive retries
    4 retries (cap 2). Add a retry budget per tool.
  • Excessive tool calls
    9 tool calls in one run — consider caching or planning fewer lookups.
  • Elevated cost
    Estimated $0.27 for this run.
AgentOps Autopilot — reliability for AI agents