Review 568 seeded sandbox traces across 7 days, with realistic agent names, case stories, prompt context, and linked investigation paths.
Select two runs to compare, inspect recent failures, and move from evidence to investigation.