Sandbox

support-exec-weekly-summary

54 items, 3 trace-derived

Executive-review cohort connecting customer-visible issues to budgets, SLAs, and experiment decisions.

Recommended dataset actions

Return to datasets

Go back to the dataset workbench and compare this evidence set against the rest of your current inventory.

Review evaluator coverage

Confirm the active evaluator set is appropriate for the cases represented in this dataset.

Launch or inspect experiments

Use this dataset as the evidence base for candidate prompt or routing experiments.

Return to source traces

Inspect the original runs feeding this dataset to make sure curation still matches the operational problem.

Dataset id

dataset_support_exec_weekly_summary

Stable dataset identifier.

Created

1m ago

Dataset creation time relative to now.

Items

54

Recorded cases currently attached to this dataset.

Trace lineage

3

Items carrying a source trace id for evidence inspection.
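
The trace lineage figure can be read as a count of items whose input includes a trace id. A minimal sketch of that count follows, assuming each item is a dictionary with an `input` object shaped like the payloads on this page; the item layout and the function name `count_trace_derived` are illustrative assumptions, not a documented API.

```python
from typing import Any


def count_trace_derived(items: list[dict[str, Any]]) -> int:
    """Count items whose input carries a non-empty traceId.

    Assumes a hypothetical item layout: {"input": {"traceId": "..."}}.
    Items with no input, or with a missing or empty traceId, are skipped.
    """
    return sum(
        1
        for item in items
        if (item.get("input") or {}).get("traceId")
    )


# Hypothetical sample: two trace-derived items and one without lineage.
sample_items = [
    {"input": {"traceId": "trace_returns_exception_v18_regression"}},
    {"input": {"traceId": "trace_shipping_kb_timeout_failed"}},
    {"input": {"promptName": "support-reply"}},
]
trace_derived = count_trace_derived(sample_items)
```

Under this reading, a dataset with 54 items and a trace lineage of 3 would have 51 items that cannot be traced back to an original run.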

Dataset items

dataset_support_exec_weekly_summary_item_1

Added 3d ago

trace-derived

Input

{
  "traceId": "trace_returns_exception_v18_regression",
  "storyLabel": "Compressed refund rollout denied a valid exception path",
  "sessionId": "session_returns_exception_week1",
  "promptName": "support-reply",
  "promptVersion": 18
}

Expected output

{
  "expectedAgentId": "Returns Resolution Copilot",
  "expectedPromptName": "support-reply",
  "expectedPromptVersion": 18,
  "expectedStory": "Compressed refund rollout denied a valid exception path",
  "expectedOutcome": "Match the seeded production behavior captured by this trace."
}

dataset_support_exec_weekly_summary_item_2

Added 1d ago

trace-derived

Input

{
  "traceId": "trace_shipping_kb_timeout_failed",
  "storyLabel": "Shipping resolution degraded during logistics lookup timeout",
  "sessionId": "session_shipping_timeout_cluster",
  "promptName": "shipping-delay-triage",
  "promptVersion": 7
}

Expected output

{
  "expectedAgentId": "Shipping Delay Resolution",
  "expectedPromptName": "shipping-delay-triage",
  "expectedPromptVersion": 7,
  "expectedStory": "Shipping resolution degraded during logistics lookup timeout",
  "expectedOutcome": "Match the seeded production behavior captured by this trace."
}

dataset_support_exec_weekly_summary_item_3

Added 4h ago

trace-derived

Input

{
  "traceId": "trace_fraud_pattern_review_07_060",
  "storyLabel": "Fraud-watch run flagged suspicious refund behavior (7.60)",
  "sessionId": "session_refund_risk_07",
  "promptName": "fraud-risk-review",
  "promptVersion": 2
}

Expected output

{
  "expectedAgentId": "Fraud Watch Investigator",
  "expectedPromptName": "fraud-risk-review",
  "expectedPromptVersion": 2,
  "expectedStory": "Fraud-watch run flagged suspicious refund behavior (7.60)",
  "expectedOutcome": "Match the seeded production behavior captured by this trace."
}
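
In each item above, the expected output repeats the prompt name, prompt version, and story label from the input. A small consistency check along those lines can catch items whose curation has drifted. This is only a sketch: the field pairing below is inferred from the three items shown, and the helper name `find_mismatches` is hypothetical.

```python
from typing import Any

# Input field -> expected-output field, inferred from the items on this page.
FIELD_PAIRS = {
    "promptName": "expectedPromptName",
    "promptVersion": "expectedPromptVersion",
    "storyLabel": "expectedStory",
}


def find_mismatches(
    item_input: dict[str, Any], expected_output: dict[str, Any]
) -> list[str]:
    """Return the input field names whose expected counterpart differs."""
    return [
        input_key
        for input_key, expected_key in FIELD_PAIRS.items()
        if item_input.get(input_key) != expected_output.get(expected_key)
    ]


# Values copied from dataset_support_exec_weekly_summary_item_3.
item_input = {
    "traceId": "trace_fraud_pattern_review_07_060",
    "storyLabel": "Fraud-watch run flagged suspicious refund behavior (7.60)",
    "sessionId": "session_refund_risk_07",
    "promptName": "fraud-risk-review",
    "promptVersion": 2,
}
expected_output = {
    "expectedAgentId": "Fraud Watch Investigator",
    "expectedPromptName": "fraud-risk-review",
    "expectedPromptVersion": 2,
    "expectedStory": "Fraud-watch run flagged suspicious refund behavior (7.60)",
    "expectedOutcome": "Match the seeded production behavior captured by this trace.",
}

mismatches = find_mismatches(item_input, expected_output)
```

An empty list means the item is internally consistent; any returned field name points at a pair worth reviewing against the source trace.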
