Skip to content

Fix dataset-backed eval origin for copied dataset rows#2140

Merged
ekeith (evanmkeith) merged 6 commits into
mainfrom
06-18-fix-playground-dataset-log-origin
Jun 18, 2026
Merged

Fix dataset-backed eval origin for copied dataset rows#2140
ekeith (evanmkeith) merged 6 commits into
mainfrom
06-18-fix-playground-dataset-log-origin

Conversation

@evanmkeith

Copy link
Copy Markdown
Contributor

Summary

Fix dataset-backed eval/playground origin handling so copied dataset rows use the active dataset row id instead of
preserved source provenance.

Details

Dataset rows copied from logs can carry origin pointing at the original project_logs row. runEvaluator previously
preferred that datum.origin over the synthesized dataset-row origin, causing playground output/progress events to be
keyed by the source log row id instead of the selected dataset row id.

This changes dataset-backed evals to prefer the active dataset row origin whenever eventDataset and datum.id are
present. Inline/non-dataset evals still preserve explicit datum.origin.

Tests

  • Added regression coverage for log-derived dataset rows using dataset row origin
  • Added coverage for dataset rows without _xact_id
  • Added coverage for streamed progress events carrying dataset row origin
  • Ran focused Vitest origin tests successfully via local Vitest binary

## Summary

  Fix dataset-backed eval/playground origin handling so copied dataset rows use the active dataset row id instead of
  preserved source provenance.

  ## Details

  Dataset rows copied from logs can carry origin pointing at the original project_logs row. runEvaluator previously
  preferred that datum.origin over the synthesized dataset-row origin, causing playground output/progress events to be
  keyed by the source log row id instead of the selected dataset row id.

  This changes dataset-backed evals to prefer the active dataset row origin whenever eventDataset and datum.id are
  present. Inline/non-dataset evals still preserve explicit datum.origin.

  ## Tests

  - Added regression coverage for log-derived dataset rows using dataset row origin
  - Added coverage for dataset rows without _xact_id
  - Added coverage for streamed progress events carrying dataset row origin
  - Ran focused Vitest origin tests successfully via local Vitest binary
@j13huang

Copy link
Copy Markdown
Contributor

updated the code a little bit. Feel free to merge after tests. Thanks again ekeith (@evanmkeith)!

@evanmkeith ekeith (evanmkeith) enabled auto-merge (squash) June 18, 2026 21:12
@max-braintrust

Copy link
Copy Markdown
Contributor

Thanks for catching+fixing this! Added the changeset so it will pass - lmk if I can help FI.

@evanmkeith

Copy link
Copy Markdown
Contributor Author

Thank you both!!

@evanmkeith ekeith (evanmkeith) removed the request for review from john (j13huang) June 18, 2026 21:33
@evanmkeith ekeith (evanmkeith) merged commit 5c3e091 into main Jun 18, 2026
90 of 92 checks passed
@evanmkeith ekeith (evanmkeith) deleted the 06-18-fix-playground-dataset-log-origin branch June 18, 2026 21:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants