Fix experiment trigger on remote eval by stretpjc · Pull Request #128 · braintrustdata/braintrust-sdk-java

Pedro Mora (stretpjc) · 2026-06-18T16:19:08Z

Summary

Remote evals triggered as an Experiment from the Braintrust UI failed with IllegalArgumentException: braintrust parent (playground_id) not found, while Playground runs worked. Experiment runs send parent=null together with experiment_name and project_id, and the dev server had no branch for that request shape. This PR adds the missing branch and a regression test for it.

Changes

Devserver.extractParentInfo: when no playground parent is present, create an experiment via ExperimentsApi.postExperiment(...) and parent spans to experiment_id:<id>.
Devserver.extractParentInfo: pass ensure_new=true so repeated UI runs create distinct experiments instead of appending to the first.
Devserver.extractParentInfo: keep the playground path (playground_id:<id>) unchanged and turn the old unconditional throw into a real "neither signal present" fallback.
DevserverTest.testExperimentEval: new test added in this PR, sending the experiment-run shape and asserting clean summary/done with the eval span parented to experiment_id:<id>.
cassettes/.../v1_experiment-java-experiment-repro.json: add stub for POST /v1/experiment matching "ensure_new":true (also serves as the regression guard for the flag).
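The branching described in the list above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual Devserver.extractParentInfo code: the EvalRequest record, the resolveParent method, and the createExperiment callback are hypothetical stand-ins. The callback stands in for a call like ExperimentsApi.postExperiment(...) with ensure_new=true, which is assumed here to return a fresh experiment id on every call.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.BiFunction;

class ParentResolutionSketch {
    /** Hypothetical request shape; the real devserver request type is not shown in this PR. */
    record EvalRequest(String playgroundId, String experimentName, String projectId) {}

    /**
     * Resolves the span parent for a remote eval request.
     * createExperiment is a stand-in for the API call that creates an experiment
     * (projectId, experimentName) -> experimentId, assumed to behave like ensure_new=true.
     */
    static String resolveParent(EvalRequest req, BiFunction<String, String, String> createExperiment) {
        // Playground path: unchanged, parent spans to the playground.
        if (req.playgroundId() != null) {
            return "playground_id:" + req.playgroundId();
        }
        // Experiment path: no playground parent, but enough information to create an experiment.
        if (req.experimentName() != null && req.projectId() != null) {
            String experimentId = createExperiment.apply(req.projectId(), req.experimentName());
            return "experiment_id:" + experimentId;
        }
        // Real fallback: neither signal is present.
        throw new IllegalArgumentException(
                "request has neither a playground parent nor experiment_name/project_id");
    }

    public static void main(String[] args) {
        // Fake creator that hands out a new id per call, mimicking ensure_new=true.
        AtomicInteger counter = new AtomicInteger();
        BiFunction<String, String, String> fakeCreate =
                (projectId, name) -> "exp-" + counter.incrementAndGet();

        EvalRequest experimentRun = new EvalRequest(null, "my-experiment", "project-1");
        System.out.println(resolveParent(experimentRun, fakeCreate));
        System.out.println(resolveParent(experimentRun, fakeCreate));
        System.out.println(resolveParent(new EvalRequest("pg-1", null, null), fakeCreate));
    }
}
```

With the fake creator, two identical experiment-run requests resolve to two different experiment parents, which is the behavior the ensure_new=true change is meant to guarantee for repeated UI runs.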

Testing

The new testExperimentEval failed before the fix with the exact reported error; passes now.
Full DevserverTest suite passes (playground paths unaffected); validated end-to-end against a local fixed-SDK build.

Out of scope

Eval.java (CLI runner) calls postExperiment without ensureNew — left unchanged as it may be intentional for CLI re-runs.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8bc4a344a5

chatgpt-codex-connector · 2026-06-18T16:24:07Z

+                                    new CreateExperiment()
+                                            .projectId(projectId)
+                                            .name(experimentName)
+                                            .ensureNew(true));


Link dataset runs to the created experiment

When the Experiment trigger is run against a Braintrust dataset (data.dataset_id or project_name/dataset_name), this creates the experiment with only project/name/ensure_new before extractDataset() opens the dataset cursor. Unlike Eval.run(), which copies the dataset id and cursor version into CreateExperiment, the new remote-eval experiment is not linked to the dataset/version, so the Experiment page loses the dataset association even though the rows were fetched from it.
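One way to picture the suggestion is to resolve the dataset before building the create-experiment request, then copy its id and version into that request. The sketch below is only an illustration under stated assumptions: DatasetRef and createExperimentBody are hypothetical, and the dataset_id and dataset_version keys are assumed field names for the POST /v1/experiment body rather than anything shown in this PR. It builds a plain map instead of using the SDK's CreateExperiment type.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class ExperimentDatasetLinkSketch {
    /** Hypothetical handle for an already-resolved dataset and its cursor version. */
    record DatasetRef(String id, String version) {}

    /**
     * Builds a create-experiment request body.
     * dataset may be null when the eval uses inline data instead of a Braintrust dataset.
     */
    static Map<String, Object> createExperimentBody(String projectId, String name, DatasetRef dataset) {
        Map<String, Object> body = new LinkedHashMap<>();
        body.put("project_id", projectId);
        body.put("name", name);
        body.put("ensure_new", true);
        if (dataset != null) {
            // Assumed field names; link the experiment to the dataset the rows came from.
            body.put("dataset_id", dataset.id());
            if (dataset.version() != null) {
                body.put("dataset_version", dataset.version());
            }
        }
        return body;
    }

    public static void main(String[] args) {
        System.out.println(createExperimentBody("project-1", "my-experiment", null));
        System.out.println(createExperimentBody(
                "project-1", "my-experiment", new DatasetRef("dataset-1", "v42")));
    }
}
```

The point of the ordering is that the dataset has to be known before the experiment is created, which is the reverse of what the review comment describes for the current change.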

Andrew Kent (realark)

(going to take over this branch and refactor devserver and evals to share a code path to address issues like the one the bot raised)

Pedro Mora (stretpjc) added 2 commits June 18, 2026 10:12

Adding test and bug fix

18e2cda

Added ensureNew(true) to make sure new experiments are created

8bc4a34

chatgpt-codex-connector Bot reviewed Jun 18, 2026

Pedro Mora (stretpjc) requested a review from Andrew Kent (realark) June 18, 2026 18:14

Andrew Kent (realark) requested changes Jun 18, 2026

Pedro Mora (stretpjc) wants to merge 2 commits into main from fix-experiment-trigger-on-remote-eval
