ci: upload system-tests results to Test Optimization and add tracer-release nightly by nccatoni · Pull Request #3660 · DataDog/dd-trace-php

nccatoni · 2026-02-18T14:16:02Z

Description

Two changes to the system-tests CI integration:

Upload all system-tests results to Test Optimization — adds an after_script to the .system_tests base job that copies JUnit XML files and uploads them via the existing silent-upload-junit-to-datadog.sh script.
Add tracer-release nightly — create a new system-tests workflow that uses the official system-tests reusable workflow to run the tracer-release scenario group on master.

This is meant to replace the system-tests-dashboard nightly.

Reviewer checklist

This is a CI-only change, no code changes.
Appropriate labels assigned.

Run the official system-tests reusable workflow daily at 4 AM GMT using dev mode against the latest commit on the default branch. Co-authored-by: Cursor <cursoragent@cursor.com>

datadog-official · 2026-02-18T14:22:44Z

⚠️ Tests

✨ Fix all issues with BitsAI or with Cursor

⚠️ Warnings

🧪 1028 Tests failed

testSearchPhpBinaries from integration.DDTrace\Tests\Integration\PHPInstallerTest (Datadog) (Fix with Cursor)

DDTrace\Tests\Integration\PHPInstallerTest::testSearchPhpBinaries
Test code or tested code printed unexpected output: Searching for available php binaries, this operation might take a while.

testSimplePushAndProcess from laravel-58-test.DDTrace\Tests\Integrations\Laravel\V5_8\QueueTest (Datadog) (Fix with Cursor)

DDTrace\Tests\Integrations\Laravel\V5_8\QueueTest::testSimplePushAndProcess
Test code or tested code printed unexpected output: spanLinksTraceId: 69a969a600000000c48b65f113553381
tid: 69a969a600000000
hexProcessTraceId: c48b65f113553381
hexProcessSpanId: 2b288e9916edfc1a
processTraceId: 14162525539282531201
processSpanId: 3109892330865163290

phpvfscomposer://tests/vendor/phpunit/phpunit/phpunit:106

testSimplePushAndProcess from laravel-8x-test.DDTrace\Tests\Integrations\Laravel\V8_x\QueueTest (Datadog) (Fix with Cursor)

DDTrace\Tests\Integrations\Laravel\V8_x\QueueTest::testSimplePushAndProcess
Test code or tested code printed unexpected output: spanLinksTraceId: 69a969df000000007dbede904607fcdd
tid: 69a969df00000000
hexProcessTraceId: 7dbede904607fcdd
hexProcessSpanId: 8db73156cf8cf114
processTraceId: 9060924211547602141
processSpanId: 10211684929004957972

View all

ℹ️ Info

❄️ No new flaky tests detected

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: a65519d | Docs | Datadog PR Page | Was this helpful? React with 👍/👎 or give us feedback!}

Co-authored-by: Cursor <cursoragent@cursor.com>

codecov-commenter · 2026-02-18T14:35:08Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 62.11%. Comparing base (a9caaa0) to head (a65519d).
⚠️ Report is 14 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3660      +/-   ##
==========================================
- Coverage   62.21%   62.11%   -0.11%     
==========================================
  Files         141      141              
  Lines       13387    13387              
  Branches     1753     1753              
==========================================
- Hits         8329     8315      -14     
- Misses       4260     4273      +13     
- Partials      798      799       +1

see 3 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a9caaa0...a65519d. Read the comment docs.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Co-authored-by: Cursor <cursoragent@cursor.com>

- Add after_script to .system_tests base job to upload JUnit results to Datadog Test Optimization using the existing upload script - Add tracer-release scenario group to configure_system_tests on schedule and master runs - Remove separate GitHub Actions nightly workflow Made-with: Cursor

The silent wrapper's grep returns exit code 1 when no summary lines are found, causing the after_script to fail. Use the non-silent version with || true to prevent after_script failures. Made-with: Cursor

Replace the generic upload-junit-to-datadog.sh with a custom after_script that: - Uses --service system-tests (not dd-trace-php-tests) - References logs directly from system-tests/logs*/ - Uses the datadog-ci standalone binary (no npm needed) - Uses Python for JSON/zip (guaranteed available in the image) - Provides explicit error messages instead of silent exit 0 Made-with: Cursor

pr-commenter · 2026-02-26T17:25:52Z

Benchmarks [ tracer ]

Benchmark execution time: 2026-03-05 12:42:20

Comparing candidate commit a65519d in PR branch add-nightly-system-tests with baseline commit a9caaa0 in branch master.

Found 5 performance improvements and 9 performance regressions! Performance is the same for 180 metrics, 0 unstable metrics.

scenario:MessagePackSerializationBench/benchMessagePackSerialization

🟩 execution_time [-4.880µs; -3.940µs] or [-4.501%; -3.634%]

scenario:MessagePackSerializationBench/benchMessagePackSerialization-opcache

🟩 execution_time [-8.557µs; -7.023µs] or [-7.589%; -6.229%]

scenario:PDOBench/benchPDOBaseline-opcache

🟥 execution_time [+1.791µs; +2.129µs] or [+2.916%; +3.466%]

scenario:PHPRedisBench/benchRedisBaseline

🟩 execution_time [-58.269µs; -56.595µs] or [-42.866%; -41.635%]

scenario:PHPRedisBench/benchRedisOverhead

🟩 execution_time [-267.416µs; -261.197µs] or [-31.947%; -31.204%]

scenario:SamplingRuleMatchingBench/benchRegexMatching1

🟥 execution_time [+93.852ns; +156.748ns] or [+8.123%; +13.567%]

scenario:SamplingRuleMatchingBench/benchRegexMatching1-opcache

🟥 execution_time [+86.725ns; +138.875ns] or [+6.807%; +10.900%]

scenario:SamplingRuleMatchingBench/benchRegexMatching2

🟥 execution_time [+88.359ns; +150.441ns] or [+7.583%; +12.910%]

scenario:SamplingRuleMatchingBench/benchRegexMatching2-opcache

🟥 execution_time [+94.857ns; +129.743ns] or [+7.400%; +10.121%]

scenario:SamplingRuleMatchingBench/benchRegexMatching3

🟥 execution_time [+99.530ns; +144.470ns] or [+8.613%; +12.502%]

scenario:SamplingRuleMatchingBench/benchRegexMatching3-opcache

🟥 execution_time [+97.659ns; +141.741ns] or [+7.610%; +11.045%]

scenario:SamplingRuleMatchingBench/benchRegexMatching4

🟥 execution_time [+90.680ns; +154.920ns] or [+7.831%; +13.379%]

scenario:SamplingRuleMatchingBench/benchRegexMatching4-opcache

🟥 execution_time [+102.338ns; +149.062ns] or [+8.010%; +11.666%]

scenario:SymfonyBench/benchSymfonyDdprof-opcache

🟩 execution_time [-660.751µs; -325.129µs] or [-7.408%; -3.645%]

Made-with: Cursor

nccatoni · 2026-03-05T12:06:47Z

For this to work two things need to be done:
Adding docker/login-action and nick-fields/retry to the list of authorized actions for the repo
Adding an Datadog API key as a github repo secret

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: a65519dc59

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-05T12:12:14Z

.github/workflows/system_tests.yml

+
+jobs:
+  system-tests:
+    uses: DataDog/system-tests/.github/workflows/system-tests.yml@main


Pin reusable workflow ref before passing API key secrets

This job passes DD_API_KEY/TEST_OPTIMIZATION_API_KEY into a reusable workflow referenced by mutable @main, so any upstream change on DataDog/system-tests immediately executes here with access to those secrets. In the event of an upstream compromise or an unintended change, secrets can be exposed without any commit in this repository; pinning to an immutable commit SHA avoids that supply-chain risk.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-05T12:12:14Z

.gitlab/generate-package.php

+      /tmp/datadog-ci junit upload \
+        system-tests/logs*/reportJunit.xml \


Check junit upload exit status before declaring completion

Because this block runs with set +e, datadog-ci junit upload can fail (for example on transient network/auth/API errors) and the script will still continue to print Upload complete. That creates a false-success signal while silently dropping test results from Test Optimization; explicitly checking the upload command status and logging a failure path would prevent this blind spot.

Useful? React with 👍 / 👎.

Add nightly system-tests workflow

17ff796

Run the official system-tests reusable workflow daily at 4 AM GMT using dev mode against the latest commit on the default branch. Co-authored-by: Cursor <cursoragent@cursor.com>

Enable push_to_test_optimization

10f7eab

Co-authored-by: Cursor <cursoragent@cursor.com>

nccatoni changed the title ~~Add nightly system-tests workflow~~ ci: add nightly system-tests workflow Feb 18, 2026

nccatoni and others added 6 commits February 18, 2026 15:40

Pass secrets explicitly to reusable workflow

2f73f4b

Co-authored-by: Cursor <cursoragent@cursor.com>

Only pass TEST_OPTIMIZATION_API_KEY secret

34d46d2

Co-authored-by: Cursor <cursoragent@cursor.com>

tmp: add pull_request trigger for testing

420b281

Co-authored-by: Cursor <cursoragent@cursor.com>

Set desired_execution_time to 300s

1185a36

Co-authored-by: Cursor <cursoragent@cursor.com>

Also pass DD_API_KEY secret

4af9b76

Co-authored-by: Cursor <cursoragent@cursor.com>

nccatoni changed the title ~~ci: add nightly system-tests workflow~~ ci: upload system-tests results to Test Optimization and add tracer-release nightly Feb 26, 2026

nccatoni added 2 commits February 26, 2026 16:54

ci: fix after_script by using upload script directly

42bcd97

The silent wrapper's grep returns exit code 1 when no summary lines are found, causing the after_script to fail. Use the non-silent version with || true to prevent after_script failures. Made-with: Cursor

nccatoni added 3 commits March 2, 2026 14:20

Run only tracer-release on schedule, it already includes other groups

34f79e3

Made-with: Cursor

Temporarily run tracer-release on all triggers for testing

bafec28

Made-with: Cursor

Tracer release scenario group on main

578e0b6

nccatoni force-pushed the add-nightly-system-tests branch from 9a812c9 to 578e0b6 Compare March 5, 2026 11:08

nccatoni added 2 commits March 5, 2026 12:11

Test

4db0655

Remove ref argument

a65519d

nccatoni marked this pull request as ready for review March 5, 2026 12:06

nccatoni requested a review from a team as a code owner March 5, 2026 12:06

chatgpt-codex-connector bot reviewed Mar 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: upload system-tests results to Test Optimization and add tracer-release nightly#3660

ci: upload system-tests results to Test Optimization and add tracer-release nightly#3660
nccatoni wants to merge 15 commits intomasterfrom
add-nightly-system-tests

nccatoni commented Feb 18, 2026 •

edited

Loading

Uh oh!

datadog-official bot commented Feb 18, 2026 •

edited by datadog-datadog-prod-us1-2 bot

Loading

Uh oh!

codecov-commenter commented Feb 18, 2026 •

edited

Loading

Uh oh!

pr-commenter bot commented Feb 26, 2026 •

edited

Loading

Uh oh!

nccatoni commented Mar 5, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 5, 2026

Uh oh!

chatgpt-codex-connector bot Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		/tmp/datadog-ci junit upload \
		system-tests/logs*/reportJunit.xml \

Conversation

nccatoni commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Reviewer checklist

Uh oh!

datadog-official bot commented Feb 18, 2026 • edited by datadog-datadog-prod-us1-2 bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ Warnings

ℹ️ Info

Uh oh!

codecov-commenter commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

pr-commenter bot commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks [ tracer ]

scenario:MessagePackSerializationBench/benchMessagePackSerialization

scenario:MessagePackSerializationBench/benchMessagePackSerialization-opcache

scenario:PDOBench/benchPDOBaseline-opcache

scenario:PHPRedisBench/benchRedisBaseline

scenario:PHPRedisBench/benchRedisOverhead

scenario:SamplingRuleMatchingBench/benchRegexMatching1

scenario:SamplingRuleMatchingBench/benchRegexMatching1-opcache

scenario:SamplingRuleMatchingBench/benchRegexMatching2

scenario:SamplingRuleMatchingBench/benchRegexMatching2-opcache

scenario:SamplingRuleMatchingBench/benchRegexMatching3

scenario:SamplingRuleMatchingBench/benchRegexMatching3-opcache

scenario:SamplingRuleMatchingBench/benchRegexMatching4

scenario:SamplingRuleMatchingBench/benchRegexMatching4-opcache

scenario:SymfonyBench/benchSymfonyDdprof-opcache

Uh oh!

nccatoni commented Mar 5, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nccatoni commented Feb 18, 2026 •

edited

Loading

datadog-official bot commented Feb 18, 2026 •

edited by datadog-datadog-prod-us1-2 bot

Loading

codecov-commenter commented Feb 18, 2026 •

edited

Loading

pr-commenter bot commented Feb 26, 2026 •

edited

Loading