Track strings to calculate codeSurvivalRate #2430

hediet · 2025-12-05T11:18:42Z

Supersedes #2365. Fixes microsoft/vscode#281044

Copilot

Pull request overview

This PR enhances edit survival tracking by capturing the actual text content at three key stages: before AI edits, after AI edits, and after user edits. This data enables better analysis of what code changes survive and how users modify AI-generated code.

Key Changes:

Extends EditSurvivalTracker.computeTrackedEditsSurvivalScore() to return text arrays for each edit region
Adds text capture fields to EditSurvivalResult interface
Updates telemetry events across multiple tools to include serialized text arrays
Adds comprehensive test coverage for the new functionality

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`src/platform/editSurvivalTracking/common/editSurvivalTracker.ts`	Collects text snapshots from three stages (before AI edits, after AI edits, after user edits) and returns them as arrays alongside existing survival scores
`src/platform/editSurvivalTracking/common/editSurvivalReporter.ts`	Extends `EditSurvivalResult` interface with optional text array fields and passes them through to telemetry
`src/extension/tools/node/applyPatchTool.tsx`	Adds new internal telemetry event with JSON-stringified text arrays for apply patch operations
`src/extension/tools/node/abstractReplaceStringTool.tsx`	Adds new internal telemetry event with JSON-stringified text arrays for string replace operations
`src/extension/prompts/node/codeMapper/codeMapperService.ts`	Updates existing internal telemetry to include JSON-stringified text arrays
`src/extension/test/node/utils.spec.ts`	Adds three new test cases validating text capture functionality with inline snapshots, refactors existing test to use expect/toMatchInlineSnapshot pattern

Copilot · 2025-12-05T11:23:13Z

src/extension/test/node/utils.spec.ts

+			textBeforeAiEdits: 'text1BeforeAiEdits',
+			textAfterAiEdits: 'text2AfterAiEdits',
+			textAfterUserEdits: 'text3AfterUserEdits',


[nitpick] The renameProps function is a test helper that renames properties in the test output to make the snapshots more readable (e.g., textBeforeAiEdits → text1BeforeAiEdits). However, the naming convention with numbered prefixes (text1, text2, text3) is a bit unclear. Consider using more descriptive names like textOriginal, textAiEdited, textUserEdited to make the test snapshots more self-documenting.

Suggested change

textBeforeAiEdits: 'text1BeforeAiEdits',

textAfterAiEdits: 'text2AfterAiEdits',

textAfterUserEdits: 'text3AfterUserEdits',

textBeforeAiEdits: 'textOriginal',

textAfterAiEdits: 'textAiEdited',

textAfterUserEdits: 'textUserEdited',

Copilot · 2025-12-05T11:23:13Z

src/platform/editSurvivalTracking/common/editSurvivalTracker.ts

 	 * fourGram: Number between 0 (no edits survived) and 1 (all edits survived).
 	 * noRevert: Number between 0 (the text after user edits equals the text before the AI edits) and 1 (the text after user edits does not revert any text to the initial state)


The JSDoc comment describes the return value as having fourGram and noRevert properties, but the function now also returns textBeforeAiEdits, textAfterAiEdits, and textAfterUserEdits. The documentation should be updated to describe these new fields:

/** * Computes survival scores and text snapshots for tracked edits. * * @returns An object containing: * - fourGram: Number between 0 (no edits survived) and 1 (all edits survived). * - noRevert: Number between 0 (the text after user edits equals the text before the AI edits) and 1 (the text after user edits does not revert any text to the initial state) * - textBeforeAiEdits: Array of text strings before AI edits were applied for each edit region * - textAfterAiEdits: Array of text strings after AI edits were applied for each edit region * - textAfterUserEdits: Array of text strings after user edits were applied for each edit region */

Suggested change

* fourGram: Number between 0 (no edits survived) and 1 (all edits survived).

* noRevert: Number between 0 (the text after user edits equals the text before the AI edits) and 1 (the text after user edits does not revert any text to the initial state)

* Computes survival scores and text snapshots for tracked edits.

*

* @returns An object containing:

* - fourGram: Number between 0 (no edits survived) and 1 (all edits survived).

* - noRevert: Number between 0 (the text after user edits equals the text before the AI edits) and 1 (the text after user edits does not revert any text to the initial state)

* - textBeforeAiEdits: Array of text strings before AI edits were applied for each edit region.

* - textAfterAiEdits: Array of text strings after AI edits were applied for each edit region.

* - textAfterUserEdits: Array of text strings after user edits were applied for each edit region.

Copilot · 2025-12-05T11:23:14Z

src/platform/editSurvivalTracking/common/editSurvivalReporter.ts

+	/**
+	 * Text states for each edit region
+	 */
+	readonly textBeforeAiEdits?: string[];
+	readonly textAfterAiEdits?: string[];
+	readonly textAfterUserEdits?: string[];


[nitpick] Consider adding documentation about potential data size implications when serializing textBeforeAiEdits, textAfterAiEdits, and textAfterUserEdits arrays. For files with many edits or large edit regions, the JSON.stringify() calls could produce very large strings in telemetry events. Consider adding a note in the JSDoc about this being intended for internal-only telemetry, or potentially adding size limits/truncation logic if this becomes an issue.

xuga and others added 3 commits December 3, 2025 10:52

Track strings to calculate codeSurvivalRate

dd4fab8

Update test for the new telemetry adding

2fde3c1

Sends data to internal telemetry, adds more tests

7ebcdca

Copilot AI review requested due to automatic review settings December 5, 2025 11:18

Copilot started reviewing on behalf of hediet December 5, 2025 11:19 View session

vs-code-engineering bot added this to the November 2025 milestone Dec 5, 2025

vs-code-engineering bot assigned hediet Dec 5, 2025

Copilot finished reviewing on behalf of hediet December 5, 2025 11:21

Copilot AI reviewed Dec 5, 2025

View reviewed changes

chrmarti approved these changes Dec 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Track strings to calculate codeSurvivalRate #2430

Track strings to calculate codeSurvivalRate #2430

hediet commented Dec 5, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 5, 2025

Uh oh!

Copilot AI Dec 5, 2025

Uh oh!

Copilot AI Dec 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		* fourGram: Number between 0 (no edits survived) and 1 (all edits survived).
		* noRevert: Number between 0 (the text after user edits equals the text before the AI edits) and 1 (the text after user edits does not revert any text to the initial state)

-	 * fourGram: Number between 0 (no edits survived) and 1 (all edits survived).
-	 * noRevert: Number between 0 (the text after user edits equals the text before the AI edits) and 1 (the text after user edits does not revert any text to the initial state)
+	 * Computes survival scores and text snapshots for tracked edits.
+	 *
+	 * @returns An object containing:
+	 * - fourGram: Number between 0 (no edits survived) and 1 (all edits survived).
+	 * - noRevert: Number between 0 (the text after user edits equals the text before the AI edits) and 1 (the text after user edits does not revert any text to the initial state)
+	 * - textBeforeAiEdits: Array of text strings before AI edits were applied for each edit region.
+	 * - textAfterAiEdits: Array of text strings after AI edits were applied for each edit region.
+	 * - textAfterUserEdits: Array of text strings after user edits were applied for each edit region.

Track strings to calculate codeSurvivalRate #2430

Are you sure you want to change the base?

Track strings to calculate codeSurvivalRate #2430

Conversation

hediet commented Dec 5, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants