Hypothesis
Copilot code review drip-feeds ~2 comments per pass. If we request review 3 times before dispatching the responder, we might surface ~6-10 comments upfront. The responder could then fix them all in one shot instead of burning 3 attempts across 3 push-review cycles.
Community reports (GitHub Discussion #152385) confirm that re-requesting review surfaces comments on different files each time, even without a code change. This suggests Copilot's internal selection is non-deterministic and re-rolls each request.
What Needs Testing
Run a controlled experiment on a real PR:
- Open a PR with ~5-10 changed files
- Request Copilot code review → record comments (count, files, content)
- Wait for review to complete
- Re-request review (no push) → record NEW comments
- Wait for review to complete
- Re-request a third time → record NEW comments
- Analyze:
- Do comments accumulate or duplicate?
- Does Copilot respect "already reviewed this unchanged code"?
- What's the total unique comment count across 3 passes?
- Are later passes higher or lower quality?
Open Questions
- Cost: Each review costs a premium request. 3x reviews per PR triples review cost. Is the tradeoff worth it?
- Deduplication: Does Copilot post the same comment twice if you re-request? Or does it skip already-posted comments?
- Timing: How long between requests? Does Copilot need to fully complete before re-requesting?
- Diminishing returns: If pass 1 gives 2 comments and pass 2 gives 2, does pass 3 give 0?
- API mechanism: Can we programmatically request re-review via the GitHub API, or does it require the UI?
Implementation (if experiment succeeds)
The orchestrator would change from:
PR opened → wait for review → dispatch responder
To:
PR opened → request review → wait → re-request → wait → re-request → wait → dispatch responder (with all accumulated comments)
This could be implemented as a pre-responder step in the orchestrator, or as a separate "review accumulator" workflow.
Related
Hypothesis
Copilot code review drip-feeds ~2 comments per pass. If we request review 3 times before dispatching the responder, we might surface ~6-10 comments upfront. The responder could then fix them all in one shot instead of burning 3 attempts across 3 push-review cycles.
Community reports (GitHub Discussion #152385) confirm that re-requesting review surfaces comments on different files each time, even without a code change. This suggests Copilot's internal selection is non-deterministic and re-rolls each request.
What Needs Testing
Run a controlled experiment on a real PR:
Open Questions
Implementation (if experiment succeeds)
The orchestrator would change from:
To:
This could be implemented as a pre-responder step in the orchestrator, or as a separate "review accumulator" workflow.
Related