🤖 fix: de-flake file editing tests with increased timeouts (#829)
## Summary
Fixes flaky integration tests in `runtimeFileEditing.test.ts` that were
timing out waiting for Anthropic API responses.
## Root Cause
CI runners are slower than local dev machines due to:
- Shared VMs with less CPU/memory
- Higher network latency to Anthropic API
- No prompt cache benefit: Anthropic's cache requires 2048+ tokens, and our
test prompts are only ~200-500 tokens
- 4 concurrent tests × 2 runtime types = 8 parallel API calls
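The last two points are simple arithmetic, sketched below for reference. The constant and function names are hypothetical and are not taken from the test suite; the figures (2048-token minimum, 4 tests, 2 runtime types) are the ones quoted in the list above.

```typescript
// Illustrative only: names are hypothetical, numbers come from this description.
const MIN_CACHEABLE_TOKENS = 2048; // cache minimum as cited in this PR
const CONCURRENT_TESTS = 4;
const RUNTIME_TYPES = ["local", "ssh"] as const;

// Total API calls in flight when every test runs once per runtime type.
function parallelApiCalls(testsPerRuntime: number, runtimeCount: number): number {
  return testsPerRuntime * runtimeCount;
}

// A prompt can only benefit from caching if it reaches the minimum size.
function isCacheEligible(
  promptTokens: number,
  minTokens: number = MIN_CACHEABLE_TOKENS,
): boolean {
  return promptTokens >= minTokens;
}

console.log(parallelApiCalls(CONCURRENT_TESTS, RUNTIME_TYPES.length)); // 8
console.log(isCacheEligible(500)); // false
```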
## Changes
- Increased stream timeout: 15s → 30s (local), 25s → 45s (SSH)
- Increased test timeout: 25s → 45s (local), 60s → 90s (SSH)
- Added `configureTestRetries(3)` to handle occasional API hiccups
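For readers unfamiliar with the setup, here is a rough sketch of what these values look like as configuration. Everything in it is an assumption for illustration: `TIMEOUTS`, `TimeoutConfig`, and `withRetries` are hypothetical names, and `withRetries` is a generic retry loop, not the implementation of `configureTestRetries`. It treats its argument as the total number of attempts, which may differ from how the real helper counts retries.

```typescript
// Hypothetical sketch; the real test file may organize these values differently.
type RuntimeType = "local" | "ssh";

interface TimeoutConfig {
  streamTimeoutMs: number; // how long to wait for the streamed API response
  testTimeoutMs: number;   // overall per-test budget
}

// New values from this change (previously 15s/25s local and 25s/60s SSH).
const TIMEOUTS: Record<RuntimeType, TimeoutConfig> = {
  local: { streamTimeoutMs: 30_000, testTimeoutMs: 45_000 },
  ssh: { streamTimeoutMs: 45_000, testTimeoutMs: 90_000 },
};

// Generic retry loop: runs fn up to `attempts` times and rethrows the last error.
async function withRetries<T>(attempts: number, fn: () => Promise<T>): Promise<T> {
  let lastError: unknown = new Error("withRetries: attempts must be at least 1");
  for (let i = 0; i < attempts; i++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err; // remember the failure and try again
    }
  }
  throw lastError;
}
```

In both runtime types the stream timeout stays below the test timeout, so a stalled stream fails with a specific error before the test runner's generic timeout fires.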
## Why not switch models or use 1h cache TTL?
- Tried codex-mini but it struggles with file editing tool calls
- Anthropic's 1h cache TTL won't help: caching requires a 2048+ token
minimum, and our prompts are too short to qualify
_Generated with `mux`_