## Summary
`autoskillit init` creates files in `.autoskillit/` via independent
helper functions, but the gitignore writer (`ensure_project_temp`) had
no structural contract with the file creators
(`_create_secrets_template`). Tests verified each function in isolation
— no test ever checked the cross-cutting invariant: **every sensitive
file placed into `.autoskillit/` must be covered by
`.autoskillit/.gitignore`**.
This PR adds structural immunity so that any future file added to the
init flow without updating the gitignore or committed-files allowlist
will be caught automatically by both CI tests and the doctor command.
### Changes
- **`core/io.py`**: Added `_COMMITTED_BY_DESIGN` frozenset allowlist for
intentionally committed files (`config.yaml`, `recipes`)
- **`core/__init__.py`**: Re-exported `_AUTOSKILLIT_GITIGNORE_ENTRIES`
and `_COMMITTED_BY_DESIGN` for cross-package access
- **`cli/_doctor.py`**: Added `_check_gitignore_completeness` (check 9)
— warns when `.autoskillit/` files aren't covered by gitignore or
allowlist; two-pass scan covers both filesystem and canonical entries
- **`tests/cli/test_init.py`**: 4 new structural immunity tests (dynamic
file discovery, comment truthfulness, 2 regression guards)
- **`tests/cli/test_doctor.py`**: 2 new doctor check tests + updated
expected check set + fixed healthy doctor test setup
- **`CLAUDE.md`**: Updated doctor check count from 8 to 9
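The two-pass scan in check 9 can be sketched as follows. This is a minimal illustration, not the shipped `_check_gitignore_completeness`: gitignore matching here is exact-entry only, and everything besides the two named constants is an assumption.

```python
from pathlib import Path

# Stand-ins mirroring the constants named above (core/io.py).
_AUTOSKILLIT_GITIGNORE_ENTRIES = frozenset({"temp/", ".secrets.yaml"})
_COMMITTED_BY_DESIGN = frozenset({"config.yaml", "recipes"})

def check_gitignore_completeness(root: Path) -> list[str]:
    """Return warning messages for uncovered files (sketch of check 9)."""
    warnings: list[str] = []
    auto_dir = root / ".autoskillit"
    if not auto_dir.is_dir():
        return warnings  # nothing to check
    gitignore = auto_dir / ".gitignore"
    if not gitignore.is_file():
        return [".autoskillit/.gitignore is missing"]
    entries = {line.strip() for line in gitignore.read_text().splitlines()
               if line.strip() and not line.startswith("#")}
    # Pass 1: every file on disk must be ignored or allowlisted.
    for item in auto_dir.iterdir():
        name = item.name + "/" if item.is_dir() else item.name
        if item.name == ".gitignore" or item.name in _COMMITTED_BY_DESIGN:
            continue
        if name not in entries and item.name not in entries:
            warnings.append(f"{name} not covered by .autoskillit/.gitignore")
    # Pass 2: every canonical entry must actually be present.
    for entry in _AUTOSKILLIT_GITIGNORE_ENTRIES:
        if entry not in entries:
            warnings.append(f"canonical entry {entry!r} missing from .gitignore")
    return warnings
```

Any file added to the init flow without a matching gitignore or allowlist update surfaces as a warning from pass 1.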
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
START([autoskillit init])
DOCTOR_START([autoskillit doctor])
subgraph InitFlow ["Init Flow — _register_all()"]
direction TB
EPT["● ensure_project_temp<br/>━━━━━━━━━━<br/>Creates temp/ + .gitignore<br/>Reads _AUTOSKILLIT_GITIGNORE_ENTRIES<br/>Backfills missing entries"]
GI_EXISTS{"gitignore<br/>exists?"}
GI_WRITE["Write all entries"]
GI_BACKFILL["Backfill missing entries"]
CST["_create_secrets_template<br/>━━━━━━━━━━<br/>Creates .secrets.yaml<br/>Comment: already in .gitignore"]
end
subgraph Registry ["● Registry — core/io.py"]
direction TB
ENTRIES["● _AUTOSKILLIT_GITIGNORE_ENTRIES<br/>━━━━━━━━━━<br/>temp/, .secrets.yaml"]
ALLOW["● _COMMITTED_BY_DESIGN<br/>━━━━━━━━━━<br/>config.yaml, recipes"]
end
subgraph DoctorCheck9 ["● Doctor Check 9 — _check_gitignore_completeness"]
direction TB
D_DIR{".autoskillit/<br/>exists?"}
D_GI{".gitignore<br/>exists?"}
D_SCAN["● Enumerate .autoskillit/ files<br/>━━━━━━━━━━<br/>Skip .gitignore itself<br/>Skip _COMMITTED_BY_DESIGN<br/>Check remaining vs gitignore"]
D_CANON["● Check canonical entries<br/>━━━━━━━━━━<br/>Verify every entry in<br/>_AUTOSKILLIT_GITIGNORE_ENTRIES<br/>is present in .gitignore"]
D_RESULT{"uncovered<br/>files?"}
end
subgraph TestGates ["● Test Gates — Structural Immunity"]
direction TB
T1["● test_init_all_created_files<br/>_covered_by_gitignore<br/>━━━━━━━━━━<br/>Dynamic file discovery"]
T2["● test_secrets_template<br/>_gitignore_comment_is_true<br/>━━━━━━━━━━<br/>Comment truthfulness"]
T3["● test_gitignore_entries<br/>_includes_secrets_yaml<br/>━━━━━━━━━━<br/>Regression guard"]
T4["● test_doctor_warns_on<br/>_missing_gitignore_entry<br/>━━━━━━━━━━<br/>Doctor check validation"]
end
OK_INIT([INIT COMPLETE])
OK_DOCTOR([Severity.OK])
WARN_DOCTOR([Severity.WARNING])
START --> EPT
EPT --> GI_EXISTS
GI_EXISTS -->|"no"| GI_WRITE
GI_EXISTS -->|"yes"| GI_BACKFILL
GI_WRITE --> CST
GI_BACKFILL --> CST
CST --> OK_INIT
EPT -.->|"reads"| ENTRIES
GI_WRITE -.->|"reads"| ENTRIES
GI_BACKFILL -.->|"reads"| ENTRIES
DOCTOR_START --> D_DIR
D_DIR -->|"no"| OK_DOCTOR
D_DIR -->|"yes"| D_GI
D_GI -->|"no"| WARN_DOCTOR
D_GI -->|"yes"| D_SCAN
D_SCAN --> D_CANON
D_CANON --> D_RESULT
D_RESULT -->|"none"| OK_DOCTOR
D_RESULT -->|"found"| WARN_DOCTOR
D_SCAN -.->|"reads"| ALLOW
D_CANON -.->|"reads"| ENTRIES
ENTRIES -.->|"validated by"| T1
ENTRIES -.->|"validated by"| T3
ALLOW -.->|"used by"| T1
CST -.->|"validated by"| T2
D_SCAN -.->|"validated by"| T4
class START,DOCTOR_START,OK_INIT,OK_DOCTOR terminal;
class WARN_DOCTOR detector;
class EPT,CST,GI_WRITE,GI_BACKFILL handler;
class GI_EXISTS,D_DIR,D_GI,D_RESULT stateNode;
class ENTRIES,ALLOW stateNode;
class D_SCAN,D_CANON newComponent;
class T1,T2,T3,T4 newComponent;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start, complete, and error states |
| Orange | Handler | Init flow processing nodes |
| Teal | State | Registry constants and decision points |
| Green | New/Modified | New doctor check logic and test gates |
| Red | Detector | Warning outcomes |
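The dynamic-discovery test gate (T1 above) reduces to an invariant over whatever init actually created. A simplified sketch — the helper name is hypothetical, and the real test in `tests/cli/test_init.py` runs the init flow first:

```python
from pathlib import Path

def assert_all_init_files_covered(auto_dir: Path,
                                  gitignore_entries: frozenset[str],
                                  committed_by_design: frozenset[str]) -> None:
    """Every file init places in .autoskillit/ must be gitignored or
    explicitly allowlisted; discovery is dynamic, so new files can't
    slip past the test unnoticed."""
    covered = gitignore_entries | committed_by_design | {".gitignore"}
    for item in auto_dir.iterdir():
        name = item.name + "/" if item.is_dir() else item.name
        assert name in covered or item.name in covered, (
            f"{name} created by init but not in gitignore or allowlist"
        )
```

Because the loop enumerates the directory rather than a hardcoded file list, adding a new file to init without updating either constant fails this assertion immediately.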
## Implementation Plan
Plan file:
`temp/rectify/rectify_init_gitignore_immunity_2026-03-19_190700.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…nnecessary permissions
## Summary
The token summary vanished from pipeline-created PRs because its
injection protocol lived exclusively in recipe `note:` fields — prose
documentation addressed to the LLM orchestrator, not executable recipe
steps. There was no runtime enforcement and no test sensitivity to the
WARNING-severity semantic rule that fired on all three bundled
production recipes.
The fix adds two independent immunity mechanisms: (1) `open-pr` now
self-retrieves token telemetry from disk, using `cwd` as a pipeline-run
scoping key — this required adding a `cwd_filter` parameter to the shared
session-log iterator — so cross-process telemetry access is typed,
testable, and no longer depends on orchestrator compliance; and (2) the
test suite now asserts zero WARNING-level semantic findings on bundled
production recipes, so any future note-encoded protocol that generates a
WARNING fails CI immediately.
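The `cwd_filter` gate on the shared iterator could look roughly like this. A sketch only: the index field names (`cwd`, `dir_name`) follow the storage layout described in this PR, but the real implementation lives in `audit.py`.

```python
import json
from pathlib import Path
from typing import Iterator

def iter_session_log_entries(log_root: Path,
                             cwd_filter: str = "") -> Iterator[Path]:
    """Yield per-session token_usage.json paths from the global log,
    optionally scoped to one pipeline run via its cwd."""
    index = log_root / "sessions.jsonl"
    if not index.is_file():
        return
    for line in index.read_text().splitlines():
        if not line.strip():
            continue
        idx = json.loads(line)
        # Empty filter keeps the pre-existing global behavior
        # (server restart recovery is unchanged).
        if cwd_filter and idx.get("cwd") != cwd_filter:
            continue
        usage = log_root / "sessions" / idx["dir_name"] / "token_usage.json"
        if usage.is_file():
            yield usage
```

With `cwd_filter=""` the iterator behaves exactly as before, which is why the server-restart recovery path needs no change.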
## Architecture Impact
### Data Lineage Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart LR
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph Prod ["Pipeline Execution (every run_skill step)"]
STEP["run_skill invocation<br/>━━━━━━━━━━<br/>step_name = YAML step name<br/>cwd = work_dir (pipeline root)"]
SL["flush_session_log()<br/>━━━━━━━━━━<br/>execution/session_log.py<br/>called after session exits"]
end
subgraph Disk ["Global Log Storage<br/>~/.local/share/autoskillit/logs/"]
JSONL[("sessions.jsonl<br/>━━━━━━━━━━<br/>cwd · timestamp · dir_name<br/>step_name · token counts")]
TU[("sessions/dir/token_usage.json<br/>━━━━━━━━━━<br/>step_name · input_tokens<br/>output_tokens · timing_seconds")]
end
subgraph Iterator ["● Recovery Iterator (audit.py)"]
ITER["● _iter_session_log_entries()<br/>━━━━━━━━━━<br/>+ cwd_filter: str = ''<br/>skip if idx.cwd != cwd_filter<br/>yields Path to per-session files"]
end
subgraph TokenLog ["● DefaultTokenLog (tokens.py)"]
LOAD["● load_from_log_dir()<br/>━━━━━━━━━━<br/>+ cwd_filter: str = ''<br/>delegates to iterator"]
REPT["get_report() / compute_total()<br/>━━━━━━━━━━<br/>returns list of TokenEntry dicts<br/>+ aggregated total"]
end
subgraph OpenPR ["★ open-pr Skill (SKILL.md)"]
STEP0B["★ Step 0b: Self-Retrieve<br/>━━━━━━━━━━<br/>PIPELINE_CWD=$(pwd)<br/>python3: load_from_log_dir(cwd_filter=PIPELINE_CWD)"]
FMT["TelemetryFormatter.format_token_table()<br/>━━━━━━━━━━<br/>pipeline/telemetry_fmt.py<br/>→ markdown table string"]
BODY["PR Body Assembly<br/>━━━━━━━━━━<br/>## Token Usage Summary<br/>embedded markdown table"]
end
subgraph ServerRecovery ["Existing: Server Restart Recovery (_state.py)"]
SVREC["_initialize() on server start<br/>━━━━━━━━━━<br/>load_from_log_dir(since=24h)<br/>cwd_filter='' (global, unchanged)"]
end
subgraph Guards ["★ Structural Test Guards"]
T1["★ test_load_from_log_dir_cwd_filter<br/>━━━━━━━━━━<br/>tests/pipeline/test_tokens.py<br/>proves cross-pipeline isolation"]
T2["★ test_bundled_recipes_zero_warnings<br/>━━━━━━━━━━<br/>tests/recipe/test_bundled_recipes.py<br/>catches future WARNING regressions"]
end
STEP -->|"cwd=work_dir<br/>step_name=step"| SL
SL -->|"append index entry<br/>with cwd field"| JSONL
SL -->|"write telemetry file<br/>(when step_name set)"| TU
JSONL -->|"● read + filter by cwd"| ITER
TU -->|"read per-session telemetry"| ITER
ITER -->|"yield file paths"| LOAD
LOAD --> REPT
REPT -->|"steps + total"| STEP0B
STEP0B --> FMT
FMT --> BODY
JSONL -.->|"global since= filter only<br/>(unchanged)"| SVREC
T1 -.->|"validates cwd isolation"| LOAD
T2 -.->|"catches future WARNING<br/>note-protocol regressions"| JSONL
class STEP cli;
class SL handler;
class JSONL,TU stateNode;
class ITER,LOAD,REPT handler;
class STEP0B,T1,T2 newComponent;
class FMT phase;
class BODY output;
class SVREC phase;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Input | Pipeline step invocation (entry point) |
| Orange | Handler | Recovery iterator, token log, disk read processing |
| Teal | Storage | Primary disk storage (source of truth — sessions.jsonl, token_usage.json) |
| Purple | Phase | Formatting and existing server recovery path |
| Green (★/●) | New/Modified | Proposed new components: self-retrieval step + test guards |
| Dark Teal | Output | PR body as final data destination |
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
START([open-pr invoked by pipeline])
subgraph Guard ["Step 0: Stable Branch Guard"]
CHK_STABLE{"base_branch == stable<br/>AND head != main?"}
end
subgraph SelfRetrieval ["● Step 0b: Token Self-Retrieval (SKILL.md)"]
CWD["● Capture PIPELINE_CWD=$(pwd)<br/>━━━━━━━━━━<br/>scoping key for this run"]
LOAD_TOK["● load_from_log_dir(cwd_filter=PIPELINE_CWD)<br/>━━━━━━━━━━<br/>● pipeline/tokens.py + audit.py<br/>reads sessions.jsonl + token_usage.json"]
CHK_SESSIONS{"n sessions > 0?"}
FMT_TOK["TelemetryFormatter.format_token_table()<br/>━━━━━━━━━━<br/>TOKEN_SUMMARY_CONTENT = markdown table"]
NO_TOK["TOKEN_SUMMARY_CONTENT = ''<br/>━━━━━━━━━━<br/>section omitted gracefully"]
end
subgraph PRSetup ["Steps 1–5: Parse, Diff, Lenses"]
PARSE["Step 1: Parse args<br/>━━━━━━━━━━<br/>plan_paths · feature_branch<br/>base_branch · closing_issue"]
DIFF["Step 3: git diff<br/>━━━━━━━━━━<br/>new_files · modified_files"]
LENSES["Step 4–5: Arch-Lens Diagrams<br/>━━━━━━━━━━<br/>select 1–3 lenses<br/>validate ★/● markers"]
end
subgraph Body ["Step 6: Compose PR Body (● SKILL.md)"]
BODY["● Compose PR body<br/>━━━━━━━━━━<br/>Summary · Requirements<br/>Architecture Impact"]
CHK_TOK{"TOKEN_SUMMARY_CONTENT<br/>non-empty?"}
EMBED["Embed ## Token Usage Summary<br/>━━━━━━━━━━<br/>TOKEN_SUMMARY_CONTENT verbatim"]
SKIP_SEC["Omit section<br/>━━━━━━━━━━<br/>standalone invocation"]
end
subgraph GitHub ["Steps 7–8: GitHub PR"]
CHK_GH{"gh auth status<br/>exit 0?"}
CREATE["gh pr create<br/>━━━━━━━━━━<br/>--body-file pr_body.md"]
OUT_EMPTY["output: pr_url=<br/>━━━━━━━━━━<br/>graceful degradation"]
end
subgraph SemanticGuard ["● Semantic Validation Gate (CI)"]
VAL["validate_recipe()<br/>━━━━━━━━━━<br/>● rules_graph.py<br/>telemetry-before-open-pr REMOVED"]
CHK_WARN{"warnings == 0?<br/>━━━━━━━━━━<br/>● test_bundled_recipes.py"}
PASS_CI["CI PASSES<br/>━━━━━━━━━━<br/>zero-warning guard satisfied"]
FAIL_CI["CI FAILS<br/>━━━━━━━━━━<br/>new note-protocol caught immediately"]
end
END_SUCCESS([PR URL returned])
ERROR([ERROR: invalid base_branch])
START --> CHK_STABLE
CHK_STABLE -->|"yes"| ERROR
CHK_STABLE -->|"no"| CWD
CWD --> LOAD_TOK
LOAD_TOK --> CHK_SESSIONS
CHK_SESSIONS -->|"yes"| FMT_TOK
CHK_SESSIONS -->|"no"| NO_TOK
FMT_TOK --> PARSE
NO_TOK --> PARSE
PARSE --> DIFF
DIFF --> LENSES
LENSES --> BODY
BODY --> CHK_TOK
CHK_TOK -->|"yes"| EMBED
CHK_TOK -->|"no"| SKIP_SEC
EMBED --> CHK_GH
SKIP_SEC --> CHK_GH
CHK_GH -->|"yes"| CREATE
CHK_GH -->|"no"| OUT_EMPTY
CREATE --> END_SUCCESS
OUT_EMPTY --> END_SUCCESS
VAL --> CHK_WARN
CHK_WARN -->|"== 0"| PASS_CI
CHK_WARN -->|"> 0"| FAIL_CI
class START,END_SUCCESS terminal;
class ERROR detector;
class CHK_STABLE,CHK_SESSIONS,CHK_TOK,CHK_GH,CHK_WARN stateNode;
class PARSE,DIFF,LENSES,BODY,CREATE handler;
class CWD,LOAD_TOK,FMT_TOK newComponent;
class NO_TOK,SKIP_SEC,OUT_EMPTY phase;
class EMBED output;
class VAL,PASS_CI,FAIL_CI phase;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start and end states |
| Red | Detector | Error terminal (branch guard failure) |
| Teal | State | Decision / routing nodes |
| Orange | Handler | Processing steps (parse, diff, compose, create) |
| Green (●) | Modified | New self-retrieval path: cwd capture, load_from_log_dir, format |
| Purple | Phase | Graceful degradation paths and semantic validation |
| Dark Teal | Output | Token summary embed into PR body |
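The zero-warning CI gate in the semantic validation subgraph reduces to a single assertion over validator findings. A sketch — the `Finding` shape is illustrative; the real test consumes `validate_recipe()` output:

```python
from dataclasses import dataclass

@dataclass
class Finding:
    severity: str   # e.g. "WARNING", "ERROR", "INFO"
    message: str

def assert_zero_warnings(findings: list[Finding]) -> None:
    """Fail fast if a bundled recipe produces any WARNING-level finding,
    so note-encoded protocols cannot silently reappear."""
    warnings = [f for f in findings if f.severity == "WARNING"]
    assert not warnings, "\n".join(f.message for f in warnings)
```

Any future `note:`-only protocol that trips a WARNING-severity rule on a bundled recipe now fails CI rather than degrading silently.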
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
subgraph WriteContracts ["INIT_ONLY Fields — sessions.jsonl (written once, never modified)"]
SESS_CWD["cwd<br/>━━━━━━━━━━<br/>pipeline work_dir at session exit<br/>WRITE: flush_session_log() only<br/>READ: ● _iter_session_log_entries (cwd_filter gate)"]
SESS_META["session_id · dir_name · timestamp<br/>step_name · input_tokens · output_tokens<br/>━━━━━━━━━━<br/>WRITE: flush_session_log() only<br/>READ: iterator for dir lookup + since filter"]
end
subgraph Protocols ["● Protocol Contracts (_type_protocols.py)"]
P_TOKEN["● TokenStore.load_from_log_dir<br/>━━━━━━━━━━<br/>log_root: Path<br/>since: str = ''<br/>+ cwd_filter: str = ''"]
P_AUDIT["● AuditStore.load_from_log_dir<br/>━━━━━━━━━━<br/>same signature update"]
P_TIMING["● TimingStore.load_from_log_dir<br/>━━━━━━━━━━<br/>same signature update"]
end
subgraph GateLayer ["● Isolation Gate — _iter_session_log_entries (audit.py)"]
GATE{"● cwd_filter non-empty?"}
SKIP["skip entry<br/>━━━━━━━━━━<br/>idx.cwd != cwd_filter<br/>cross-pipeline contamination blocked"]
PASS["yield path to<br/>token_usage.json<br/>━━━━━━━━━━<br/>matching entry only"]
COMPAT["cwd_filter = ''<br/>━━━━━━━━━━<br/>no filter applied<br/>backward-compatible"]
end
subgraph MutableState ["MUTABLE — DefaultTokenLog._entries (tokens.py)"]
LIVE["record(step_name, usage)<br/>━━━━━━━━━━<br/>live accumulation during pipeline<br/>_entries[step_name] += usage"]
LOAD["● load_from_log_dir(cwd_filter)<br/>━━━━━━━━━━<br/>disk recovery: rebuilds _entries<br/>from matching session files only"]
ENTRIES["_entries: dict[str, TokenEntry]<br/>━━━━━━━━━━<br/>key = step_name<br/>value = accumulated token counts"]
end
subgraph Derived ["DERIVED — computed, not stored"]
RPT["get_report() / compute_total()<br/>━━━━━━━━━━<br/>defensive copy of _entries<br/>regenerated on each call"]
end
subgraph RecoveryModes ["Two Distinct Recovery Contracts"]
GLOBAL["Server Restart Recovery<br/>━━━━━━━━━━<br/>_state.py: load_from_log_dir<br/>since=24h · cwd_filter=''<br/>global, all pipelines"]
SCOPED["● open-pr Self-Retrieval<br/>━━━━━━━━━━<br/>Step 0b: load_from_log_dir<br/>cwd_filter=PIPELINE_CWD<br/>scoped to this run only"]
end
SESS_CWD -->|"cwd field read by gate"| GATE
SESS_META -->|"dir_name + since used"| GATE
GATE -->|"cwd_filter='' (empty)"| COMPAT
GATE -->|"cwd_filter non-empty AND idx.cwd != filter"| SKIP
GATE -->|"cwd_filter non-empty AND idx.cwd == filter"| PASS
COMPAT -->|"yields all matching since"| LOAD
PASS -->|"yields scoped paths"| LOAD
LOAD -->|"accumulates into"| ENTRIES
LIVE -->|"live record into"| ENTRIES
ENTRIES -->|"read-only snapshot"| RPT
GLOBAL -->|"uses cwd_filter=''"| COMPAT
SCOPED -->|"uses cwd_filter=PIPELINE_CWD"| GATE
P_TOKEN -.->|"contract for"| LOAD
P_AUDIT -.->|"contract for"| LOAD
P_TIMING -.->|"contract for"| LOAD
class SESS_CWD detector;
class SESS_META stateNode;
class P_TOKEN,P_AUDIT,P_TIMING newComponent;
class GATE stateNode;
class SKIP detector;
class PASS,COMPAT handler;
class LIVE,LOAD,ENTRIES handler;
class RPT phase;
class GLOBAL cli;
class SCOPED newComponent;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Red | INIT_ONLY / Guard | `cwd` field (write-once), skip-entry enforcement |
| Teal | State / Gate | sessions.jsonl metadata, `cwd_filter` decision node |
| Green (●) | Modified | Protocol contracts, scoped recovery entry point |
| Orange | Handler | Iterator pass-through, live record, load_from_log_dir |
| Purple | Derived | Computed snapshots (get_report, compute_total) |
| Dark Blue | Recovery | Global server-restart recovery path |
Closes #441
## Implementation Plan
Plan file:
`temp/rectify/rectify_token-summary-note-protocol-immunity_2026-03-20_000000.md`
## Token Usage Summary
## token_summary
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
When `auto_merge == "true"` and `queue_available == "false"`, the pipeline
previously dropped the PR unmerged by routing the `route_queue_mode`
default case to `release_issue_success`. This is incorrect — GitHub's
`gh pr merge --squash --auto` works without a merge queue by merging
directly once required checks pass. The fix adds a `direct_merge` step
chain as the default route, mirroring the queue path's structure
(enable → poll → conflict-fix → re-push → re-enable). The same change is
applied in three recipes: `implementation.yaml`, `remediation.yaml`, and
`implementation-groups.yaml`.
## Requirements
### ROUTE — Recipe Routing Updates
- **REQ-ROUTE-001:** When `auto_merge` is `"true"` and `queue_available` is `"false"`, both `implementation.yaml` and `remediation.yaml` must route to a direct-merge step instead of skipping to cleanup/release.
- **REQ-ROUTE-002:** The direct-merge step must invoke `gh pr merge --squash --auto` to enable GitHub-native auto-merge regardless of merge queue presence.
- **REQ-ROUTE-003:** The direct-merge path must poll PR state for `merged` completion rather than calling `wait_for_merge_queue`.
### FAIL — Failure Handling
- **REQ-FAIL-001:** The direct-merge path must handle merge failures (e.g., conflicts from concurrent merges) with a resolve-and-retry pattern analogous to the queue ejection path.
- **REQ-FAIL-002:** If the direct merge fails due to a non-conflict error, the recipe must route to the same cleanup/release path used by the queue timeout case.
### DESC — Ingredient Description
- **REQ-DESC-001:** The `auto_merge` ingredient description in both `implementation.yaml` and `remediation.yaml` must be updated to reflect that it controls automatic merging after checks pass, not specifically merge queue enrollment.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
START([CI checks pass])
subgraph Detection ["Queue Detection"]
CMQ["check_merge_queue<br/>━━━━━━━━━━<br/>GraphQL: mergeQueue id?<br/>captures queue_available"]
RQM{"● route_queue_mode<br/>━━━━━━━━━━<br/>auto_merge? queue?"}
end
subgraph QueuePath ["Queue Path"]
EAM["enable_auto_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto<br/>→ enroll in queue"]
WFQ["wait_for_queue<br/>━━━━━━━━━━<br/>wait_for_merge_queue<br/>900 s timeout"]
REENROLL["reenroll_stalled_pr<br/>━━━━━━━━━━<br/>toggle_auto_merge"]
QEF["queue_ejected_fix<br/>━━━━━━━━━━<br/>resolve-merge-conflicts"]
RPQF["re_push_queue_fix<br/>━━━━━━━━━━<br/>push_to_remote"]
RMQ["reenter_merge_queue<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto"]
end
subgraph DirectPath ["★ Direct Merge Path (new)"]
DM["★ direct_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto<br/>direct (no queue)"]
WFDM["★ wait_for_direct_merge<br/>━━━━━━━━━━<br/>poll gh pr view<br/>10 s × 90 = 15 min"]
DMCF["★ direct_merge_conflict_fix<br/>━━━━━━━━━━<br/>resolve-merge-conflicts"]
RPDF["★ re_push_direct_fix<br/>━━━━━━━━━━<br/>push_to_remote"]
RDM["★ redirect_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto<br/>re-enable after fix"]
end
SUCCESS["release_issue_success<br/>━━━━━━━━━━<br/>label: staged"]
TIMEOUT["release_issue_timeout<br/>━━━━━━━━━━<br/>no staged label"]
FAILURE["release_issue_failure<br/>━━━━━━━━━━<br/>→ cleanup_failure"]
CONFIRM["confirm_cleanup<br/>━━━━━━━━━━<br/>delete clone? yes/no"]
DONE([done])
START --> CMQ
CMQ --> RQM
RQM -->|"auto_merge != 'true'"| CONFIRM
RQM -->|"queue_available == true"| EAM
RQM -->|"● default (was release_issue_success)"| DM
EAM --> WFQ
EAM -->|"on_failure"| CONFIRM
WFQ -->|"merged"| SUCCESS
WFQ -->|"ejected"| QEF
WFQ -->|"stalled"| REENROLL
WFQ -->|"timeout"| TIMEOUT
REENROLL --> WFQ
QEF -->|"resolved"| RPQF
QEF -->|"escalation_required"| FAILURE
RPQF --> RMQ
RMQ --> WFQ
DM --> WFDM
DM -->|"on_failure"| CONFIRM
WFDM -->|"merged"| SUCCESS
WFDM -->|"closed"| DMCF
WFDM -->|"timeout"| TIMEOUT
DMCF -->|"resolved"| RPDF
DMCF -->|"escalation_required"| FAILURE
RPDF --> RDM
RDM --> WFDM
SUCCESS --> CONFIRM
TIMEOUT --> CONFIRM
CONFIRM -->|"yes"| DONE
CONFIRM -->|"no"| DONE
class START,DONE terminal;
class CMQ phase;
class RQM stateNode;
class EAM,WFQ,REENROLL,QEF,RPQF,RMQ handler;
class DM,WFDM,DMCF,RPDF,RDM newComponent;
class CONFIRM detector;
class SUCCESS,TIMEOUT,FAILURE output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Pipeline start and end states |
| Purple | Phase | Queue detection and analysis nodes |
| Teal | State | `● route_queue_mode` — modified routing decision |
| Orange | Handler | Existing queue path steps |
| Green | New Component | `★` New direct-merge path steps added by this PR |
| Dark Teal | Output | Terminal release / timeout / failure states |
| Red | Detector | `confirm_cleanup` gate |
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 48, 'rankSpacing': 56, 'curve': 'basis'}}}%%
flowchart TB
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph Inputs ["INIT_ONLY — Ingredient Fields (set at recipe start)"]
direction LR
OPR["inputs.open_pr<br/>━━━━━━━━━━<br/>default: true<br/>SKIP_GATE for all PR steps"]
AM["● inputs.auto_merge<br/>━━━━━━━━━━<br/>default: true<br/>● desc: direct merge fallback"]
BB["inputs.base_branch<br/>━━━━━━━━━━<br/>merge target"]
IU["inputs.issue_url<br/>━━━━━━━━━━<br/>optional: release gate"]
end
subgraph Prerequisites ["SET_ONCE — Pipeline Context (captured before merge path)"]
direction LR
QA["context.queue_available<br/>━━━━━━━━━━<br/>SET: check_merge_queue<br/>READ: route_queue_mode"]
PRN["context.pr_number<br/>━━━━━━━━━━<br/>SET: extract_pr_number<br/>READ: direct_merge, wait_for_direct_merge"]
WD["context.work_dir<br/>━━━━━━━━━━<br/>SET: clone<br/>READ: all direct-merge steps"]
MT["context.merge_target<br/>━━━━━━━━━━<br/>SET: create_branch<br/>READ: re_push_direct_fix"]
end
subgraph Gate ["VALIDATION GATE — skip_when_false"]
GATE["inputs.open_pr == true<br/>━━━━━━━━━━<br/>Guards all 5 new steps:<br/>direct_merge, wait_for_direct_merge,<br/>direct_merge_conflict_fix,<br/>re_push_direct_fix, redirect_merge"]
end
subgraph RoutingDecision ["● route_queue_mode — State-Based Router"]
RQM["● route_queue_mode<br/>━━━━━━━━━━<br/>reads: inputs.auto_merge<br/>reads: context.queue_available<br/>● default → direct_merge (was: release_issue_success)"]
end
subgraph NewCaptures ["★ CAPTURE_RESULT — New Context Fields (direct merge path)"]
direction TB
DMS["★ direct_merge_state<br/>━━━━━━━━━━<br/>SET: wait_for_direct_merge<br/>values: merged | closed | timeout<br/>READ: on_result conditions"]
CER["conflict_escalation_required<br/>━━━━━━━━━━<br/>SET: direct_merge_conflict_fix<br/>values: true | false<br/>READ: on_result conditions"]
end
subgraph RouteContracts ["ON_RESULT CONTRACTS — Typed routing guards"]
direction TB
WFDMc["wait_for_direct_merge<br/>━━━━━━━━━━<br/>merged → release_issue_success<br/>closed → direct_merge_conflict_fix<br/>default → release_issue_timeout"]
DMCFc["direct_merge_conflict_fix<br/>━━━━━━━━━━<br/>escalation_required == true → release_issue_failure<br/>default → re_push_direct_fix"]
end
OPR -->|"evaluated each step"| GATE
AM -->|"read by"| RQM
QA -->|"read by"| RQM
BB -->|"passed to"| DMCFc
IU -->|"gates"| RouteContracts
PRN -->|"passed to"| Gate
WD -->|"passed to"| Gate
MT -->|"passed to"| Gate
GATE -->|"pass → execute"| RQM
RQM -->|"default route changed"| NewCaptures
DMS -->|"consumed by"| WFDMc
CER -->|"consumed by"| DMCFc
class OPR,AM,BB,IU detector;
class QA,PRN,WD,MT stateNode;
class GATE gap;
class RQM phase;
class DMS newComponent;
class CER handler;
class WFDMc,DMCFc output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Red | INIT_ONLY | Ingredient fields — set once at recipe start, never mutated |
| Teal | SET_ONCE | Pipeline context fields — captured before merge path, read by new steps |
| Yellow/Orange | SKIP_GATE | `inputs.open_pr` validation gate protecting all new steps |
| Purple | Modified Router | `● route_queue_mode` — routing logic changed by this PR |
| Green | New Capture | `★ direct_merge_state` — new field introduced by this PR |
| Orange | Existing Capture | `conflict_escalation_required` — existing field, also written by new step |
| Dark Teal | Route Contracts | `on_result` typed routing guards (consume captured fields) |
Closes #401
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-20260320-165150-373150/temp/make-plan/direct_merge_fallback_plan_2026-03-20_000001.md`
## Token Usage Summary
## Token Summary

(Token data accumulated server-side)
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
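The `wait_for_direct_merge` polling in the PR above (10 s × 90 = 15 min) can be sketched in Python. The real step is expressed in recipe YAML; the only CLI call assumed here is `gh pr view --json state`, whose `state` field is one of `OPEN`, `CLOSED`, or `MERGED`.

```python
import json
import subprocess
import time

def wait_for_direct_merge(pr_number: int, interval: int = 10,
                          attempts: int = 90) -> str:
    """Poll PR state until merged/closed or timeout (10 s x 90 = 15 min)."""
    for _ in range(attempts):
        out = subprocess.run(
            ["gh", "pr", "view", str(pr_number), "--json", "state"],
            capture_output=True, text=True, check=True,
        ).stdout
        state = json.loads(out)["state"]
        if state == "MERGED":
            return "merged"   # routes to release_issue_success
        if state == "CLOSED":
            return "closed"   # routes to direct_merge_conflict_fix
        time.sleep(interval)
    return "timeout"          # routes to release_issue_timeout
```

The three return values map one-to-one onto the `on_result` contract shown in the state lifecycle diagram.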
…lti-Issue Runs (#451)
## Summary
When `process-issues` orchestrates a multi-issue run, `claim_issue` is
currently deferred to each recipe's internal step graph. This leaves every
not-yet-started issue unclaimed and vulnerable to parallel pickup by
another session. The fix requires three coordinated changes:
1. **`claim_issue` MCP tool** gains an `allow_reentry: bool = False` parameter — when `True` and the in-progress label is already present, returns `claimed: true` (reentry) instead of `claimed: false` (blocked).
2. **`process-issues` skill** is refactored to claim all manifest issues upfront before dispatching any recipe, track which were successfully claimed, pass `upfront_claimed: "true"` as a recipe ingredient, and release uncompleted issues on fatal failure.
3. **Three recipes** (`implementation.yaml`, `remediation.yaml`, `implementation-groups.yaml`) each gain an `upfront_claimed` ingredient (default `"false"`) and pass it as `allow_reentry` to their `claim_issue` step so that a pre-claim by the orchestrator is recognized as "proceed" rather than "abort".
## Requirements
### BATCH
- **REQ-BATCH-001:** The `process-issues` skill must call `claim_issue` for every issue in the triage manifest before dispatching any recipe execution.
- **REQ-BATCH-002:** The system must iterate through all issues in the manifest and call `claim_issue` individually for each, collecting results before proceeding.
- **REQ-BATCH-003:** Issues where `claim_issue` returns `claimed: false` (already claimed by another session) must be excluded from the dispatch list and logged as skipped.
### COMPAT
- **REQ-COMPAT-001:** Per-recipe `claim_issue` steps must not abort when the in-progress label was already applied by the same orchestration session's upfront claim.
- **REQ-COMPAT-002:** The `claim_issue` tool's `on_result` routing in recipes must treat a pre-existing label applied by the current session as a proceed condition, not an escalate-stop condition.
- **REQ-COMPAT-003:** Single-issue recipe flows (no orchestrator) must continue to function identically to current behavior.
### RELEASE
- **REQ-RELEASE-001:** The `process-issues` skill must release all upfront-claimed but unprocessed issues when the orchestrator encounters a fatal failure.
- **REQ-RELEASE-002:** The system must track which issues were claimed upfront and which have been handed off to recipe execution, so that only uncompleted issues are released on failure.
- **REQ-RELEASE-003:** Issues that completed recipe execution (success or recipe-level failure with its own release) must not be double-released by the orchestrator cleanup.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([● process-issues invoked])
DONE([DONE])
ERROR([FATAL ERROR])
subgraph Phase0 ["Phase 0 — Parse & Discover"]
direction TB
Parse["Parse args<br/>━━━━━━━━━━<br/>--batch N, --dry-run,<br/>--comment, --merge-batch"]
Discover["Discover manifest<br/>━━━━━━━━━━<br/>triage_manifest_*.json"]
DryRun{"--dry-run?"}
Confirm{"User confirms Y/n?"}
end
subgraph Phase1 ["● Phase 1 — Upfront Claiming (MODIFIED)"]
direction TB
Flatten["● Flatten all issues<br/>━━━━━━━━━━<br/>Collect issues from<br/>all selected batches"]
ClaimCall["● claim_issue()<br/>━━━━━━━━━━<br/>allow_reentry=False<br/>for each issue"]
ClaimedDecision{"● claimed == true?"}
TrackClaimed["● pre_claimed_urls<br/>━━━━━━━━━━<br/>append issue_url"]
TrackSkipped["● Log skipped<br/>━━━━━━━━━━<br/>foreign session owns it"]
end
subgraph Phase2 ["● Phase 2 — Batch Dispatch (MODIFIED)"]
direction TB
BatchIssueLoop{"For each issue<br/>in batch (asc order)"}
InPreClaimed{"● issue_url in<br/>pre_claimed_urls?"}
SkipIssue["Skip issue<br/>━━━━━━━━━━<br/>not pre-claimed"]
LoadRecipe["load_recipe()<br/>━━━━━━━━━━<br/>● with upfront_claimed:<br/>'true' ingredient"]
MarkDone["● completed_urls<br/>━━━━━━━━━━<br/>append after recipe<br/>returns"]
end
subgraph RecipeInternal ["● Recipe Internal — claim_issue step (MODIFIED)"]
direction TB
RecipeClaim["● recipe claim_issue<br/>━━━━━━━━━━<br/>allow_reentry:<br/>inputs.upfront_claimed"]
ClaimTool["● claim_issue tool<br/>━━━━━━━━━━<br/>allow_reentry=True:<br/>label present→claimed=true<br/>allow_reentry=False:<br/>label present→claimed=false"]
ClaimResult{"result.claimed<br/>== true?"}
ProceedRecipe["compute_branch<br/>━━━━━━━━━━<br/>continue recipe..."]
EscalateStop["escalate_stop<br/>━━━━━━━━━━<br/>foreign claim detected"]
end
subgraph Phase3 ["● Phase 3 — Fatal Cleanup (NEW)"]
direction TB
Diff["● Compute uncompleted<br/>━━━━━━━━━━<br/>pre_claimed_urls −<br/>completed_urls"]
ReleaseLoop["● release_issue()<br/>━━━━━━━━━━<br/>for each uncompleted url"]
end
subgraph Phase4 ["Phase 4 — Summary"]
direction TB
WriteReport["Write process_report<br/>━━━━━━━━━━<br/>successes/failures/<br/>skipped counts"]
end
%% FLOW %%
START --> Parse --> Discover --> DryRun
DryRun -->|"yes"| DONE
DryRun -->|"no"| Confirm
Confirm -->|"n"| DONE
Confirm -->|"Y"| Flatten
Flatten --> ClaimCall
ClaimCall --> ClaimedDecision
ClaimedDecision -->|"true"| TrackClaimed --> ClaimCall
ClaimedDecision -->|"false"| TrackSkipped --> ClaimCall
ClaimCall -->|"all done"| BatchIssueLoop
BatchIssueLoop --> InPreClaimed
InPreClaimed -->|"no"| SkipIssue --> BatchIssueLoop
InPreClaimed -->|"yes"| LoadRecipe
LoadRecipe --> MarkDone --> BatchIssueLoop
BatchIssueLoop -->|"all done"| WriteReport --> DONE
LoadRecipe -->|"fatal error"| Diff
Diff --> ReleaseLoop --> ERROR
LoadRecipe -.->|"dispatches"| RecipeClaim
RecipeClaim --> ClaimTool --> ClaimResult
ClaimResult -->|"true"| ProceedRecipe
ClaimResult -->|"false"| EscalateStop
%% CLASS ASSIGNMENTS %%
class START,DONE,ERROR terminal;
class Parse,Discover handler;
class DryRun,Confirm stateNode;
class Flatten,ClaimCall,TrackClaimed,TrackSkipped newComponent;
class ClaimedDecision stateNode;
class BatchIssueLoop phase;
class InPreClaimed stateNode;
class SkipIssue detector;
class LoadRecipe,MarkDone handler;
class RecipeClaim,ClaimTool newComponent;
class ClaimResult stateNode;
class ProceedRecipe handler;
class EscalateStop detector;
class Diff,ReleaseLoop newComponent;
class WriteReport output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start, done, and error states |
| Teal | State | Decision and routing nodes |
| Purple | Phase | Control flow and loop nodes |
| Orange | Handler | Processing and execution nodes |
| Green | Modified/New | ● Components changed by this PR |
| Red | Detector | Validation gates and failure handling |
| Dark Teal | Output | Generated artifacts and results |
Closes #445
## Implementation Plan
Plan file:
`temp/make-plan/orchestrator_upfront_claim_plan_2026-03-20_171414.md`
## Token Usage Summary
## Token Usage Summary
| Step | input | output | cached | count | time |
|------|-------|--------|--------|-------|------|
| plan | 9.7k | 265.2k | 8.3M | 16 | 1h 47m |
| verify | 294 | 235.8k | 11.5M | 15 | 1h 20m |
| implement | 6.2k | 313.3k | 40.0M | 15 | 2h 42m |
| fix | 60 | 13.0k | 1.1M | 3 | 11m 58s |
| audit_impl | 132 | 73.4k | 2.5M | 9 | 25m 3s |
| open_pr | 370 | 207.5k | 10.8M | 14 | 1h 28m |
| review_pr | 203 | 259.5k | 5.7M | 8 | 1h 3m |
| resolve_review | 3.7k | 183.4k | 13.7M | 8 | 1h 21m |
| resolve_conflicts | 75 | 30.6k | 2.6M | 3 | 10m 44s
| | diagnose_ci | 22 | 7.3k | 466.1k | 1 | 2m 33s | | resolve_ci | 13 | 3.1k | 239.8k | 1 | 2m 4s | | **Total** | 20.7k | 1.6M | 97.1M | | 10h 35m | 🤖 Generated with [Claude Code](https://claude.com/claude-code) via AutoSkillit --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
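The upfront claim/release bookkeeping described in the summary above can be sketched in a few lines. This is illustrative only: the real orchestration lives in skill prose and MCP tool calls, and the `claim_issue` / `release_issue` / `run_recipe` callables here are hypothetical stand-ins for those tools.

```python
def run_batches(issues, claim_issue, release_issue, run_recipe):
    """Claim every manifest issue upfront, dispatch each claimed one,
    and release claimed-but-uncompleted issues on fatal failure."""
    pre_claimed, completed = [], []
    for issue in issues:  # REQ-BATCH-001/002: claim all before dispatching any
        result = claim_issue(issue["url"], allow_reentry=False)
        if result["claimed"]:
            pre_claimed.append(issue["url"])
        # else: another session owns it -> skipped (REQ-BATCH-003)
    try:
        for issue in issues:
            if issue["url"] not in pre_claimed:
                continue  # not ours to dispatch
            # recipe's own claim_issue step re-enters via allow_reentry
            run_recipe(issue, upfront_claimed="true")
            completed.append(issue["url"])
    except Exception:
        # REQ-RELEASE-001..003: release only claimed, uncompleted issues
        for url in set(pre_claimed) - set(completed):
            release_issue(url)
        raise
    return completed
```

Issues whose recipes already ran (`completed`) are excluded from the cleanup set, which is what prevents the double-release described in REQ-RELEASE-003.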
…#452)

## Summary

AutoSkillit automates code generation and commits at scale. Without a secret-scanning hook in the pre-commit pipeline, leaked credentials shift from "possible" to "inevitable." This plan adds a security gate to `autoskillit init` that checks for a known secret scanner in `.pre-commit-config.yaml` and — when absent — requires the user to type an explicit acknowledgment phrase before proceeding. The bypass decision is persisted to `.autoskillit/config.yaml` with a UTC timestamp. `autoskillit doctor` gains a new `secret_scanning_hook` check that reports `ERROR` when no scanner is detected.

## Architecture Impact

### Security Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
START([autoskillit init / doctor])
subgraph ConfigBoundary ["TRUST BOUNDARY 1: Pre-commit Config Parse"]
PCFile["● .pre-commit-config.yaml<br/>━━━━━━━━━━<br/>Untrusted filesystem input<br/>(may be absent, malformed, or missing scanner)"]
ParseYAML["● _detect_secret_scanner<br/>━━━━━━━━━━<br/>load_yaml() — parse repos[].hooks[].id<br/>Membership: _KNOWN_SCANNERS frozenset<br/>{gitleaks, detect-secrets, trufflehog, git-secrets}"]
ScanFound{Scanner<br/>found?}
end
subgraph GreenPath ["PASS PATH"]
GreenOK["● Print ✓ confirmation<br/>━━━━━━━━━━<br/>secret scanning: ✓ hook detected"]
end
subgraph TTYBoundary ["TRUST BOUNDARY 2: Interactive Session Gate"]
TTYCheck["● sys.stdin.isatty()<br/>━━━━━━━━━━<br/>Non-interactive: fail closed<br/>(CI, pipes, headless sessions)"]
TTYFail["ABORT — SystemExit(1)<br/>━━━━━━━━━━<br/>No scanner + non-interactive<br/>Cannot bypass this check"]
end
subgraph ConsentBoundary ["TRUST BOUNDARY 3: Typed Consent Gate"]
WarnBox["● Warning box + phrase display<br/>━━━━━━━━━━<br/>Shows exact bypass phrase required"]
UserInput["User types response<br/>━━━━━━━━━━<br/>input() → strip()"]
PhraseMatch{Exact phrase<br/>match?}
BadPhrase["ABORT — SystemExit(1)<br/>━━━━━━━━━━<br/>Phrase mismatch<br/>--force cannot bypass"]
end
subgraph AuditBoundary ["TRUST BOUNDARY 4: Bypass Audit Trail"]
LogBypass["● _log_secret_scan_bypass<br/>━━━━━━━━━━<br/>safety.secret_scan_bypass_accepted<br/>= UTC ISO timestamp → config.yaml<br/>(atomic_write)"]
end
subgraph RegisterFlow ["Post-Gate: _register_all"]
Hooks["sync_hooks_to_settings<br/>━━━━━━━━━━<br/>Hook registration"]
MCP["Register MCP server<br/>━━━━━━━━━━<br/>~/.claude.json"]
end
subgraph DoctorBoundary ["TRUST BOUNDARY 5: Doctor Observability Check"]
DocCheck["● _check_secret_scanning_hook<br/>━━━━━━━━━━<br/>Check 10 in run_doctor()<br/>Reuses _detect_secret_scanner()"]
DocOK["DoctorResult OK<br/>━━━━━━━━━━<br/>severity=ok<br/>check=secret_scanning_hook"]
DocError["● DoctorResult ERROR<br/>━━━━━━━━━━<br/>severity=error<br/>Message: add scanner to prevent credential leaks"]
end
END([Init complete / Doctor report])
START --> PCFile
PCFile --> ParseYAML
ParseYAML --> ScanFound
ScanFound -->|yes| GreenOK
ScanFound -->|no| TTYCheck
GreenOK --> Hooks
TTYCheck -->|no tty| TTYFail
TTYCheck -->|tty| WarnBox
WarnBox --> UserInput
UserInput --> PhraseMatch
PhraseMatch -->|wrong| BadPhrase
PhraseMatch -->|correct| LogBypass
LogBypass --> Hooks
Hooks --> MCP
MCP --> END
START --> DocCheck
DocCheck --> DocOK
DocCheck --> DocError
DocOK --> END
DocError --> END
class START,END terminal;
class PCFile phase;
class ParseYAML,TTYCheck,PhraseMatch detector;
class ScanFound phase;
class GreenOK,LogBypass,DocOK newComponent;
class WarnBox,UserInput newComponent;
class TTYFail,BadPhrase,DocError gap;
class Hooks,MCP handler;
class DocCheck newComponent;
```

### Operational Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph InitCLI ["● autoskillit init (app.py)"]
InitCmd["● autoskillit init<br/>━━━━━━━━━━<br/>--force --scope user|project"]
ConfigWrite["_generate_config_yaml<br/>━━━━━━━━━━<br/>Write .autoskillit/config.yaml"]
SecretGate["● _check_secret_scanning<br/>━━━━━━━━━━<br/>Gate: scanner present OR consent<br/>Aborts with SystemExit(1) on failure"]
RegisterAll["_register_all<br/>━━━━━━━━━━<br/>Hooks + MCP + summary"]
end
subgraph DoctorCLI ["● autoskillit doctor (_doctor.py)"]
DoctorCmd["● autoskillit doctor<br/>━━━━━━━━━━<br/>--output-json"]
ChecksExisting["Checks 1–9<br/>━━━━━━━━━━<br/>stale_mcp_servers, mcp_server_registered,<br/>autoskillit_on_path, project_config,<br/>version_consistency, hook_health,<br/>hook_registration, script_version_health,<br/>gitignore_completeness"]
CheckSecret["● Check 10: _check_secret_scanning_hook<br/>━━━━━━━━━━<br/>check=secret_scanning_hook<br/>Reuses _detect_secret_scanner()"]
DoctorOut["DoctorResult[]<br/>━━━━━━━━━━<br/>severity: ok | warning | error<br/>JSON or human-readable output"]
end
subgraph Config ["CONFIGURATION SOURCES (read)"]
PreCommit[".pre-commit-config.yaml<br/>━━━━━━━━━━<br/>Scanned for hook ids:<br/>gitleaks / detect-secrets<br/>trufflehog / git-secrets"]
ConfigYaml[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>safety.secret_scan_bypass_accepted<br/>= UTC ISO timestamp (bypass log)"]
end
subgraph TTYState ["RUNTIME STATE (read)"]
Stdin["sys.stdin.isatty()<br/>━━━━━━━━━━<br/>Interactive vs non-interactive<br/>Determines consent path"]
end
subgraph ObsOutputs ["OBSERVABILITY OUTPUTS (write)"]
InitOutput["● init stdout<br/>━━━━━━━━━━<br/>secret scanning: ✓ hook detected<br/>OR bypass: accepted — logged<br/>OR ERROR + SystemExit(1)"]
DoctorJSON["● doctor JSON / text<br/>━━━━━━━━━━<br/>{ check: secret_scanning_hook,<br/> severity: ok | error,<br/> message: ... }"]
end
InitCmd --> ConfigWrite
ConfigWrite --> SecretGate
SecretGate --> PreCommit
SecretGate --> Stdin
SecretGate --> ConfigYaml
SecretGate --> RegisterAll
SecretGate --> InitOutput
DoctorCmd --> ChecksExisting
DoctorCmd --> CheckSecret
CheckSecret --> PreCommit
ChecksExisting --> DoctorOut
CheckSecret --> DoctorOut
DoctorOut --> DoctorJSON
class InitCmd,DoctorCmd cli;
class ConfigWrite,RegisterAll,ChecksExisting handler;
class SecretGate,CheckSecret newComponent;
class PreCommit,ConfigYaml,Stdin stateNode;
class InitOutput,DoctorOut,DoctorJSON output;
```

Closes #448

## Implementation Plan
Plan file: `/home/talon/projects/autoskillit-runs/impl-448-20260320-180957-935972/temp/make-plan/init_secret_scanning_opt_in_plan_2026-03-20_180957.md`

## Token Usage Summary

| Step | input | output | cached | count | time |
|------|-------|--------|--------|-------|------|
| plan | 9.7k | 298.8k | 10.5M | 18 | 1h 56m |
| verify | 326 | 264.5k | 12.7M | 17 | 1h 34m |
| implement | 6.3k | 365.6k | 46.4M | 17 | 2h 57m |
| fix | 131 | 43.6k | 4.6M | 5 | 27m 26s |
| audit_impl | 146 | 82.3k | 2.8M | 10 | 27m 50s |
| open_pr | 404 | 222.9k | 12.1M | 15 | 1h 35m |
| review_pr | 226 | 287.1k | 6.4M | 9 | 1h 9m |
| resolve_review | 3.7k | 200.0k | 16.1M | 9 | 1h 32m |
| resolve_conflicts | 75 | 30.6k | 2.6M | 3 | 10m 44s |
| diagnose_ci | 36 | 9.0k | 670.5k | 2 | 3m 15s |
| resolve_ci | 13 | 3.1k | 239.8k | 1 | 2m 4s |
| **Total** | 21.1k | 1.8M | 115.2M | | 11h 57m |

🤖 Generated with [Claude Code](https://claude.com/claude-code) via AutoSkillit

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
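The scanner-detection core described above is small enough to sketch. Assumptions: the hook ids match the `_KNOWN_SCANNERS` set shown in the diagram, and the already-parsed `.pre-commit-config.yaml` dict is passed in (the real `_detect_secret_scanner` loads it via `load_yaml()` and may differ in shape).

```python
KNOWN_SCANNERS = frozenset({"gitleaks", "detect-secrets", "trufflehog", "git-secrets"})

def detect_secret_scanner(pre_commit_config):
    """Return the first known scanner hook id found in a parsed
    .pre-commit-config.yaml, or None. An absent or malformed config
    yields None, so the gate fails closed."""
    if not isinstance(pre_commit_config, dict):
        return None
    for repo in pre_commit_config.get("repos") or []:
        if not isinstance(repo, dict):
            continue
        for hook in repo.get("hooks") or []:
            if isinstance(hook, dict) and hook.get("id") in KNOWN_SCANNERS:
                return hook["id"]
    return None
```

Because the membership check is a frozenset lookup on hook ids, adding support for a new scanner is a one-line change.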
## Summary
The routing layer (`_prompts.py`, `sous-chef/SKILL.md`) uses only the
boolean `needs_retry`
field to decide whether a failed `run_skill` result should follow
`on_context_limit` or
`on_failure`. The `retry_reason` field is emitted in JSON output and
visible to the LLM as
prose, but the routing instructions never gate on its value.
This creates a structural blind spot: `RetryReason.RESUME` is used for
both "context/turn limit
with partial progress on disk" and "session never ran at all (empty
output, kill anomaly)".
When the session produced no output, the orchestrator nevertheless
routes to `on_context_limit`
(e.g., `retry_worktree`) which assumes partial work exists — leading it
to attempt continuation
of a worktree that contains no work.
The fix makes `retry_reason` a **routing discriminant** rather than mere
informational prose: it adds `RetryReason.EMPTY_OUTPUT` for kill-anomaly
retries under `NATURAL_EXIT` with no context-exhaustion evidence, and
updates the routing rules to gate `on_context_limit` specifically on
`retry_reason: resume`. Any future `RetryReason` value automatically
falls through to `on_failure` until explicitly added, so the fail-safe
default is conservative failure handling rather than an incorrect
resume attempt.
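As a sketch, the routing rule reads roughly as follows. The enum names come from this PR; the string values and the Python form are assumptions, since the actual routing is expressed as instructions in `_prompts.py` and `sous-chef/SKILL.md` rather than code.

```python
from enum import Enum

class RetryReason(str, Enum):
    # Names from the PR; string values are assumed for illustration.
    NONE = "none"
    RESUME = "resume"
    EMPTY_OUTPUT = "empty_output"  # new: clean exit, nothing on disk
    EARLY_STOP = "early_stop"
    ZERO_WRITES = "zero_writes"

def route(needs_retry: bool, retry_reason: RetryReason,
          has_on_context_limit: bool) -> str:
    if not needs_retry:
        return "done"
    # Gate on_context_limit on the specific reason, not the bare bool:
    # only RESUME implies partial progress worth continuing.
    if retry_reason is RetryReason.RESUME and has_on_context_limit:
        return "on_context_limit"
    # Fail-safe: every other reason, including any future value,
    # falls through to on_failure until explicitly added.
    return "on_failure"
```

The single `if` on `RetryReason.RESUME` is what makes unknown values safe by construction: nothing routes to the resume path unless someone deliberately widens the gate.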
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
START([Claude Code Process Exits])
subgraph Classification ["SESSION CLASSIFICATION — ● session.py"]
direction TB
P1["Phase 1: session.needs_retry property<br/>━━━━━━━━━━<br/>ERROR_MAX_TURNS OR _is_context_exhausted()<br/>→ (True, RESUME) always"]
KA{"● _is_kill_anomaly?<br/>━━━━━━━━━━<br/>subtype: EMPTY_OUTPUT/UNPARSEABLE/<br/>INTERRUPTED, or SUCCESS+empty"}
TERM{"TerminationReason?<br/>━━━━━━━━━━<br/>NATURAL_EXIT / COMPLETED /<br/>STALE / TIMED_OUT"}
CE{"● _is_context_exhausted?<br/>━━━━━━━━━━<br/>jsonl_context_exhausted OR<br/>marker in errors OR marker in result"}
RESUME_OUT["RetryReason.RESUME<br/>━━━━━━━━━━<br/>Partial progress on disk<br/>Context/turn limit confirmed"]
EO_OUT["● RetryReason.EMPTY_OUTPUT<br/>━━━━━━━━━━<br/>Clean exit, no output<br/>No partial progress on disk"]
ES_OUT["RetryReason.EARLY_STOP<br/>━━━━━━━━━━<br/>Model stopped voluntarily<br/>Non-empty output present"]
NONE_OUT["RetryReason.NONE<br/>━━━━━━━━━━<br/>No retry needed<br/>(success or terminal failure)"]
end
subgraph Packaging ["SKILL RESULT PACKAGING — core/_type_enums.py + _type_results.py"]
direction LR
SR["SkillResult<br/>━━━━━━━━━━<br/>needs_retry: bool<br/>● retry_reason: RetryReason<br/>(routing discriminant, not prose)"]
end
subgraph Routing ["● ORCHESTRATOR ROUTING — ● _prompts.py + ● sous-chef/SKILL.md"]
direction TB
RG{"● retry_reason value?<br/>━━━━━━━━━━<br/>Gate on specific value,<br/>not just needs_retry bool"}
HAS_OCL{"step defines<br/>on_context_limit?"}
OCL["→ on_context_limit<br/>━━━━━━━━━━<br/>retry_worktree or test<br/>Partial progress assumed"]
FAIL_FALL["→ on_failure<br/>━━━━━━━━━━<br/>All non-resume reasons<br/>No partial state assumed"]
end
START --> P1
P1 -->|"needs_retry=True<br/>(Phase 1 fires)"| RESUME_OUT
P1 -->|"needs_retry=False<br/>(proceed to Phase 2)"| TERM
TERM -->|"NATURAL_EXIT / COMPLETED"| KA
TERM -->|"STALE / TIMED_OUT"| NONE_OUT
KA -->|"yes + NATURAL_EXIT + rc=0"| CE
KA -->|"yes + COMPLETED"| RESUME_OUT
KA -->|"no + rc=0 + marker absent"| ES_OUT
KA -->|"no + channel confirmed"| NONE_OUT
CE -->|"● True — context exhausted"| RESUME_OUT
CE -->|"● False — no exhaustion signal"| EO_OUT
RESUME_OUT --> SR
EO_OUT --> SR
ES_OUT --> SR
NONE_OUT --> SR
SR -->|"needs_retry=True"| RG
SR -->|"needs_retry=False"| SUCCESS_END
RG -->|"● retry_reason = resume"| HAS_OCL
RG -->|"● retry_reason = empty_output<br/>early_stop / zero_writes"| FAIL_FALL
HAS_OCL -->|"yes"| OCL
HAS_OCL -->|"no"| FAIL_FALL
SUCCESS_END([success / terminal failure])
%% CLASS ASSIGNMENTS %%
class START terminal;
class P1 phase;
class TERM,KA detector;
class CE detector;
class RESUME_OUT,NONE_OUT,ES_OUT stateNode;
class EO_OUT newComponent;
class SR handler;
class RG,HAS_OCL detector;
class OCL output;
class FAIL_FALL gap;
class SUCCESS_END terminal;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Process entry and exit points |
| Purple | Phase | Control flow phase nodes |
| Red (dark) | Detector | Decision gates (_is_kill_anomaly, _is_context_exhausted, routing gate) |
| Teal (dark) | State | Existing RetryReason values (RESUME, EARLY_STOP, NONE) |
| Green | New Component | ● Modified: EMPTY_OUTPUT reason + updated routing |
| Orange | Handler | SkillResult contract |
| Dark Teal | Output | on_context_limit route (resume path) |
| Yellow | Gap | on_failure fallback for non-resumable retries |
### Error/Resilience Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
START([Headless Session Exits])
subgraph Classification ["FAILURE CLASSIFICATION — ● session.py"]
direction TB
PHASE1{"Phase 1 fires?<br/>━━━━━━━━━━<br/>ERROR_MAX_TURNS or<br/>_is_context_exhausted()"}
KA{"● _is_kill_anomaly?<br/>━━━━━━━━━━<br/>Empty output or<br/>unparseable result"}
CE{"● _is_context_exhausted?<br/>━━━━━━━━━━<br/>jsonl marker OR<br/>error list OR result text"}
RESUME["RetryReason.RESUME<br/>━━━━━━━━━━<br/>Resumable: context/turn limit<br/>OR infrastructure kill<br/>Partial progress on disk"]
EO["● RetryReason.EMPTY_OUTPUT<br/>━━━━━━━━━━<br/>Non-resumable: clean exit<br/>with no output produced<br/>No partial progress on disk"]
OTHER["Other RetryReason values<br/>━━━━━━━━━━<br/>EARLY_STOP, ZERO_WRITES,<br/>DRAIN_RACE, NONE"]
end
subgraph Guards ["COMPOSITION GUARDS — session.py"]
direction LR
CONTRA{"Contradiction?<br/>━━━━━━━━━━<br/>success=True AND<br/>needs_retry=True"}
DEADEND{"Dead-end?<br/>━━━━━━━━━━<br/>channel confirmed AND<br/>not success AND not retry"}
end
subgraph Routing ["● ORCHESTRATOR ROUTING — ● _prompts.py + ● sous-chef/SKILL.md"]
direction TB
NR{"needs_retry=True?"}
RR{"● retry_reason?<br/>━━━━━━━━━━<br/>Inspect value,<br/>not just bool"}
OCL_CHK{"Step defines<br/>on_context_limit?"}
OCL["→ on_context_limit<br/>━━━━━━━━━━<br/>retry_worktree / test<br/>Resume partial work"]
FAIL["→ on_failure<br/>━━━━━━━━━━<br/>Fresh restart<br/>or escalate"]
end
T_SUCCESS([SUCCEEDED])
T_TERMINAL([TERMINAL FAILURE])
START --> PHASE1
PHASE1 -->|"yes → RESUME"| RESUME
PHASE1 -->|"no → Phase 2"| KA
KA -->|"yes"| CE
KA -->|"no"| OTHER
CE -->|"● yes — exhausted"| RESUME
CE -->|"● no — clean exit"| EO
RESUME --> CONTRA
EO --> CONTRA
OTHER --> CONTRA
CONTRA -->|"demote success=False"| DEADEND
CONTRA -->|"no contradiction"| DEADEND
DEADEND -->|"yes → DRAIN_RACE"| NR
DEADEND -->|"no"| NR
NR -->|"False"| T_SUCCESS
NR -->|"False + terminal"| T_TERMINAL
NR -->|"True"| RR
RR -->|"● resume or drain_race"| OCL_CHK
RR -->|"● empty_output"| FAIL
RR -->|"● early_stop / zero_writes"| FAIL
OCL_CHK -->|"yes"| OCL
OCL_CHK -->|"no"| FAIL
OCL -->|"retry worktree"| START
FAIL -->|"recipe on_failure"| T_TERMINAL
%% CLASS ASSIGNMENTS %%
class START terminal;
class PHASE1,KA,CE detector;
class RESUME stateNode;
class EO newComponent;
class OTHER stateNode;
class CONTRA,DEADEND phase;
class NR,RR,OCL_CHK detector;
class OCL output;
class FAIL gap;
class T_SUCCESS,T_TERMINAL terminal;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start, success, and terminal failure states |
| Red (dark) | Detector | Classification gates and routing decision points |
| Purple | Phase | Composition guards (contradiction, dead-end) |
| Teal (dark) | State | Existing RetryReason values (RESUME, other) |
| Green | New Component | ● EMPTY_OUTPUT — new non-resumable failure class |
| Dark Teal | Output | on_context_limit recovery route |
| Yellow | Gap | on_failure fallback (non-resumable path) |
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph EnumDef ["● ENUM DEFINITION — ● core/_type_enums.py"]
direction LR
RESUME_V["RetryReason.RESUME<br/>━━━━━━━━━━<br/>Contract: context/turn limit<br/>confirmed OR infrastructure kill<br/>Partial progress on disk"]
EO_V["● RetryReason.EMPTY_OUTPUT<br/>━━━━━━━━━━<br/>Contract: clean exit<br/>no output produced<br/>No partial progress on disk"]
OTHERS_V["RetryReason.EARLY_STOP<br/>ZERO_WRITES / DRAIN_RACE / NONE<br/>━━━━━━━━━━<br/>Existing values, unchanged"]
end
subgraph InvariantGate ["● INVARIANT ENFORCEMENT GATE — ● session.py"]
direction TB
P1_G["Phase 1 Gate<br/>━━━━━━━━━━<br/>ERROR_MAX_TURNS OR<br/>context_exhausted signal<br/>→ RESUME always correct"]
KA_G{"_is_kill_anomaly?<br/>━━━━━━━━━━<br/>empty/unparseable output"}
CE_G{"● _is_context_exhausted?<br/>━━━━━━━━━━<br/>Validation gate:<br/>must confirm before RESUME"}
RESUME_ASSIGN["assign RESUME<br/>━━━━━━━━━━<br/>Invariant holds:<br/>partial progress confirmed"]
EO_ASSIGN["● assign EMPTY_OUTPUT<br/>━━━━━━━━━━<br/>Invariant holds:<br/>no progress confirmed"]
end
subgraph Contract ["SKILLRESULT CONTRACT — core/_type_results.py"]
direction LR
SR_FIELD["● SkillResult.retry_reason<br/>━━━━━━━━━━<br/>INIT_ONLY: set by _compute_retry<br/>ROUTING DISCRIMINANT: value<br/>inspected by orchestrator<br/>(was: informational label)"]
end
subgraph RoutingContract ["● ROUTING CONTRACT — ● _prompts.py + ● sous-chef/SKILL.md"]
direction TB
RESUME_RULE["RESUME contract rule<br/>━━━━━━━━━━<br/>retry_reason=resume<br/>→ on_context_limit eligible<br/>Partial work on disk assumed"]
EO_RULE["● EMPTY_OUTPUT contract rule<br/>━━━━━━━━━━<br/>retry_reason=empty_output<br/>→ NEVER on_context_limit<br/>No partial work exists"]
FAIL_SAFE["● Fail-safe default<br/>━━━━━━━━━━<br/>Any new RetryReason value<br/>automatically routes to<br/>on_failure until explicitly added"]
end
P1_G -->|"Phase 1 fires"| RESUME_ASSIGN
P1_G -->|"Phase 1 skipped"| KA_G
KA_G -->|"yes"| CE_G
KA_G -->|"no"| OTHERS_V
CE_G -->|"● True — exhaustion confirmed"| RESUME_ASSIGN
CE_G -->|"● False — no exhaustion"| EO_ASSIGN
RESUME_ASSIGN -->|"uses"| RESUME_V
EO_ASSIGN -->|"● uses"| EO_V
RESUME_V --> SR_FIELD
EO_V --> SR_FIELD
OTHERS_V --> SR_FIELD
SR_FIELD -->|"● RESUME"| RESUME_RULE
SR_FIELD -->|"● EMPTY_OUTPUT"| EO_RULE
SR_FIELD -->|"● new future value"| FAIL_SAFE
%% CLASS ASSIGNMENTS %%
class RESUME_V stateNode;
class EO_V newComponent;
class OTHERS_V stateNode;
class P1_G phase;
class KA_G,CE_G detector;
class RESUME_ASSIGN stateNode;
class EO_ASSIGN newComponent;
class SR_FIELD handler;
class RESUME_RULE output;
class EO_RULE newComponent;
class FAIL_SAFE gap;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Teal (dark) | State | Existing RetryReason values and routing rules |
| Green | New Component | ● EMPTY_OUTPUT value + its contract rule |
| Red (dark) | Detector | Invariant enforcement gates (_is_kill_anomaly, _is_context_exhausted) |
| Purple | Phase | Phase 1 classification gate |
| Orange | Handler | SkillResult.retry_reason field (now routing discriminant) |
| Dark Teal | Output | RESUME routing contract (on_context_limit eligible) |
| Yellow | Gap | Fail-safe default for unknown retry reasons |
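The classification contract in the diagrams above can be condensed into a sketch. The boolean inputs stand in for the real detectors (`ERROR_MAX_TURNS`/`_is_context_exhausted()` for the Phase 1 limit signal, `_is_kill_anomaly` for the anomaly gate); the actual `session.py` logic has more branches.

```python
def classify_retry(limit_signal: bool, kill_anomaly: bool,
                   exhaustion_evidence: bool) -> str:
    """Mirror the classification flow: Phase 1 limit signals are always
    RESUME; a kill anomaly is RESUME only when context exhaustion is
    confirmed, otherwise it is the new EMPTY_OUTPUT (clean exit, no
    partial progress on disk)."""
    if limit_signal:  # Phase 1: ERROR_MAX_TURNS or context exhausted
        return "resume"
    if kill_anomaly:  # Phase 2: empty/unparseable output
        return "resume" if exhaustion_evidence else "empty_output"
    return "other"    # EARLY_STOP / ZERO_WRITES / NONE paths
```

The key invariant is the last branch of the anomaly case: a kill anomaly with no exhaustion evidence can no longer masquerade as a resumable session.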
Closes #447
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260320-181858-518174/temp/rectify/rectify_retry-reason-routing-blindness_2026-03-20_000000_part_a.md`
## Token Usage Summary
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
Add strict schema validation to every user-writable YAML config layer so
that unrecognized keys fail loudly at load time, and add a
`SECRETS_ONLY_KEYS` enforcement layer ensuring `github.token` (and any
future secrets) can appear only in `.secrets.yaml` — never in any
`config.yaml`. Additionally, fix two misleading error messages in
`execution/github.py` that point users toward the committed file, and
harden the `setup-project` skill template with an explicit negative
instruction.
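The two enforcement rules above can be sketched together. This is a minimal sketch, assuming dotted-key flattening of a parsed YAML layer; the real `validate_layer_keys()` derives its known-key set from the `AutomationConfig` dataclass hierarchy rather than taking it as a parameter.

```python
SECRETS_ONLY_KEYS = frozenset({"github.token"})

class ConfigSchemaError(ValueError):
    """Hard failure: unknown key or secret in a committable layer."""

def validate_layer_keys(layer: dict, known_keys: frozenset,
                        layer_path: str, is_secrets_layer: bool) -> None:
    def flatten(d, prefix=""):
        # Yield dotted keys like "github.token" from nested YAML dicts.
        for k, v in d.items():
            dotted = f"{prefix}{k}"
            if isinstance(v, dict):
                yield from flatten(v, dotted + ".")
            else:
                yield dotted

    for key in flatten(layer):
        if key not in known_keys:
            raise ConfigSchemaError(f"{layer_path}: unknown key {key!r}")
        if key in SECRETS_ONLY_KEYS and not is_secrets_layer:
            raise ConfigSchemaError(
                f"{layer_path}: {key!r} belongs in .autoskillit/.secrets.yaml")
```

The same validator runs on every layer; only the `is_secrets_layer` flag opens the `SECRETS_ONLY_KEYS` gate for `.secrets.yaml`.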
## Architecture Impact
### Security Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
START(["load_config()"])
subgraph TrustedLayer ["TRUSTED LAYER (no validation)"]
DEF["defaults.yaml<br/>━━━━━━━━━━<br/>Bundled artifact<br/>Schema source of truth"]
end
subgraph UserConfigBoundary ["TRUST BOUNDARY: User config.yaml"]
UCFG["~/.autoskillit/config.yaml<br/>━━━━━━━━━━<br/>User-writable · can be committed"]
UVAL["● validate_layer_keys()<br/>━━━━━━━━━━<br/>is_secrets_layer=False<br/>Unknown keys → hard fail"]
USEC["● SECRETS_ONLY_KEYS check<br/>━━━━━━━━━━<br/>github.token FORBIDDEN<br/>→ ConfigSchemaError"]
end
subgraph ProjectConfigBoundary ["TRUST BOUNDARY: Project config.yaml"]
PCFG[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>Project-writable · can be committed"]
PVAL["● validate_layer_keys()<br/>━━━━━━━━━━<br/>is_secrets_layer=False<br/>Unknown keys → hard fail"]
PSEC["● SECRETS_ONLY_KEYS check<br/>━━━━━━━━━━<br/>github.token FORBIDDEN<br/>→ ConfigSchemaError"]
end
subgraph SecretsBoundary ["TRUST BOUNDARY: .secrets.yaml (gitignored)"]
SCFG[".autoskillit/.secrets.yaml<br/>━━━━━━━━━━<br/>gitignored · never committed"]
SVAL["● validate_layer_keys()<br/>━━━━━━━━━━<br/>is_secrets_layer=True<br/>Unknown keys → hard fail"]
SALLOW["github.token ALLOWED<br/>━━━━━━━━━━<br/>SECRETS_ONLY_KEYS gate<br/>opens for this layer"]
end
subgraph EnvLayer ["ENV LAYER (Dynaconf, unrestricted)"]
ENV["AUTOSKILLIT_SECTION__KEY<br/>━━━━━━━━━━<br/>Env overrides bypass<br/>file validation"]
end
subgraph SchemaEnforcement ["● SCHEMA ENFORCEMENT (new)"]
SCHEMA["● _CONFIG_SCHEMA<br/>━━━━━━━━━━<br/>Derived from AutomationConfig<br/>dataclass hierarchy"]
SECKEYS["● SECRETS_ONLY_KEYS<br/>━━━━━━━━━━<br/>frozenset({'github.token'})"]
ERR["● ConfigSchemaError<br/>━━━━━━━━━━<br/>Hard fail · ValueError subclass<br/>typo hint via difflib"]
end
subgraph TokenFlow ["TOKEN FLOW → API"]
MERGED["● _make_dynaconf()<br/>━━━━━━━━━━<br/>Merged YAML → temp file<br/>→ Dynaconf → AutomationConfig"]
CTX["make_context()<br/>━━━━━━━━━━<br/>token = config.github.token<br/>or GITHUB_TOKEN env var"]
GHF["DefaultGitHubFetcher<br/>━━━━━━━━━━<br/>self._token (private)<br/>never logged"]
HDR["_headers()<br/>━━━━━━━━━━<br/>● Error msg → .secrets.yaml<br/>Bearer only if token truthy"]
end
START --> DEF
DEF --> UCFG
UCFG --> UVAL
UVAL -->|unknown key| ERR
UVAL -->|valid| USEC
USEC -->|github.token present| ERR
USEC -->|clean| PCFG
PCFG --> PVAL
PVAL -->|unknown key| ERR
PVAL -->|valid| PSEC
PSEC -->|github.token present| ERR
PSEC -->|clean| SCFG
SCFG --> SVAL
SVAL -->|unknown key| ERR
SVAL -->|valid| SALLOW
SALLOW --> MERGED
ENV --> MERGED
MERGED --> CTX
CTX --> GHF
GHF --> HDR
SCHEMA -.->|informs| UVAL
SCHEMA -.->|informs| PVAL
SCHEMA -.->|informs| SVAL
SECKEYS -.->|enforces| USEC
SECKEYS -.->|enforces| PSEC
SECKEYS -.->|gate opens| SALLOW
class DEF stateNode;
class UCFG,PCFG,SCFG phase;
class UVAL,PVAL,SVAL,USEC,PSEC detector;
class SALLOW,SCHEMA,SECKEYS,ERR newComponent;
class ENV cli;
class MERGED,CTX,GHF,HDR output;
class START terminal;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal/Entry | Entry point and env-var layer |
| Teal | Trusted | Bundled defaults (schema source of truth) |
| Purple | Config Files | User-writable YAML layers |
| Red | Validation | ● validate_layer_keys() gates and SECRETS_ONLY_KEYS checks |
| Green | New/Modified | ● New enforcement: _CONFIG_SCHEMA, SECRETS_ONLY_KEYS, ConfigSchemaError |
| Dark Teal | Token Flow | Merged config → context → API headers |
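The "typo hint via difflib" shown in the diagram can be built entirely from the standard library. A sketch, assuming the hypothetical message format below (the real `ConfigSchemaError` text may differ):

```python
import difflib

def unknown_key_message(key: str, known_keys) -> str:
    """Build an error message with a did-you-mean hint for a typo'd
    config key, using difflib's closest-match heuristic."""
    matches = difflib.get_close_matches(key, list(known_keys), n=1, cutoff=0.6)
    hint = f" (did you mean {matches[0]!r}?)" if matches else ""
    return f"Unknown config key {key!r}{hint}"
```

A `cutoff` around 0.6 keeps the hint to genuine near-misses, so unrelated keys fail without a misleading suggestion.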
### Operational Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph InitWorkflow ["INIT WORKFLOW"]
INIT["autoskillit init<br/>━━━━━━━━━━<br/>--test-command<br/>--scope user|project"]
GEN_CFG[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>Non-secret settings only<br/>github.default_repo, test_check"]
GEN_SEC[".autoskillit/.secrets.yaml<br/>━━━━━━━━━━<br/>github.token placeholder<br/>gitignored"]
end
subgraph SetupGuide ["● SETUP-PROJECT GUIDANCE (modified)"]
SKILL["● setup-project/SKILL.md<br/>━━━━━━━━━━<br/>Step 5: Config Updates<br/>NEVER put token in config.yaml"]
end
subgraph ConfigLoad ["● CONFIG LOAD (modified)"]
SERVE["autoskillit serve<br/>━━━━━━━━━━<br/>MCP server start"]
SHOWCFG["autoskillit config show<br/>━━━━━━━━━━<br/>Resolved merged JSON"]
LOAD["● load_config()<br/>━━━━━━━━━━<br/>_make_dynaconf()<br/>4-layer merge"]
end
subgraph ConfigLayers ["CONFIGURATION HIERARCHY (5 layers)"]
L1["defaults.yaml<br/>━━━━━━━━━━<br/>Layer 1 — trusted, no validation"]
L2["~/.autoskillit/config.yaml<br/>━━━━━━━━━━<br/>Layer 2 — ● validated"]
L3[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>Layer 3 — ● validated"]
L4[".autoskillit/.secrets.yaml<br/>━━━━━━━━━━<br/>Layer 4 — ● validated, secrets OK"]
L5["AUTOSKILLIT_SECTION__KEY<br/>━━━━━━━━━━<br/>Layer 5 — env vars, highest priority"]
end
subgraph ValidationGate ["● VALIDATION GATE (new)"]
VALIDATE["● validate_layer_keys()<br/>━━━━━━━━━━<br/>Unknown key? → hard fail<br/>Wrong layer? → redirect"]
SCHEMA_ERR["● ConfigSchemaError<br/>━━━━━━━━━━<br/>Typo hint via difflib<br/>Startup blocked"]
end
subgraph GitHubErrors ["● GITHUB ERROR MESSAGES (modified)"]
GH_404["● github.py — HTTP 404<br/>━━━━━━━━━━<br/>'Configure github.token in<br/>.autoskillit/.secrets.yaml'"]
end
subgraph Output ["OPERATOR FEEDBACK"]
OK["AutomationConfig loaded<br/>━━━━━━━━━━<br/>Server ready"]
ERR_OUT["● ConfigSchemaError raised<br/>━━━━━━━━━━<br/>Message shows layer path<br/>+typo hint + .secrets.yaml redirect"]
end
INIT --> GEN_CFG
INIT --> GEN_SEC
SKILL -.->|"guides LLM during setup"| GEN_CFG
SERVE --> LOAD
SHOWCFG --> LOAD
LOAD --> L1 --> L2 --> L3 --> L4 --> L5
L2 --> VALIDATE
L3 --> VALIDATE
L4 --> VALIDATE
VALIDATE -->|valid| OK
VALIDATE -->|invalid key| SCHEMA_ERR
SCHEMA_ERR --> ERR_OUT
GH_404 -.->|"operator sees .secrets.yaml hint"| GEN_SEC
class INIT,SERVE,SHOWCFG cli;
class GEN_CFG,GEN_SEC,L1,L2,L3,L4 stateNode;
class LOAD,VALIDATE,SCHEMA_ERR,GH_404,SKILL newComponent;
class OK output;
class ERR_OUT gap;
class L5 phase;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | CLI | Entry point commands |
| Teal | Config Files | YAML layer files and config state |
| Green | Modified | ● PR changes: validation, error messages, skill guidance |
| Yellow/Orange | Error | ConfigSchemaError — blocks server startup |
| Dark Teal | Success | Server ready with valid config |
| Purple | Env Layer | Environment variable overrides (highest priority) |
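The validation gate shown above (`validate_layer_keys()` with a difflib typo hint and a `.secrets.yaml` redirect) can be sketched as follows. This is a minimal illustration, not the real implementation: the schema shape, `_CONFIG_SCHEMA`, `SECRETS_ONLY_KEYS`, and the function signature are simplified assumptions.

```python
import difflib

# Illustrative stand-ins for the real definitions in the config module.
_CONFIG_SCHEMA = {
    "github": {"default_repo", "token"},
    "test_check": {"command"},
}
SECRETS_ONLY_KEYS = {("github", "token")}


class ConfigSchemaError(Exception):
    """Raised when a config layer contains an unknown or misplaced key."""


def validate_layer_keys(layer: dict, layer_path: str, secrets_ok: bool = False) -> None:
    """Hard-fail on unknown keys; redirect secrets placed in the wrong layer."""
    for section, keys in layer.items():
        if section not in _CONFIG_SCHEMA:
            hint = difflib.get_close_matches(section, _CONFIG_SCHEMA, n=1)
            suffix = f" (did you mean {hint[0]!r}?)" if hint else ""
            raise ConfigSchemaError(f"{layer_path}: unknown section {section!r}{suffix}")
        for key in keys:
            if key not in _CONFIG_SCHEMA[section]:
                hint = difflib.get_close_matches(key, _CONFIG_SCHEMA[section], n=1)
                suffix = f" (did you mean {hint[0]!r}?)" if hint else ""
                raise ConfigSchemaError(
                    f"{layer_path}: unknown key {section}.{key}{suffix}"
                )
            if (section, key) in SECRETS_ONLY_KEYS and not secrets_ok:
                raise ConfigSchemaError(
                    f"{layer_path}: {section}.{key} belongs in .autoskillit/.secrets.yaml"
                )
```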
Closes #449
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-449-20260320-184502-060373/temp/make-plan/strict_schema_validation_config_yaml_plan_2026-03-20_000000.md`
## Token Usage Summary
| Step | input | output | cached | count | time |
|------|-------|--------|--------|-------|------|
| plan | 9.7k | 298.8k | 10.5M | 18 | 1h 56m |
| verify | 326 | 264.5k | 12.7M | 17 | 1h 34m |
| implement | 6.3k | 365.6k | 46.4M | 17 | 2h 57m |
| fix | 131 | 43.6k | 4.6M | 5 | 27m 26s |
| audit_impl | 161 | 90.0k | 3.1M | 11 | 30m 5s |
| open_pr | 435 | 237.8k | 13.3M | 16 | 1h 40m |
| review_pr | 226 | 287.1k | 6.4M | 9 | 1h 9m |
| resolve_review | 3.7k | 200.0k | 16.1M | 9 | 1h 32m |
| resolve_conflicts | 75 | 30.6k | 2.6M | 3 | 10m 44s |
| diagnose_ci | 36 | 9.0k | 670.5k | 2 | 3m 15s |
| resolve_ci | 27 | 5.9k | 482.4k | 2 | 3m 57s |
| **Total** | 21.2k | 1.8M | 117.0M | | 12h 6m |
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
…e Merges (#460)
## Summary
The orchestrator has no rule preventing it from calling `gh pr merge` in
parallel when multiple implementation pipelines each open a PR on a repo
that lacks a GitHub merge queue. When two or more pipelines reach the
merge phase simultaneously, only the first succeeds; the rest fail with
stale-base conflicts. This plan closes that gap by:
1. Adding a `MERGE PHASE — MANDATORY` section to `sous-chef/SKILL.md`
that (a) prohibits direct parallel `gh pr merge` calls on non-queue
branches, (b) specifies the detection command the orchestrator must run
once before the merge phase, and (c) mandates sequential routing when no
queue is available.
2. Adding a `kitchen_rule` to `implementation.yaml` that names
`check_merge_queue` and `route_queue_mode` as the canonical
merge-routing steps and forbids ad-hoc `gh pr merge` calls from the
orchestrator.
3. Adding contract tests to `tests/contracts/test_instruction_surface.py`
that enforce both changes are present and contain the required sentinel
phrases.

No Python source changes are needed. All five orchestrator deviations
described in the incident stem from bypassing these guidance rules, so
adding them to the two surfaces the orchestrator reads (sous-chef +
kitchen_rules) directly addresses the root cause.
## Requirements
### DETECT — Merge Queue Detection
- **REQ-DETECT-001:** The system must determine whether the target
repository branch has a GitHub merge queue enabled before the merge
phase begins.
- **REQ-DETECT-002:** The detection result must be available to the
orchestrator without requiring a headless session (e.g., via a recipe
step, tool hook, or config lookup).
- **REQ-DETECT-003:** The detection must occur once per pipeline run,
not per-PR.
### ROUTE — Conditional Merge Routing
- **REQ-ROUTE-001:** When merge queue is available, the orchestrator
must be permitted to enroll multiple PRs via `gh pr merge --squash --auto`
in parallel.
- **REQ-ROUTE-002:** When merge queue is NOT available, the orchestrator
must merge PRs sequentially — one at a time, waiting for each to
complete before starting the next.
- **REQ-ROUTE-003:** The sequential merge path must use either the
`merge-prs` recipe or `process-issues --merge-batch` style
(`analyze-prs` → `merge-pr` per PR in order).
### GUIDE — Orchestrator Guidance
- **REQ-GUIDE-001:** The sous-chef SKILL.md must contain an explicit
instruction prohibiting parallel `gh pr merge` calls on branches without
a merge queue.
- **REQ-GUIDE-002:** The guidance must specify the required merge
workflow for non-queue repos: sequential merge via `merge-prs` recipe or
`--merge-batch` flag.
- **REQ-GUIDE-003:** The implementation recipe's kitchen_rules must
reference the merge queue detection result when describing the merge
phase.
### FAIL — Failure Handling
- **REQ-FAIL-001:** When `gh pr merge` fails with a merge conflict, the
orchestrator must route to the recipe's `on_failure` path rather than
improvising with direct git commands.
- **REQ-FAIL-002:** The conflict recovery path must rebase the PR branch
against the updated base, re-push, and retry the merge — sequentially,
not in parallel.
- **REQ-FAIL-003:** The orchestrator must never use `run_cmd` for git
investigation (rebase --abort, git log, git reset) when a merge step
fails — it must delegate to the appropriate skill.
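The detection half of this flow (REQ-DETECT-001) reduces to parsing a `gh api graphql` response once per run. Below is a hedged sketch of just the parsing step; the response shape `{"data": {"repository": {"mergeQueue": ...}}}` and the helper name are assumptions for illustration, not verbatim from the repo.

```python
import json


def queue_available(graphql_response: str) -> bool:
    """Return True when the repository branch reports a merge queue.

    Assumes a GraphQL response shaped like
    {"data": {"repository": {"mergeQueue": {...} | null}}}.
    The exact query and field names are illustrative assumptions.
    """
    data = json.loads(graphql_response)
    repo = (data.get("data") or {}).get("repository") or {}
    # A null mergeQueue means no queue is configured for the branch.
    return repo.get("mergeQueue") is not None
```

The boolean result would then drive `route_queue_mode`: parallel `--auto` enrollment when True, sequential merging when False.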
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([N PRs open<br/>CI passed])
DONE([release_issue_success<br/>Pipeline complete])
ERROR([escalate_stop<br/>Unresolvable conflict])
subgraph Guidance ["● Orchestrator Guidance (Modified)"]
direction LR
SousChef["● sous-chef/SKILL.md<br/>━━━━━━━━━━<br/>MERGE PHASE — MANDATORY<br/>Detect once · Route by availability<br/>NEVER parallel gh pr merge<br/>Route conflicts to on_failure"]
KitchenRule["● implementation.yaml<br/>kitchen_rules<br/>━━━━━━━━━━<br/>MERGE ROUTING rule:<br/>check_merge_queue → route_queue_mode<br/>Prohibits direct gh pr merge"]
end
subgraph Detection ["Detection (run_cmd, once per run)"]
direction TB
CheckQueue["check_merge_queue<br/>━━━━━━━━━━<br/>gh api graphql mergeQueue<br/>→ queue_available: true|false"]
RouteMode{"route_queue_mode<br/>━━━━━━━━━━<br/>queue_available?<br/>auto_merge?"}
end
subgraph QueuePath ["Queue Path"]
direction TB
AutoMerge["enable_auto_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto<br/>Enroll in GitHub queue"]
WaitQueue["wait_for_merge_queue<br/>━━━━━━━━━━<br/>Poll until merged or ejected"]
end
subgraph DirectPath ["Non-Queue Path (Sequential)"]
direction TB
DirectMerge["direct_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto<br/>One PR at a time"]
WaitDirect{"wait_for_direct_merge<br/>━━━━━━━━━━<br/>Poll: merged / closed / timeout"}
end
subgraph ConflictRecovery ["Conflict Recovery"]
direction TB
ConflictFix["direct_merge_conflict_fix<br/>━━━━━━━━━━<br/>run_skill: resolve-merge-conflicts<br/>Rebase + fix conflicts"]
RePush["re_push_direct_merge_fix<br/>━━━━━━━━━━<br/>push_to_remote<br/>Force-push rebased branch"]
RedirectMerge["redirect_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto<br/>Re-enqueue after rebase"]
end
%% MAIN FLOW %%
START --> SousChef
START --> KitchenRule
SousChef -.->|"guides orchestrator"| CheckQueue
KitchenRule -.->|"enforces recipe steps"| CheckQueue
CheckQueue --> RouteMode
RouteMode -->|"queue_available=true"| AutoMerge
RouteMode -->|"queue_available=false"| DirectMerge
RouteMode -->|"auto_merge=false"| DONE
AutoMerge --> WaitQueue
WaitQueue -->|"merged"| DONE
WaitQueue -->|"ejected"| ConflictFix
DirectMerge --> WaitDirect
WaitDirect -->|"merged"| DONE
WaitDirect -->|"closed (stale base)"| ConflictFix
WaitDirect -->|"timeout"| ERROR
ConflictFix -->|"resolved"| RePush
ConflictFix -->|"escalation_required=true"| ERROR
RePush --> RedirectMerge
RedirectMerge --> WaitDirect
%% CLASS ASSIGNMENTS %%
class START,DONE,ERROR terminal;
class SousChef,KitchenRule phase;
class CheckQueue handler;
class RouteMode,WaitDirect stateNode;
class AutoMerge,WaitQueue handler;
class DirectMerge handler;
class ConflictFix detector;
class RePush,RedirectMerge phase;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start, complete, and error states |
| Purple (●) | Phase/Guidance | Modified guidance surfaces: sous-chef/SKILL.md, kitchen_rules, recovery steps |
| Orange | Handler | Recipe execution steps: check_merge_queue, enable_auto_merge, direct_merge, wait_for_queue |
| Teal | State | Decision/routing nodes: route_queue_mode, wait_for_direct_merge |
| Red | Detector | Conflict recovery entry: direct_merge_conflict_fix |
Closes #444
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-444-20260320-232459-625096/temp/make-plan/orchestrator_merge_queue_sequencing_plan_2026-03-20_000000.md`
## Token Usage Summary
No token data available.
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
The `open-pr-main` skill generates comprehensive promotion PRs but
previously included no token cost data. This fix mirrors the proven
self-retrieval pattern from `open-pr` Step 0b: at startup, session token
logs are loaded from disk scoped to the current pipeline CWD via
`DefaultTokenLog.load_from_log_dir(cwd_filter=PIPELINE_CWD)`, formatted
via `TelemetryFormatter.format_token_table`, and conditionally embedded
as a `## Token Usage Summary` section in the PR body. The change is
entirely confined to `skills_extended/open-pr-main/SKILL.md` and a new
contract test file.
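The conditional-embedding half of Step 0b can be sketched as a small helper. Only the `token_summary.md` file name and the graceful-skip behavior come from the skill text; the function name and signature are illustrative assumptions.

```python
from pathlib import Path


def embed_token_summary(pr_body: str, summary_file: Path) -> str:
    """Append a '## Token Usage Summary' section only when the
    self-retrieved table file exists and is non-empty; otherwise
    return the body unchanged (standalone run / no pipeline sessions).
    """
    content = summary_file.read_text().strip() if summary_file.exists() else ""
    if not content:
        return pr_body  # graceful skip: no section emitted
    return pr_body.rstrip() + "\n\n## Token Usage Summary\n" + content + "\n"
```

The same gate appears twice in the flow below: once when deciding whether `TOKEN_SUMMARY_CONTENT` is set, and again when composing the PR body.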
## Requirements
### TOKEN — Token Aggregation
- **REQ-TOKEN-001:** The open-pr-main skill must include a `## Token
Usage Summary` section in the generated PR body.
- **REQ-TOKEN-002:** The token summary must aggregate usage data from
the constituent PRs that landed in integration since divergence from
main.
- **REQ-TOKEN-003:** The aggregated summary must be presented as a
formatted markdown table consistent with TelemetryFormatter output.
### SKILL — Skill Interface
- **REQ-SKILL-001:** The skill must support collecting telemetry data
without requiring an external orchestrator to pre-generate the summary.
- **REQ-SKILL-002:** The token summary section must appear in the PR
body template alongside existing sections (domain analysis, architecture
impact, etc.).
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([START])
COMPLETE([COMPLETE])
subgraph TokenPhase ["★ Step 0b: Token Self-Retrieval (NEW)"]
direction TB
Step0b["★ Token Self-Retrieval<br/>━━━━━━━━━━<br/>PIPELINE_CWD=$(pwd)<br/>load_from_log_dir(cwd_filter=PIPELINE_CWD)<br/>→ writes temp/open-pr-main/token_summary.md"]
TokenGate{"★ token_summary.md<br/>non-empty?"}
SetToken["★ TOKEN_SUMMARY_CONTENT<br/>━━━━━━━━━━<br/>set to file contents"]
SkipToken["★ TOKEN_SUMMARY_CONTENT<br/>━━━━━━━━━━<br/>left empty (graceful)"]
end
subgraph DiscoveryPhase ["Steps 1–14: PR Discovery & Analysis (existing)"]
direction TB
ParseArgs["Step 1: Parse Args<br/>━━━━━━━━━━<br/>integration_branch, base_branch"]
DiscoverPRs["Steps 2–4: Discover PRs<br/>━━━━━━━━━━<br/>merge-base → gh pr list → closing refs"]
DomainAnalysis["Steps 5–12: Domain Analysis<br/>━━━━━━━━━━<br/>issues · diffs · domain summaries · exec summary"]
GenDiagrams["Steps 13–14: Arch-Lens Diagrams<br/>━━━━━━━━━━<br/>select lenses · generate · validate ★/● markers"]
end
subgraph BodyPhase ["● Step 15: PR Body Composition (MODIFIED)"]
direction TB
BuildBody["● Compose PR Body Sections<br/>━━━━━━━━━━<br/>executive summary · stats · highlights<br/>Merged PRs table · Linked Issues table<br/>Domain Analysis · Architecture Impact<br/>closing refs"]
BodyTokenGate{"★ TOKEN_SUMMARY_CONTENT<br/>non-empty?"}
EmbedToken["★ Embed Section<br/>━━━━━━━━━━<br/>## Token Usage Summary<br/>{TOKEN_SUMMARY_CONTENT}"]
SkipSection["omit section<br/>━━━━━━━━━━<br/>standalone / no pipeline sessions"]
FinalBody["● temp/open-pr-main/pr_body_{ts}.md<br/>━━━━━━━━━━<br/>written to disk"]
end
subgraph CreatePhase ["Steps 16–18: GitHub PR Creation (existing)"]
GHCheck{"gh auth status OK?"}
CreatePR["gh pr create<br/>━━━━━━━━━━<br/>--body-file pr_body_{ts}.md"]
end
%% FLOW %%
START --> Step0b
Step0b --> TokenGate
TokenGate -->|"yes"| SetToken
TokenGate -->|"no / n=0"| SkipToken
SetToken --> ParseArgs
SkipToken --> ParseArgs
ParseArgs --> DiscoverPRs
DiscoverPRs --> DomainAnalysis
DomainAnalysis --> GenDiagrams
GenDiagrams --> BuildBody
BuildBody --> BodyTokenGate
BodyTokenGate -->|"non-empty"| EmbedToken
BodyTokenGate -->|"empty"| SkipSection
EmbedToken --> FinalBody
SkipSection --> FinalBody
FinalBody --> GHCheck
GHCheck -->|"yes"| CreatePR
GHCheck -->|"no (emit pr_url=)"| COMPLETE
CreatePR --> COMPLETE
%% CLASS ASSIGNMENTS %%
class START,COMPLETE terminal;
class Step0b,SetToken,SkipToken,EmbedToken newComponent;
class TokenGate,BodyTokenGate,GHCheck detector;
class ParseArgs,DiscoverPRs handler;
class DomainAnalysis,GenDiagrams phase;
class BuildBody,FinalBody output;
class SkipSection stateNode;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | START and COMPLETE states |
| Green | New Component (★) | New nodes added by this PR |
| Red | Detector | Decision/gate nodes |
| Orange | Handler | Argument parsing and PR discovery |
| Purple | Phase | Domain analysis and diagram generation |
| Dark Teal | Output | PR body composition and file writing |
| Teal | State | Graceful skip paths |
Closes #440
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-440-20260320-232459-195668/temp/make-plan/open_pr_main_token_usage_plan_2026-03-20_000000.md`
## Token Usage Summary
No token data available.
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
… User-Defined (#459)
## Summary
`list_recipes()` currently sorts all recipes alphabetically by `name`,
interleaving bundled and user-defined recipes (UDRs). This causes
numbered positions in `autoskillit order` and `autoskillit recipes list`
to shift whenever UDRs are added or removed. Additionally, the MCP
`list_recipes` tool strips the `source` field, leaving agents unable to
distinguish provenance. Two targeted changes fix this:
1. **`src/autoskillit/recipe/io.py`**: Change the sort key in
`list_recipes()` to sort by `(source != BUILTIN, name)` — bundled
recipes sort first (False < True), then alphabetically within each tier.
2. **`src/autoskillit/recipe/_api.py`**: Add `source` to
`RecipeListItem` TypedDict and to the dict constructed in
`format_recipe_list_response()`.

No other files require modification. The CLI `recipes list` and `order`
commands call `list_recipes()` directly and already consume `r.source`,
so they receive correct ordering automatically once the sort key is
fixed.
## Requirements
### ORD — Recipe Ordering
- **REQ-ORD-001:** The `list_recipes()` function must return bundled
recipes before user-defined recipes.
- **REQ-ORD-002:** Within each source tier (bundled, user-defined),
recipes must be sorted alphabetically by name.
- **REQ-ORD-003:** The ordering must be consistent across all consumer
surfaces: CLI `recipes list`, CLI `order`, and the MCP `list_recipes`
tool.
### MCP — MCP Tool Response
- **REQ-MCP-001:** The `format_recipe_list_response` function must
include the `source` field in each recipe entry returned by the MCP
`list_recipes` tool.
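The sort-key change can be illustrated in isolation. `RecipeSource` and the dict records below are simplified stand-ins for the real `RecipeInfo` objects in `recipe/io.py`.

```python
from enum import Enum


class RecipeSource(Enum):
    BUILTIN = "builtin"
    PROJECT = "project"


def sort_recipes(recipes):
    # False < True, so bundled recipes (source != BUILTIN is False)
    # sort first, then alphabetically by name within each tier.
    return sorted(recipes, key=lambda r: (r["source"] != RecipeSource.BUILTIN, r["name"]))
```

Because the comparison tuple leads with the tier bit, adding or removing a PROJECT recipe can never shift the numbered position of a bundled one.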
## Architecture Impact
### Data Lineage Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart LR
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff;
subgraph Origins ["Data Origins"]
BUILTIN["bundled recipes/<br/>━━━━━━━━━━<br/>RecipeSource.BUILTIN<br/>.yaml files"]
PROJECT[".autoskillit/recipes/<br/>━━━━━━━━━━<br/>RecipeSource.PROJECT<br/>.yaml files"]
end
subgraph Collection ["_collect_recipes()"]
COLLECTOR["RecipeInfo stream<br/>━━━━━━━━━━<br/>name, description,<br/>source, path, summary"]
end
subgraph Sorting ["● list_recipes() — recipe/io.py"]
SORT["● sort key changed<br/>━━━━━━━━━━<br/>(source != BUILTIN, name)<br/>BUILTIN first, PROJECT after<br/>alphabetical within tier"]
end
subgraph CLISurfaces ["CLI Consumers (direct RecipeInfo)"]
CLI_LIST["recipes list<br/>━━━━━━━━━━<br/>columnar table<br/>NAME / SOURCE / DESC"]
CLI_ORDER["order command<br/>━━━━━━━━━━<br/>numbered menu<br/>stable positions"]
end
subgraph Projection ["● format_recipe_list_response() — recipe/_api.py"]
FMT["● RecipeListItem<br/>━━━━━━━━━━<br/>name, description,<br/>summary, + source"]
end
subgraph MCPWire ["MCP Tool Handler"]
MCP_TOOL["list_recipes tool<br/>━━━━━━━━━━<br/>json.dumps()<br/>JSON string over MCP"]
end
subgraph HookLayer ["● pretty_output.py — PostToolUse Hook"]
HOOK["● _fmt_list_recipes()<br/>━━━━━━━━━━<br/>Markdown-KV render<br/>- name [source]: desc<br/>coverage contracts updated"]
end
BUILTIN -->|"_collect_recipes(BUILTIN)"| COLLECTOR
PROJECT -->|"_collect_recipes(PROJECT)"| COLLECTOR
COLLECTOR -->|"unsorted list[RecipeInfo]"| SORT
SORT -->|"LoadResult[RecipeInfo]<br/>bundled-first order"| CLI_LIST
SORT -->|"LoadResult[RecipeInfo]<br/>bundled-first order"| CLI_ORDER
SORT -->|"LoadResult[RecipeInfo]"| FMT
FMT -->|"dict with source field"| MCP_TOOL
MCP_TOOL -.->|"JSON string (MCP wire)"| HOOK
class BUILTIN,PROJECT cli;
class COLLECTOR stateNode;
class SORT phase;
class CLI_LIST,CLI_ORDER output;
class FMT handler;
class MCP_TOOL integration;
class HOOK output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Origins | YAML source files on disk (read-only inputs) |
| Teal | State | RecipeInfo stream (internal struct) |
| Purple | Phase | ● Sort stage — modified to stable bundled-first ordering |
| Orange | Handler | ● Projection stage — RecipeListItem gains `source` field |
| Red | MCP | MCP tool handler (json serialization) |
| Dark Teal | Output | CLI surfaces and ● PostToolUse hook (Markdown-KV render) |
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
START([list_recipes called])
subgraph Discovery ["Discovery — _collect_recipes()"]
direction TB
PROJ["Scan PROJECT tier<br/>━━━━━━━━━━<br/>.autoskillit/recipes/*.yaml"]
BUILTIN_SCAN["Scan BUILTIN tier<br/>━━━━━━━━━━<br/>pkg_root()/recipes/*.yaml"]
DEDUP{"Name already seen?<br/>━━━━━━━━━━<br/>PROJECT shadows BUILTIN"}
APPEND["Append RecipeInfo<br/>━━━━━━━━━━<br/>name, desc, source, path, summary"]
end
subgraph SortPhase ["● list_recipes() — Sort Phase"]
direction TB
SORT["● Sort by<br/>━━━━━━━━━━<br/>(source != BUILTIN, name)<br/>False < True → BUILTIN first<br/>then alphabetical within tier"]
end
ROUTE{"Consumer type?"}
subgraph CLIPath ["CLI Path (direct RecipeInfo)"]
direction TB
CLI_RENDER["recipes list / order<br/>━━━━━━━━━━<br/>Iterate LoadResult.items<br/>Render table / numbered menu"]
CLI_OUT["stdout<br/>━━━━━━━━━━<br/>Columnar table or<br/>interactive numbered menu"]
end
subgraph MCPPath ["MCP Path"]
direction TB
FMT["● format_recipe_list_response()<br/>━━━━━━━━━━<br/>Project RecipeInfo →<br/>RecipeListItem with source field"]
ERRCHECK{"result.errors<br/>present?"}
JSON_DUMP["json.dumps()<br/>━━━━━━━━━━<br/>Serialize to JSON string"]
HOOK["● _fmt_list_recipes() hook<br/>━━━━━━━━━━<br/>PostToolUse: Markdown-KV render<br/>- name [source]: desc<br/>coverage contracts updated"]
end
END_CLI([CLI complete])
END_MCP([MCP response delivered])
START --> PROJ
PROJ -->|"each .yaml"| DEDUP
PROJ -->|"after PROJECT"| BUILTIN_SCAN
BUILTIN_SCAN -->|"each .yaml"| DEDUP
DEDUP -->|"no — new name"| APPEND
DEDUP -->|"yes — skip"| PROJ
APPEND --> PROJ
BUILTIN_SCAN -->|"all collected"| SORT
SORT -->|"LoadResult[RecipeInfo]"| ROUTE
ROUTE -->|"CLI invocation"| CLI_RENDER
ROUTE -->|"MCP tool call"| FMT
CLI_RENDER --> CLI_OUT --> END_CLI
FMT --> ERRCHECK
ERRCHECK -->|"no errors"| JSON_DUMP
ERRCHECK -->|"errors present"| JSON_DUMP
JSON_DUMP --> HOOK --> END_MCP
class START,END_CLI,END_MCP terminal;
class PROJ,BUILTIN_SCAN,APPEND handler;
class DEDUP,ERRCHECK,ROUTE stateNode;
class SORT phase;
class CLI_RENDER handler;
class CLI_OUT output;
class FMT phase;
class JSON_DUMP handler;
class HOOK output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Entry and exit states |
| Orange | Handler | Discovery scan, append, JSON serialize, CLI render |
| Teal | State | Decision and routing nodes |
| Purple | Phase | ● Sort (stable ordering) and ● MCP projection (source field) |
| Dark Teal | Output | CLI stdout and ● hook Markdown-KV render |
Closes #456
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-456-20260320-232500-690754/temp/make-plan/stable_recipe_listing_order_plan_2026-03-20_120000.md`
## Token Usage Summary
No token data available.
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
Adds a **PARALLEL STEP SCHEDULING — MANDATORY** section to
`src/autoskillit/skills/sous-chef/SKILL.md` that codifies the wavefront
scheduling rule: when running multiple pipelines in parallel, all fast
steps (MCP tool calls completing in seconds) across ALL pipelines must
be completed before any slow step (`run_skill`, which launches headless
sessions taking minutes) is launched. Slow steps are then batched and
launched together so they overlap in wall-clock time. Additionally, adds
a `run_mode` ingredient (default: `sequential`, option: `parallel`) to
the `implementation` and `remediation` bundled recipes.
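One possible shape for the new `run_mode` declaration is sketched below; the actual ingredient schema used by `implementation.yaml` and `remediation.yaml` may differ, so treat field names as assumptions.

```yaml
# Hypothetical ingredient block; real schema may differ.
ingredients:
  run_mode:
    description: How to process multiple issues across pipelines
    default: sequential   # parallel execution requires explicit opt-in
    options:
      - sequential
      - parallel
```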
## Requirements
### PROMPT
- **REQ-PROMPT-001:** The sous-chef SKILL.md must contain a PARALLEL
STEP SCHEDULING section that is marked MANDATORY.
- **REQ-PROMPT-002:** The section must define fast steps as MCP tool
calls that complete in seconds: `run_cmd`, `clone_repo`,
`create_unique_branch`, `fetch_github_issue`, `claim_issue`,
`merge_worktree`, `test_check`, `reset_test_dir`, `classify_fix`.
- **REQ-PROMPT-003:** The section must define slow steps as any
`run_skill` invocation.
- **REQ-PROMPT-004:** The section must instruct the orchestrator to
complete all fast steps for ALL pipelines before launching any slow
step.
- **REQ-PROMPT-005:** The section must instruct the orchestrator to
launch all slow steps together in one parallel batch once all pipelines
are aligned at a slow step boundary.
- **REQ-PROMPT-006:** The section must explicitly prohibit launching a
slow step for one pipeline while another pipeline still has fast steps
pending.
- **REQ-PROMPT-007:** The section must explain the wall-clock rationale:
batched rounds wait for the slowest step, so fast steps that finish
instantly idle until the slow step completes.
- **REQ-PROMPT-008:** The section must be placed after the existing
MULTIPLE ISSUES section in the SKILL.md file.
### INGREDIENT
- **REQ-INGREDIENT-001:** Recipes that support multi-issue execution
(e.g., `implementation`, `remediation`) must declare a `run_mode`
ingredient with options `sequential` (default) and `parallel`.
- **REQ-INGREDIENT-002:** The default value for `run_mode` must be
`sequential` — parallel execution requires explicit opt-in.
- **REQ-INGREDIENT-003:** When `run_mode: parallel` is set, the
orchestrator must apply the wavefront scheduling rule from
REQ-PROMPT-004 through REQ-PROMPT-006.
- **REQ-INGREDIENT-004:** When `run_mode: sequential` is set (or
defaulted), the orchestrator must process issues one at a time in the
order provided.
- **REQ-INGREDIENT-005:** The `run_mode` ingredient sets the default
behavior, but a runtime verbal override wins: if the user overrides
verbally (e.g., says "run in parallel"), the verbal instruction takes
precedence per the existing MULTIPLE ISSUES sous-chef rule.
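The wavefront rule from REQ-PROMPT-004 through REQ-PROMPT-006 can be sketched as a small scheduler loop. The names here (`FAST_STEPS`, `schedule_wavefront`, the step strings) are illustrative, not the orchestrator's real API; the fast-step list mirrors REQ-PROMPT-002.

```python
# Fast steps: MCP tool calls that complete in seconds (per REQ-PROMPT-002).
FAST_STEPS = {
    "run_cmd", "clone_repo", "create_unique_branch", "fetch_github_issue",
    "claim_issue", "merge_worktree", "test_check", "reset_test_dir",
    "classify_fix",
}


def schedule_wavefront(pipelines):
    """Return batches of steps: drain fast steps across ALL pipelines,
    then launch one parallel batch of slow (run_skill) steps, repeat.
    Each pipeline is a mutable list of pending step names.
    """
    batches = []
    while any(pipelines):
        # Drain fast steps first; no slow step launches while any
        # pipeline still has a fast step at its head (REQ-PROMPT-006).
        fast = [p.pop(0) for p in pipelines if p and p[0] in FAST_STEPS]
        if fast:
            batches.append(("fast", fast))
            continue
        # Every remaining pipeline's next step is slow: launch together
        # so the headless sessions overlap in wall-clock time.
        slow = [p.pop(0) for p in pipelines if p]
        batches.append(("slow", slow))
    return batches
```

The alternation of fast drains and slow batches means each round is bounded by its slowest `run_skill` session, which is the wall-clock rationale REQ-PROMPT-007 asks the section to explain.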
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
START([START: N issues submitted])
subgraph ModeSelect ["● Mode Selection (MULTIPLE ISSUES + run_mode)"]
direction TB
MODE{"run_mode or<br/>user intent?"}
SEQ["Sequential Mode<br/>━━━━━━━━━━<br/>Process one issue at a time<br/>in provided order"]
PAR["Parallel Mode Selected<br/>━━━━━━━━━━<br/>Launch N independent pipelines<br/>apply wavefront scheduling"]
end
subgraph FastDrain ["★ Fast-Step Drain (PARALLEL STEP SCHEDULING)"]
direction TB
INSPECT["● Inspect next pending step<br/>━━━━━━━━━━<br/>for each of N pipelines"]
HAS_FAST{"Any pipeline has<br/>a fast step pending?"}
RUN_FAST["★ Execute all fast steps<br/>━━━━━━━━━━<br/>run_cmd · clone_repo<br/>create_unique_branch<br/>fetch_github_issue<br/>claim_issue · merge_worktree<br/>test_check · reset_test_dir<br/>classify_fix"]
end
subgraph SlowBatch ["★ Slow-Step Batch (PARALLEL STEP SCHEDULING)"]
direction TB
ALL_SLOW["★ All pipelines at slow boundary<br/>━━━━━━━━━━<br/>every next step is run_skill"]
LAUNCH["★ Launch all slow steps together<br/>━━━━━━━━━━<br/>run_skill × N in parallel<br/>wall-clock time overlaps"]
WAIT["Wait for batch completion<br/>━━━━━━━━━━<br/>bounded by slowest session"]
end
DONE_CHECK{"All pipelines<br/>complete?"}
DONE([DONE: all N pipelines finished])
START --> MODE
MODE -->|"sequential / run_mode=sequential"| SEQ
MODE -->|"parallel / run_mode=parallel"| PAR
PAR --> INSPECT
INSPECT --> HAS_FAST
HAS_FAST -->|"YES — drain fast steps"| RUN_FAST
RUN_FAST -->|"re-inspect after batch"| INSPECT
HAS_FAST -->|"NO — all aligned at slow step"| ALL_SLOW
ALL_SLOW --> LAUNCH
LAUNCH --> WAIT
WAIT --> DONE_CHECK
DONE_CHECK -->|"more steps remain"| INSPECT
DONE_CHECK -->|"all finished"| DONE
SEQ --> DONE
class START,DONE terminal;
class MODE detector;
class SEQ,PAR stateNode;
class INSPECT phase;
class HAS_FAST detector;
class RUN_FAST newComponent;
class ALL_SLOW,LAUNCH newComponent;
class WAIT handler;
class DONE_CHECK detector;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start and end states |
| Red | Detector | Decision points and routing guards |
| Teal | State | Mode selection outcomes |
| Purple | Phase | Step inspection / control flow |
| Green (★) | New Component | New scheduling logic added by this PR |
| Orange | Handler | Wait / synchronization (bounded by slowest) |
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
START([START: load_recipe called])
subgraph IngredientDecl ["● Ingredient Declaration (INIT_ONLY)"]
direction TB
IMPL_YAML["● implementation.yaml<br/>━━━━━━━━━━<br/>run_mode:<br/> default: sequential"]
REMED_YAML["● remediation.yaml<br/>━━━━━━━━━━<br/>run_mode:<br/> default: sequential"]
OVERRIDE["override dict<br/>━━━━━━━━━━<br/>caller may pass<br/>run_mode=parallel"]
end
subgraph StructValidation ["Structural Validation Gates"]
direction TB
PRESENCE["ingredient presence check<br/>━━━━━━━━━━<br/>referenced names must<br/>be declared in recipe"]
NO_ENUM["gap: no value validation<br/>━━━━━━━━━━<br/>sequential/parallel NOT<br/>enforced by schema"]
end
subgraph ActiveRecipe ["Resolved Ingredient State (INIT_ONLY after this point)"]
direction TB
RESOLVED["run_mode resolved<br/>━━━━━━━━━━<br/>sequential (default)<br/>OR parallel (override)"]
end
subgraph PromptContract ["● sous-chef SKILL.md Contract (reads run_mode)"]
direction TB
SKILL_READ["● PARALLEL STEP SCHEDULING<br/>━━━━━━━━━━<br/>reads run_mode at runtime<br/>applies wavefront rule if parallel"]
end
subgraph ContractTests ["★ Contract Enforcement (new test suite)"]
direction TB
SCHED_TESTS["★ test_sous_chef_scheduling.py<br/>━━━━━━━━━━<br/>REQ-PROMPT-001..008<br/>asserts section present + correct"]
INGRED_TESTS["● TestRunModeIngredient<br/>━━━━━━━━━━<br/>REQ-INGREDIENT-001..005<br/>asserts run_mode declared + default"]
end
DONE([Orchestrator applies scheduling rule])
START --> IMPL_YAML
START --> REMED_YAML
OVERRIDE --> RESOLVED
IMPL_YAML --> PRESENCE
REMED_YAML --> PRESENCE
PRESENCE --> NO_ENUM
NO_ENUM --> RESOLVED
RESOLVED --> SKILL_READ
SKILL_READ --> DONE
SCHED_TESTS -.->|"enforces contract on"| SKILL_READ
INGRED_TESTS -.->|"enforces contract on"| IMPL_YAML
INGRED_TESTS -.->|"enforces contract on"| REMED_YAML
class START,DONE terminal;
class IMPL_YAML,REMED_YAML handler;
class OVERRIDE phase;
class PRESENCE stateNode;
class NO_ENUM gap;
class RESOLVED detector;
class SKILL_READ handler;
class SCHED_TESTS newComponent;
class INGRED_TESTS newComponent;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start and end states |
| Orange | Handler | Recipe YAML definitions and prompt reads |
| Purple | Phase | Caller-supplied override |
| Teal | Gate | Structural validation (presence check) |
| Yellow | Gap | Missing value constraint (no enum enforcement) |
| Red | Resolved | INIT_ONLY state after resolution — never mutated |
| Green (★) | New Component | New contract tests added by this PR |
Closes #461
## Implementation Plan
Plan file:
`temp/make-plan/add_parallel_step_scheduling_rule_plan_2026-03-21_085800.md`
## Token Usage Summary
No token data available for this pipeline run.
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
…ence (#464)
## Summary
Implements first-run detection for `cook` sessions and a guided
onboarding menu with concurrent background intelligence gathering.
When a project has been initialized (`autoskillit init`) but has never
been onboarded, `cook` intercepts the session launch to present a
5-option interactive menu (Analyze, GitHub Issue, Demo Run, Write
Recipe, Skip). Background threads gather project intelligence (build
tools, pre-commit scanner, good-first-issues) concurrently while the
user reads the menu. The chosen action becomes the `initial_prompt` for
the Claude session.
A `.autoskillit/.onboarded` marker is written after any menu path
completes, preventing re-prompting on subsequent `cook` invocations.
`autoskillit init --force` resets the marker.
Also adds `tailorable`/`tailoring_hints` frontmatter fields to
`SkillInfo` as infrastructure for the upcoming skill-tailoring workflow
(Issue #215).
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([● cook])
DONE([session complete])
ABORT([aborted])
subgraph Guard ["Precondition Guards · ● _cook.py"]
direction TB
CLAUDE{"claude in PATH?"}
CONFIRM{"Launch session?<br/>━━━━━━━━━━<br/>[Enter / n]"}
end
subgraph Detection ["★ First-Run Detection · _onboarding.is_first_run()"]
direction TB
FR{"★ is_first_run<br/>━━━━━━━━━━<br/>config.yaml exists?<br/>.onboarded absent?<br/>recipes/ empty?<br/>no overrides?"}
end
subgraph Onboarding ["★ Guided Onboarding · run_onboarding_menu()"]
direction TB
WELCOME["★ welcome banner<br/>━━━━━━━━━━<br/>Would you like help?"]
OPT_IN{"Y / n?"}
INTEL["★ gather_intel (bg threads)<br/>━━━━━━━━━━<br/>_detect_scanner()<br/>_detect_build_tools()<br/>_fetch_good_first_issues()"]
MENU["★ menu display<br/>━━━━━━━━━━<br/>A / B / C / D / E"]
CHOICE{"★ user choice<br/>━━━━━━━━━━<br/>[A/B/C/D/E]"}
end
subgraph Routes ["★ initial_prompt Routes"]
direction LR
PA["A: /autoskillit:setup-project"]
PB["B: /autoskillit:prepare-issue {ref}"]
PC["C: /autoskillit:setup-project {target}"]
PD["D: /autoskillit:write-recipe"]
end
subgraph SessionLaunch ["● Session Launch · _cook.py"]
direction TB
BUILD["● build_interactive_cmd<br/>━━━━━━━━━━<br/>initial_prompt injected<br/>skills_dir added"]
RUN["subprocess.run<br/>━━━━━━━━━━<br/>Claude interactive session"]
end
MARKER_SKIP["★ mark_onboarded<br/>━━━━━━━━━━<br/>write .autoskillit/.onboarded<br/>(skip / decline path)"]
MARKER_DONE["★ mark_onboarded<br/>━━━━━━━━━━<br/>write .autoskillit/.onboarded<br/>(finally: A–D complete)"]
%% MAIN FLOW %%
START --> CLAUDE
CLAUDE -->|"not found"| ABORT
CLAUDE -->|"found"| CONFIRM
CONFIRM -->|"n"| ABORT
CONFIRM -->|"enter"| FR
FR -->|"not first run"| BUILD
FR -->|"first run"| WELCOME
WELCOME --> OPT_IN
OPT_IN -->|"n / no"| MARKER_SKIP
OPT_IN -->|"Y / enter"| INTEL
INTEL --> MENU
MENU --> CHOICE
CHOICE -->|"A"| PA
CHOICE -->|"B"| PB
CHOICE -->|"C"| PC
CHOICE -->|"D"| PD
CHOICE -->|"E / other"| MARKER_SKIP
PA & PB & PC & PD --> BUILD
MARKER_SKIP -->|"initial_prompt = None"| BUILD
BUILD --> RUN
RUN -->|"session exits · finally block"| MARKER_DONE
MARKER_DONE --> DONE
%% CLASS ASSIGNMENTS %%
class START,DONE,ABORT terminal;
class CLAUDE,CONFIRM detector;
class FR detector;
class OPT_IN,CHOICE stateNode;
class WELCOME,MENU newComponent;
class INTEL newComponent;
class PA,PB,PC,PD newComponent;
class MARKER_SKIP,MARKER_DONE newComponent;
class BUILD,RUN handler;
```
### Operational Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
subgraph CLILayer ["CLI ENTRY POINTS"]
direction LR
COOK["● autoskillit cook / c<br/>━━━━━━━━━━<br/>Interactive Claude session<br/>+ first-run onboarding gate"]
INIT["● autoskillit init<br/>━━━━━━━━━━<br/>--force resets onboarding<br/>--test-command --scope"]
SKILLS["autoskillit skills list<br/>━━━━━━━━━━<br/>Lists skills incl.<br/>tailorable metadata"]
end
subgraph OnboardingModule ["★ Onboarding Module · cli/_onboarding.py"]
direction TB
DETECT["★ is_first_run()<br/>━━━━━━━━━━<br/>reads config.yaml ✓<br/>reads .onboarded ✓<br/>reads recipes/ ✓<br/>reads .claude/skills/ ✓"]
MENU["★ run_onboarding_menu()<br/>━━━━━━━━━━<br/>welcome + Y/n<br/>background intel gather<br/>A/B/C/D/E choice"]
MARKER_WRITE["★ mark_onboarded()<br/>━━━━━━━━━━<br/>writes .autoskillit/.onboarded"]
end
subgraph ConfigState ["CONFIGURATION & STATE FILES"]
direction TB
CONFIG[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>read: first-run gate<br/>write: init"]
ONBOARDED["★ .autoskillit/.onboarded<br/>━━━━━━━━━━<br/>gitignored marker<br/>absent = first run<br/>present = onboarded"]
RECIPES[".autoskillit/recipes/<br/>━━━━━━━━━━<br/>read: first-run gate<br/>(empty = first run)"]
end
subgraph SkillMeta ["● SkillInfo Metadata · workspace/skills.py"]
direction TB
SKILL_INFO["● SkillInfo dataclass<br/>━━━━━━━━━━<br/>★ tailorable: bool<br/>★ tailoring_hints: str<br/>(from SKILL.md frontmatter)"]
end
subgraph Outputs ["OBSERVABILITY OUTPUTS"]
direction TB
SESSION["Claude interactive session<br/>━━━━━━━━━━<br/>initial_prompt injected<br/>when first-run path taken"]
GITIGNORE[".autoskillit/.gitignore<br/>━━━━━━━━━━<br/>auto-includes .onboarded<br/>via ensure_project_temp()"]
end
%% FLOWS %%
COOK -->|"reads"| DETECT
DETECT -->|"reads"| CONFIG
DETECT -->|"reads"| ONBOARDED
DETECT -->|"reads"| RECIPES
DETECT -->|"first run → invoke"| MENU
MENU -->|"writes"| MARKER_WRITE
MARKER_WRITE -->|"creates"| ONBOARDED
COOK -->|"launches"| SESSION
INIT -->|"writes"| CONFIG
INIT -->|"--force: deletes"| ONBOARDED
SKILLS -->|"reads"| SKILL_INFO
CONFIG -->|"gitignore updated by"| GITIGNORE
%% CLASS ASSIGNMENTS %%
class COOK,INIT,SKILLS cli;
class DETECT,MENU newComponent;
class MARKER_WRITE newComponent;
class CONFIG,RECIPES stateNode;
class ONBOARDED newComponent;
class SKILL_INFO handler;
class SESSION,GITIGNORE output;
```
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
subgraph Gates ["★ FIRST-RUN DETECTION GATES · is_first_run()"]
direction TB
G1["★ Gate 1: config.yaml exists<br/>━━━━━━━━━━<br/>False → not first run<br/>(init has not run)"]
G2["★ Gate 2: .onboarded absent<br/>━━━━━━━━━━<br/>False → not first run<br/>(already onboarded)"]
G3["★ Gate 3: recipes/ empty<br/>━━━━━━━━━━<br/>False → not first run<br/>(customized project)"]
G4["★ Gate 4: no overrides<br/>━━━━━━━━━━<br/>detect_project_local_overrides()<br/>False → not first run"]
end
subgraph MarkerLifecycle ["★ .onboarded Marker Lifecycle"]
direction LR
ABSENT["★ .onboarded ABSENT<br/>━━━━━━━━━━<br/>initial state after init<br/>triggers first-run gate"]
PRESENT["★ .onboarded PRESENT<br/>━━━━━━━━━━<br/>idempotent write<br/>atomic_write (no overwrite)"]
end
subgraph WriteGuards ["★ Write Guards · mark_onboarded()"]
direction TB
IDEMPOTENT["★ exists() check before write<br/>━━━━━━━━━━<br/>no-op if already present<br/>(prevents double-write)"]
ATOMIC["atomic_write()<br/>━━━━━━━━━━<br/>temp-file + rename<br/>(prevents partial write)"]
GITIGNORE["● ensure_project_temp()<br/>━━━━━━━━━━<br/>.onboarded in _GITIGNORE_ENTRIES<br/>(prevents accidental commit)"]
end
subgraph ResetGate ["● RESET GATE · init --force"]
direction TB
FORCE{"● --force flag<br/>━━━━━━━━━━<br/>config written?"}
DELETE["● onboarded_marker.unlink<br/>━━━━━━━━━━<br/>missing_ok=True<br/>(safe delete)"]
end
subgraph TransientState ["★ TRANSIENT STATE · cook() call frame"]
direction TB
PROMPT["initial_prompt: str | None<br/>━━━━━━━━━━<br/>None → no onboarding<br/>str → skill injected as<br/>Claude opening message"]
INTEL["★ OnboardingIntel<br/>━━━━━━━━━━<br/>scanner_found: str | None<br/>build_tools: list[str]<br/>github_issues: list[str]<br/>populated once, read-only"]
end
subgraph SkillMeta ["● SKILL METADATA · SkillInfo"]
direction TB
TAILORABLE["● SkillInfo.tailorable: bool<br/>━━━━━━━━━━<br/>parsed from SKILL.md<br/>INIT_ONLY (frozen dataclass)"]
HINTS["● SkillInfo.tailoring_hints: str<br/>━━━━━━━━━━<br/>parsed from SKILL.md<br/>INIT_ONLY (frozen dataclass)"]
end
%% GATE CHAIN %%
G1 -->|"pass"| G2
G2 -->|"pass"| G3
G3 -->|"pass"| G4
G4 -->|"all pass → first run"| ABSENT
%% MARKER LIFECYCLE %%
ABSENT -->|"read by is_first_run()"| G2
ABSENT -->|"onboarding complete"| IDEMPOTENT
IDEMPOTENT -->|"not exists"| ATOMIC
ATOMIC -->|"writes"| PRESENT
GITIGNORE -->|"gitignores"| PRESENT
%% RESET %%
FORCE -->|"yes"| DELETE
DELETE -->|"resets to"| ABSENT
%% TRANSIENT %%
G4 -->|"first run detected"| PROMPT
INTEL -->|"feeds suggestions to"| PROMPT
%% CLASS ASSIGNMENTS %%
class G1,G2,G3,G4 detector;
class ABSENT,PRESENT newComponent;
class IDEMPOTENT,ATOMIC newComponent;
class GITIGNORE handler;
class FORCE stateNode;
class DELETE handler;
class PROMPT phase;
class INTEL newComponent;
class TAILORABLE,HINTS handler;
```
Closes #457
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-457-20260321-085654-812593/temp/make-plan/first_run_detection_guided_onboarding_plan_2026-03-21_090000.md`
## Token Usage Summary
No token data available for this pipeline run.
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
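The four detection gates can be sketched in a few lines of Python. This is an illustration only: the paths come from the diagrams, but the function body is an assumption, and the real Gate 4 (calling `detect_project_local_overrides()`) is omitted here:

```python
from pathlib import Path

def is_first_run(project_root: Path) -> bool:
    """Illustrative sketch of the four first-run gates."""
    auto = project_root / ".autoskillit"
    if not (auto / "config.yaml").exists():          # Gate 1: init has not run
        return False
    if (auto / ".onboarded").exists():               # Gate 2: already onboarded
        return False
    recipes = auto / "recipes"
    if recipes.exists() and any(recipes.iterdir()):  # Gate 3: customized project
        return False
    # Gate 4 (no project-local overrides) omitted for brevity
    return True
```

All gates must pass for `cook` to take the onboarding path; any single failure routes straight to the normal session launch.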
## Summary
The adjudication pipeline classified `**plan_path** = /path` (model
output with bold markdown on the token name) as `CONTRACT_VIOLATION` — a
terminal failure state — because `re.search('plan_path\s*=\s*/.+',
'**plan_path** = /path')` returned `None`. The model's markdown
decorator (`**`) sits between the token name and the `=`, breaking the
regex while the semantic content is fully intact.
The root weakness was that `_check_expected_patterns` applied raw
regexes to unprocessed model output with no tolerance for formatting
variation. This single function is the universal choke-point for all 25+
skill contracts that use `key = value` structured output tokens. Part A
installs a markdown normalizer (`_strip_markdown_from_tokens`) at the
choke-point and adds comprehensive regression tests. Part B adds a
static enforcement semantic rule
(`output-section-no-markdown-directive`) and derives
`_OUTPUT_PATH_TOKENS` from the contract schema, with no-markdown
directives added to all 31 at-risk SKILL.md output sections.
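The normalization at the choke-point can be sketched as a single substitution. This is a hedged illustration: the regex below is an assumption about the approach, not the exact pattern used by `_strip_markdown_from_tokens`:

```python
import re

def strip_markdown_from_tokens(text: str) -> str:
    # Remove **bold** / *italic* wrappers from a token name that sits to
    # the left of '=', so 'key = value' contract regexes match again.
    return re.sub(r"\*{1,2}(\w+)\*{1,2}(\s*=)", r"\1\2", text)
```

With normalization in place, a contract regex like `plan_path\s*=\s*/.+` matches the cleaned form of `**plan_path** = /path`, even though it fails against the raw model output.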
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([START: run_headless_core])
SUCCEEDED([SUCCEEDED])
RETRIABLE([RETRIABLE])
FAILED([FAILED])
subgraph Execution ["Session Execution"]
direction TB
Runner["● runner()<br/>━━━━━━━━━━<br/>SubprocessResult<br/>termination + channel"]
TermDispatch{"termination?"}
StaleRecovery["stale recovery<br/>━━━━━━━━━━<br/>re-parse stdout"]
end
subgraph Recovery ["Recovery Layer"]
direction TB
RecoverMarker["_recover_from_separate_marker<br/>━━━━━━━━━━<br/>scan assistant_messages<br/>for standalone marker"]
RecoverBlock["● _recover_block_from_assistant_messages<br/>━━━━━━━━━━<br/>combine assistant_messages<br/>re-check patterns (normalized)"]
end
subgraph Outcome ["Outcome Computation"]
direction TB
ComputeSuccess{"● _compute_success<br/>━━━━━━━━━━<br/>Channel B bypass<br/>or termination dispatch"}
CheckContent["_check_session_content<br/>━━━━━━━━━━<br/>error / empty / subtype<br/>marker / patterns"]
CheckPatterns["● _check_expected_patterns<br/>━━━━━━━━━━<br/>normalize then re.search<br/>all patterns must match"]
NormalizeMD["★ _strip_markdown_from_tokens<br/>━━━━━━━━━━<br/>**token** = → token =<br/>*token* = → token ="]
ContradictionGuard{"contradiction guard<br/>━━━━━━━━━━<br/>success + retry?"}
DeadEndGuard{"dead-end guard<br/>━━━━━━━━━━<br/>failed + no-retry<br/>+ channel confirmed?"}
EvalContent["_evaluate_content_state<br/>━━━━━━━━━━<br/>ABSENT / CONTRACT_VIOLATION<br/>/ SESSION_ERROR / COMPLETE"]
end
subgraph Result ["Result Assembly"]
direction TB
NormalizeSubtype["● _normalize_subtype<br/>━━━━━━━━━━<br/>map outcome → label<br/>adjudicated_failure / success…"]
BudgetGuard["_apply_budget_guard<br/>━━━━━━━━━━<br/>cap consecutive retries"]
ZeroWriteGate["zero-write gate<br/>━━━━━━━━━━<br/>demote if write expected<br/>but write_count==0"]
end
%% MAIN FLOW %%
START --> Runner
Runner --> TermDispatch
TermDispatch -->|"STALE"| StaleRecovery
TermDispatch -->|"TIMED_OUT"| ComputeSuccess
TermDispatch -->|"NATURAL_EXIT / COMPLETED"| RecoverMarker
StaleRecovery -->|"recovered"| SUCCEEDED
StaleRecovery -->|"not recovered"| RETRIABLE
RecoverMarker -->|"marker found in messages"| RecoverBlock
RecoverMarker -->|"no marker or completion_marker unset"| RecoverBlock
RecoverBlock -->|"patterns matched in messages"| ComputeSuccess
RecoverBlock -->|"no match"| ComputeSuccess
ComputeSuccess -->|"Channel B path"| CheckPatterns
ComputeSuccess -->|"COMPLETED / NATURAL_EXIT"| CheckContent
CheckContent --> CheckPatterns
CheckPatterns --> NormalizeMD
NormalizeMD -->|"normalized text"| CheckPatterns
CheckPatterns -->|"all match → success=True"| ContradictionGuard
CheckPatterns -->|"any miss → success=False"| ContradictionGuard
ContradictionGuard -->|"success=True AND retry=True<br/>demote success"| DeadEndGuard
ContradictionGuard -->|"consistent"| DeadEndGuard
DeadEndGuard -->|"failed + no-retry + channel confirmed"| EvalContent
DeadEndGuard -->|"else"| NormalizeSubtype
EvalContent -->|"ABSENT → promote to RETRIABLE"| NormalizeSubtype
EvalContent -->|"CONTRACT_VIOLATION → terminal"| NormalizeSubtype
EvalContent -->|"SESSION_ERROR → terminal"| NormalizeSubtype
NormalizeSubtype --> BudgetGuard
BudgetGuard --> ZeroWriteGate
ZeroWriteGate -->|"outcome=SUCCEEDED"| SUCCEEDED
ZeroWriteGate -->|"outcome=RETRIABLE"| RETRIABLE
ZeroWriteGate -->|"outcome=FAILED"| FAILED
%% CLASS ASSIGNMENTS %%
class START terminal;
class SUCCEEDED,RETRIABLE,FAILED terminal;
class Runner,CheckContent handler;
class TermDispatch,ComputeSuccess,ContradictionGuard,DeadEndGuard stateNode;
class StaleRecovery,RecoverMarker,RecoverBlock,EvalContent phase;
class CheckPatterns,NormalizeSubtype,BudgetGuard,ZeroWriteGate detector;
class NormalizeMD newComponent;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | START / SUCCEEDED / RETRIABLE / FAILED states |
| Teal | State | Decision forks (termination dispatch, success/retry guards) |
| Purple | Phase | Recovery helpers and content-state evaluation |
| Orange | Handler | subprocess runner and session content check |
| Red | Detector | Pattern matching, subtype normalization, budget and write gates |
| Green | New Component | ★ `_strip_markdown_from_tokens` — new normalization function |
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([Contract Defined])
ENFORCED([Contract Enforced])
VIOLATED([Contract Violated])
subgraph DesignTime ["DESIGN-TIME CONTRACT LAYER"]
direction TB
Contracts["● skill_contracts.yaml<br/>━━━━━━━━━━<br/>expected_output_patterns<br/>write_behavior · outputs"]
SKILL_MD["● SKILL.md ## Output section<br/>━━━━━━━━━━<br/>31 files updated with<br/>no-markdown directive"]
NoMarkdownRule["★ output-section-no-markdown-directive<br/>━━━━━━━━━━<br/>Semantic rule (WARNING)<br/>Fires if directive absent"]
ContractLoader["contracts.py load_bundled_manifest()<br/>━━━━━━━━━━<br/>lru_cache · L0 core loader<br/>SkillContract dataclass"]
end
subgraph DesignGate ["DESIGN-TIME VALIDATION GATE"]
direction TB
DesignCheck{"skill has<br/>expected_output_patterns?"}
OutputSection{"## Output section<br/>contains directive?"}
WarnFiring["WARNING: output-section-<br/>no-markdown-directive<br/>━━━━━━━━━━<br/>Add directive to ## Output"]
DesignPass["Design-time gate: PASS<br/>━━━━━━━━━━<br/>Contract well-specified"]
end
subgraph RuntimeLayer ["RUNTIME CONTRACT ENFORCEMENT"]
direction TB
OutputPathTokens["_OUTPUT_PATH_TOKENS<br/>━━━━━━━━━━<br/>frozenset of file_path outputs<br/>module-level, from contracts"]
PathContaminationCheck["path contamination check<br/>━━━━━━━━━━<br/>output paths must stay<br/>within cwd boundary"]
NormalizeMD["★ _strip_markdown_from_tokens()<br/>━━━━━━━━━━<br/>**token** = → token =<br/>*token* = → token ="]
PatternCheck["● _check_expected_patterns()<br/>━━━━━━━━━━<br/>normalize → re.search all<br/>AND semantics · all must match"]
end
subgraph ContentGate ["RUNTIME CONTENT STATE GATE"]
direction TB
ContentState{"_evaluate_content_state()"}
COMPLETE["ContentState.COMPLETE<br/>━━━━━━━━━━<br/>non-empty · marker present<br/>all patterns matched"]
ABSENT["ContentState.ABSENT<br/>━━━━━━━━━━<br/>empty result or marker absent<br/>→ RETRIABLE (drain-race)"]
CONTRACT_VIO["ContentState.CONTRACT_VIOLATION<br/>━━━━━━━━━━<br/>result present · marker present<br/>pattern absent → TERMINAL"]
SESSION_ERR["ContentState.SESSION_ERROR<br/>━━━━━━━━━━<br/>CLI is_error=True<br/>→ TERMINAL"]
end
subgraph SubtypeNorm ["SUBTYPE NORMALIZATION"]
direction TB
NormSubtype["● _normalize_subtype()<br/>━━━━━━━━━━<br/>CliSubtype.SUCCESS + FAILED<br/>→ adjudicated_failure"]
SkillResult["SkillResult<br/>━━━━━━━━━━<br/>success · subtype · needs_retry<br/>retry_reason"]
end
%% FLOW %%
START --> Contracts
Contracts --> ContractLoader
SKILL_MD --> NoMarkdownRule
ContractLoader --> DesignCheck
DesignCheck -->|"has patterns"| OutputSection
DesignCheck -->|"no patterns — exempt"| DesignPass
OutputSection -->|"directive absent"| WarnFiring
OutputSection -->|"directive present"| DesignPass
WarnFiring -.->|"warning only,<br/>pipeline continues"| DesignPass
DesignPass --> OutputPathTokens
OutputPathTokens --> PathContaminationCheck
PathContaminationCheck --> NormalizeMD
NormalizeMD --> PatternCheck
PatternCheck --> ContentState
ContentState -->|"empty / marker absent"| ABSENT
ContentState -->|"patterns all match"| COMPLETE
ContentState -->|"result present, pattern missing"| CONTRACT_VIO
ContentState -->|"CLI error"| SESSION_ERR
COMPLETE --> NormSubtype
ABSENT -->|"drain-race → promote RETRIABLE"| NormSubtype
CONTRACT_VIO -->|"terminal failure"| NormSubtype
SESSION_ERR -->|"terminal failure"| NormSubtype
NormSubtype --> SkillResult
SkillResult -->|"success=True"| ENFORCED
SkillResult -->|"success=False, needs_retry=False"| VIOLATED
SkillResult -->|"needs_retry=True"| START
%% CLASS ASSIGNMENTS %%
class START,ENFORCED,VIOLATED terminal;
class Contracts,SKILL_MD handler;
class NoMarkdownRule,NormalizeMD newComponent;
class ContractLoader,OutputPathTokens phase;
class DesignCheck,OutputSection,ContentState stateNode;
class WarnFiring,CONTRACT_VIO,SESSION_ERR detector;
class PathContaminationCheck,PatternCheck,NormSubtype detector;
class COMPLETE,ABSENT output;
class SkillResult gap;
class DesignPass cli;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Contract defined / enforced / violated end states |
| Orange | Handler | Contract manifest and SKILL.md source files (●) |
| Green | New Component | ★ New normalization function and semantic rule |
| Purple | Phase | Contract loader and path token frozenset |
| Teal | State | Decision gates (pattern present? directive present?) |
| Red | Detector | Warning firings, pattern checks, violation states, subtype normalization |
| Dark Teal | Output | COMPLETE / ABSENT content states |
| Yellow | Derived | SkillResult — mutable routing outcome |
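The four-way content-state gate in the diagram condenses into a small dispatcher. This is a sketch: the enum values mirror the diagram nodes, but the signature of the real `_evaluate_content_state` is an assumption, not quoted from the code:

```python
from enum import Enum

class ContentState(Enum):
    COMPLETE = "complete"
    ABSENT = "absent"
    CONTRACT_VIOLATION = "contract_violation"
    SESSION_ERROR = "session_error"

def evaluate_content_state(result_text: str, marker_present: bool,
                           patterns_matched: bool, is_error: bool) -> ContentState:
    if is_error:
        return ContentState.SESSION_ERROR       # CLI is_error=True: terminal
    if not result_text or not marker_present:
        return ContentState.ABSENT              # drain-race: promoted to RETRIABLE
    if not patterns_matched:
        return ContentState.CONTRACT_VIOLATION  # result present, pattern missing: terminal
    return ContentState.COMPLETE
```

The ordering matters: a CLI error short-circuits everything, and an empty result is retriable rather than a violation, so only a present-but-malformed result reaches the terminal `CONTRACT_VIOLATION` branch.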
Closes #462
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260321-085252-104453/temp/rectify/rectify_structured-output-markdown-fragility_2026-03-21_091500_part_a.md`
## Token Usage Summary
| Step | input | output | cached | count | time |
|------|-------|--------|--------|-------|------|
| open_pr | 3.4k | 191.6k | 12.9M | 13 | 1h 6m |
| audit_impl | 5.3k | 166.4k | 5.2M | 16 | 1h 2m |
| implement | 9.6k | 197.6k | 29.5M | 13 | 1h 21m |
| review_pr | 2.2k | 406.9k | 16.2M | 13 | 2h 46m |
| dry_walkthrough | 526 | 79.8k | 5.2M | 3 | 26m 55s |
| fix | 114 | 37.3k | 3.9M | 5 | 20m 10s |
| diagnose_ci | 27 | 4.0k | 382.6k | 2 | 1m 36s |
| resolve_review | 2.7k | 301.3k | 36.5M | 12 | 1h 48m |
| resolve_ci | 26 | 5.5k | 452.1k | 2 | 3m 52s |
| assess | 102 | 31.2k | 4.5M | 3 | 18m 18s |
| plan-31 | 29 | 9.4k | 515.6k | 1 | 4m 28s |
| plan-28 | 21 | 12.1k | 307.8k | 1 | 7m 20s |
| plan-32 | 21 | 16.3k | 390.4k | 1 | 9m 1s |
| plan-33 | 21 | 20.3k | 361.1k | 1 | 9m 9s |
| review-31 | 57 | 398 | 1.4M | 1 | 3m 13s |
| review-32 | 1.6k | 5.0k | 144.7k | 1 | 3m 35s |
| review-28-retry | 3.8k | 4.9k | 155.3k | 1 | 5m 28s |
| review-33-retry | 1.9k | 6.2k | 163.4k | 1 | 6m 10s |
| verify-28 | 17 | 7.2k | 426.3k | 1 | 2m 1s |
| verify-31 | 15 | 8.7k | 360.3k | 1 | 2m 25s |
| verify-32 | 22 | 16.0k | 831.0k | 1 | 4m 24s |
| verify-33 | 26 | 21.2k | 974.8k | 1 | 6m 26s |
| implement-31 | 34 | 11.3k | 934.3k | 1 | 3m 18s |
| implement-32 | 26 | 10.6k | 809.4k | 1 | 3m 56s |
| implement-28 | 41 | 18.1k | 1.3M | 1 | 5m 19s |
| implement-33 | 36 | 23.6k | 1.3M | 1 | 7m 19s |
| audit-33 | 12 | 6.0k | 151.4k | 1 | 1m 47s |
| audit-28 | 14 | 7.1k | 206.9k | 1 | 1m 57s |
| audit-32 | 14 | 7.2k | 202.2k | 1 | 2m 7s |
| audit-31 | 375 | 10.4k | 149.8k | 1 | 3m 42s |
| open-pr-33 | 21 | 9.8k | 513.3k | 1 | 2m 41s |
| open-pr-28 | 21 | 9.9k | 485.4k | 1 | 3m 13s |
| open-pr-32 | 29 | 12.5k | 837.5k | 1 | 4m 25s |
| open-pr-31 | 28 | 11.5k | 657.9k | 1 | 4m 32s |
| review-pr-28 | 25 | 27.0k | 561.0k | 1 | 4m 57s |
| review-pr-31 | 20 | 31.5k | 423.2k | 1 | 5m 43s |
| review-pr-32 | 23 | 31.7k | 659.1k | 1 | 5m 55s |
| review-pr-33 | 23 | 31.7k | 565.2k | 1 | 7m 12s |
| resolve-review-33 | 35 | 14.2k | 1.2M | 1 | 4m 58s |
| resolve-review-31 | 40 | 17.2k | 1.2M | 1 | 5m 21s |
| resolve-review-28 | 46 | 17.8k | 1.6M | 1 | 6m 26s |
| resolve-review-32 | 38 | 20.7k | 1.2M | 1 | 6m 45s |
| plan-30 | 24 | 14.9k | 549.5k | 1 | 5m 25s |
| plan-29 | 1.5k | 14.3k | 342.6k | 1 | 5m 43s |
| review-29 | 11 | 7.2k | 196.5k | 1 | 4m 34s |
| review-30 | 2.6k | 5.9k | 155.7k | 1 | 5m 14s |
| verify-30 | 19 | 10.9k | 639.2k | 1 | 3m 31s |
| verify-29 | 2.9k | 15.2k | 575.0k | 1 | 4m 17s |
| resolve_conflict_449 | 11 | 2.0k | 232.3k | 1 | 42s |
| implement-30 | 21 | 12.8k | 485.2k | 1 | 3m 34s |
| implement-29 | 30 | 9.1k | 893.3k | 1 | 4m 12s |
| fix-30 | 21 | 4.9k | 480.0k | 1 | 1m 35s |
| fix-29 | 29 | 6.2k | 793.3k | 1 | 2m 31s |
| open-pr-30 | 24 | 10.7k | 516.1k | 1 | 4m 23s |
| open-pr-29 | 27 | 17.5k | 767.1k | 1 | 6m 10s |
| plan-34 | 23 | 16.1k | 493.0k | 1 | 5m 38s |
| review-34 | 17 | 12.2k | 531.4k | 1 | 4m 17s |
| verify-34 | 14 | 12.0k | 364.9k | 1 | 3m 44s |
| implement-34 | 36 | 13.9k | 1.2M | 1 | 3m 56s |
| open-pr-34 | 30 | 17.1k | 824.2k | 1 | 9m 8s |
| plan-35 | 195 | 13.9k | 693.5k | 1 | 6m 24s |
| plan-37 | 34 | 20.0k | 848.0k | 1 | 9m 13s |
| plan-36 | 37 | 31.6k | 1.2M | 1 | 10m 39s |
| verify-37 | 24 | 14.2k | 831.4k | 1 | 3m 45s |
| verify-35 | 60 | 20.0k | 2.7M | 1 | 5m 49s |
| verify-36 | 26 | 21.2k | 1.1M | 1 | 6m 18s |
| implement-35 | 42 | 16.8k | 1.7M | 1 | 4m 58s |
| plan | 6.4k | 205.6k | 9.7M | 10 | 1h 16m |
| implement-37 | 93 | 55.5k | 6.7M | 1 | 19m 16s |
| implement-36 | 100 | 68.9k | 8.2M | 1 | 24m 41s |
| fix-35 | 14 | 2.2k | 204.0k | 1 | 43s |
| verify | 1.0k | 104.4k | 6.1M | 9 | 31m 15s |
| audit-35 | 15 | 7.1k | 236.3k | 1 | 1m 55s |
| audit-36 | 15 | 8.2k | 266.1k | 1 | 2m 20s |
| audit-37 | 11 | 9.4k | 207.8k | 1 | 2m 58s |
| open-pr-36 | 32 | 17.3k | 851.6k | 1 | 5m 24s |
| open-pr-37 | 34 | 18.1k | 1.2M | 1 | 5m 59s |
| open-pr-35 | 20 | 11.0k | 481.3k | 1 | 6m 22s |
| review-pr-36 | 251 | 36.7k | 735.0k | 1 | 10m 5s |
| review-pr-35 | 36 | 38.8k | 1.3M | 1 | 11m 19s |
| review-pr-37 | 25 | 47.4k | 871.5k | 1 | 12m 5s |
| resolve-review-36 | 21 | 4.3k | 397.2k | 1 | 1m 28s |
| resolve-review-37 | 590 | 32.4k | 2.3M | 1 | 9m 12s |
| resolve-review-35 | 267 | 48.0k | 3.9M | 1 | 14m 44s |
| review-pr-36-retry | 28 | 39.0k | 854.3k | 1 | 10m 1s |
| resolve-review-36-retry | 54 | 33.0k | 3.1M | 1 | 9m 39s |
| resolve-conflicts-36 | 26 | 13.1k | 808.6k | 1 | 3m 32s |
| resolve-conflicts-37 | 34 | 15.7k | 1.2M | 1 | 4m 4s |
| plan-38 | 22 | 24.3k | 473.4k | 1 | 7m 52s |
| review-38 | 7.2k | 7.4k | 210.5k | 1 | 8m 2s |
| verify-38 | 279 | 15.1k | 1.2M | 1 | 4m 31s |
| implement-38 | 31 | 22.3k | 1.4M | 1 | 6m 28s |
| audit-38 | 12 | 7.4k | 157.2k | 1 | 2m 2s |
| open-pr-38 | 27 | 14.0k | 776.8k | 1 | 4m 16s |
| review-pr-38 | 1.0k | 49.9k | 635.1k | 1 | 12m 7s |
| resolve-review-38 | 52 | 35.6k | 2.7M | 1 | 10m 42s |
| plan-39 | 3.1k | 16.4k | 555.3k | 1 | 6m 0s |
| review-39 | 14 | 6.3k | 222.4k | 1 | 5m 31s |
| verify-39 | 2.7k | 9.1k | 571.6k | 1 | 2m 29s |
| implement-39 | 28 | 9.5k | 1.1M | 1 | 3m 11s |
| audit-39 | 17 | 9.5k | 316.1k | 1 | 2m 39s |
| open-pr-39 | 25 | 11.0k | 661.1k | 1 | 4m 7s |
| review-pr-39 | 24 | 42.2k | 706.9k | 1 | 7m 36s |
| resolve-review-39 | 3.1k | 22.6k | 1.5M | 1 | 7m 25s |
| plan-40 | 1.4k | 23.9k | 768.7k | 1 | 9m 42s |
| review-40 | 3.2k | 5.6k | 171.6k | 1 | 6m 50s |
| verify-40 | 20 | 13.2k | 867.9k | 1 | 3m 46s |
| implement-40 | 43 | 104.5k | 2.4M | 1 | 24m 59s |
| audit-40 | 18 | 10.8k | 382.3k | 1 | 3m 7s |
| open-pr-40 | 31 | 12.5k | 874.8k | 1 | 5m 50s |
| review-pr-40 | 27 | 34.2k | 765.3k | 1 | 7m 39s |
| resolve-review-40 | 43 | 26.0k | 2.1M | 1 | 9m 4s |
| plan-41 | 31 | 18.5k | 858.9k | 1 | 7m 40s |
| review-41 | 6.7k | 8.2k | 181.9k | 1 | 5m 37s |
| verify-41 | 34 | 37.0k | 2.4M | 1 | 10m 46s |
| implement-41 | 91 | 56.2k | 6.9M | 1 | 19m 22s |
| audit-41 | 416 | 13.9k | 207.9k | 1 | 9m 51s |
| open-pr-41 | 37 | 14.6k | 1.3M | 1 | 5m 26s |
| review-pr-41 | 34 | 37.7k | 1.1M | 1 | 9m 24s |
| resolve-review-41 | 55 | 20.0k | 2.1M | 1 | 7m 14s |
| investigate | 3.5k | 11.0k | 484.1k | 1 | 8m 35s |
| rectify | 20 | 23.6k | 641.3k | 1 | 11m 12s |
| **Total** | 82.5k | 3.8M | 241.9M | | 22h 43m |
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
The MCP server's `_initialize()` startup recovery loaded token and
timing telemetry from ALL pipeline runs in the last 24 hours with no
pipeline-scoping filter, contaminating the `DefaultTokenLog` and
`DefaultTimingLog` singletons with entries from entirely unrelated
pipelines. The architectural fix removes token/timing recovery from
`_initialize()` entirely — these logs are per-pipeline live accumulators
and have no cross-pipeline recovery semantics. `DefaultAuditLog`
legitimately spans pipelines (failure tracking) and is left intact. A
stale instruction in `tools_recipe.py` simultaneously directed the
orchestrator to use the contaminated server-side `get_token_summary`
tool to pre-stage a PR token file; that instruction is removed. The
architectural result: at server startup the token/timing logs are empty;
they are populated only from live `run_skill` calls in the running
pipeline; skills self-retrieve clean, CWD-scoped data from disk when
they need to build PR bodies.
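The normalization mentioned throughout this change, stripping trailing `-N` suffixes so that `plan-30` and `plan-31` aggregate under one canonical key, can be sketched roughly as follows (a minimal sketch; the real `canonical_step_name()` in the codebase may differ in detail):

```python
import re

def canonical_step_name(step_name: str) -> str:
    """Strip a trailing numeric suffix such as '-30' so retries and
    per-issue variants of a step aggregate under one canonical key."""
    return re.sub(r"-\d+$", "", step_name)
```

Applied on every mutation path (both `record()` and `load_from_log_dir()`), this keeps per-issue step names like `open-pr-41` from fragmenting the aggregate tables.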
## Architecture Impact
### Data Lineage Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 70, 'curve': 'basis'}}}%%
flowchart LR
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff;
subgraph Origins ["Data Origins"]
direction TB
RUN_SKILL["run_skill() calls<br/>━━━━━━━━━━<br/>Live pipeline steps<br/>raw step_name + token counts"]
end
subgraph Disk ["Disk Storage (Source of Truth)"]
direction TB
SESSIONS[("~/.local/share/autoskillit/logs/<br/>━━━━━━━━━━<br/>sessions.jsonl index (cwd field)<br/>sessions/{dir}/token_usage.json<br/>sessions/{dir}/step_timing.json")]
end
subgraph Startup ["● Server Startup (_state._initialize)"]
direction TB
AUDIT_REC["DefaultAuditLog.load_from_log_dir<br/>━━━━━━━━━━<br/>since= filter only (no cwd_filter)<br/>Cross-pipeline failures — correct"]
TOK_EMPTY["● DefaultTokenLog — empty at startup<br/>━━━━━━━━━━<br/>load_from_log_dir REMOVED<br/>Was contaminating across pipelines"]
TIM_EMPTY["● DefaultTimingLog — empty at startup<br/>━━━━━━━━━━<br/>load_from_log_dir REMOVED<br/>Was contaminating across pipelines"]
end
subgraph Live ["Live In-Memory Singletons"]
direction TB
TOK_LIVE["● DefaultTokenLog (singleton)<br/>━━━━━━━━━━<br/>● canonical_step_name() strips -N<br/>Current pipeline only"]
TIM_LIVE["● DefaultTimingLog (singleton)<br/>━━━━━━━━━━<br/>● canonical_step_name() strips -N<br/>Current pipeline only"]
AUDIT_LIVE["DefaultAuditLog (singleton)<br/>━━━━━━━━━━<br/>Startup data + current failures"]
end
subgraph SkillRetrieval ["Skill Self-Retrieval (pipeline-scoped)"]
direction TB
OPEN_PR["open-pr Step 0b<br/>━━━━━━━━━━<br/>Fresh DefaultTokenLog()<br/>load_from_log_dir(cwd_filter=PIPELINE_CWD)"]
end
subgraph Artifacts ["PR Artifacts"]
PR_BODY["PR Body Token Table<br/>━━━━━━━━━━<br/>temp/open-pr/token_summary.md<br/>Pipeline-scoped only"]
end
subgraph Contracts ["★ New Contract Tests"]
direction TB
CONTRACT["★ test_tools_recipe_contracts.py<br/>━━━━━━━━━━<br/>Asserts get_token_summary( not in<br/>load_recipe docstring"]
end
RUN_SKILL -->|"record(step_name, usage)<br/>canonical_step_name()"| TOK_LIVE
RUN_SKILL -->|"record(step_name, duration)<br/>canonical_step_name()"| TIM_LIVE
RUN_SKILL -->|"session_log.flush()<br/>writes token_usage.json<br/>step_timing.json + index"| SESSIONS
SESSIONS -->|"load_from_log_dir(since=)<br/>no cwd_filter — intentional"| AUDIT_REC
SESSIONS -. "✗ REMOVED — startup<br/>contamination eliminated" .-> TOK_EMPTY
SESSIONS -. "✗ REMOVED — startup<br/>contamination eliminated" .-> TIM_EMPTY
TOK_EMPTY -->|"starts empty"| TOK_LIVE
TIM_EMPTY -->|"starts empty"| TIM_LIVE
AUDIT_REC --> AUDIT_LIVE
SESSIONS -->|"load_from_log_dir<br/>cwd_filter=PIPELINE_CWD"| OPEN_PR
OPEN_PR -->|"pipeline-scoped token table"| PR_BODY
CONTRACT -. "asserts stale<br/>instruction absent" .-> OPEN_PR
class RUN_SKILL cli;
class SESSIONS stateNode;
class AUDIT_REC,OPEN_PR handler;
class TOK_EMPTY,TIM_EMPTY newComponent;
class TOK_LIVE,TIM_LIVE,AUDIT_LIVE phase;
class PR_BODY output;
class CONTRACT detector;
```
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 70, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
subgraph Startup ["● _initialize() — Startup Contract (server/_state.py)"]
direction LR
AUDIT_INIT["DefaultAuditLog<br/>━━━━━━━━━━<br/>CROSS_PIPELINE<br/>load_from_log_dir(since=) ✓<br/>No cwd_filter — intentional"]
TOK_INIT["● DefaultTokenLog<br/>━━━━━━━━━━<br/>INIT_ONLY = empty {}<br/>load_from_log_dir REMOVED<br/>Was: contaminated startup"]
TIM_INIT["● DefaultTimingLog<br/>━━━━━━━━━━<br/>INIT_ONLY = empty {}<br/>load_from_log_dir REMOVED<br/>Was: contaminated startup"]
end
subgraph Gates ["Mutation Gates (both record() and load_from_log_dir())"]
direction TB
NORM_GATE["● canonical_step_name()<br/>━━━━━━━━━━<br/>strips trailing -N suffixes<br/>'plan-30' → 'plan'<br/>Applied on EVERY mutation path"]
CWD_GATE["cwd_filter gate<br/>━━━━━━━━━━<br/>Empty = all pipelines<br/>Non-empty = current pipeline only<br/>_iter_session_log_entries()"]
SINCE_GATE["since= filter<br/>━━━━━━━━━━<br/>ISO timestamp bound<br/>Excludes old sessions<br/>_iter_session_log_entries()"]
end
subgraph LiveMutation ["Live Mutation — record() path"]
direction TB
LIVE_REC["● DefaultTokenLog.record()<br/>━━━━━━━━━━<br/>APPEND_ONLY _entries<br/>Aggregates by canonical key<br/>invocation_count += 1"]
LIVE_TIM["● DefaultTimingLog.record()<br/>━━━━━━━━━━<br/>APPEND_ONLY _entries<br/>Aggregates total_seconds<br/>invocation_count += 1"]
end
subgraph DiskMutation ["Disk Mutation — load_from_log_dir() path (skill self-retrieval)"]
direction TB
DISK_REC["● DefaultTokenLog.load_from_log_dir<br/>━━━━━━━━━━<br/>cwd_filter=PIPELINE_CWD required<br/>Merges into fresh instance<br/>Not called at startup"]
DISK_TIM["● DefaultTimingLog.load_from_log_dir<br/>━━━━━━━━━━<br/>● cwd_filter now tested<br/>Merges into fresh instance<br/>Not called at startup"]
end
subgraph Contracts ["Contract Enforcement"]
direction TB
STATE_TEST["● test_state.py<br/>━━━━━━━━━━<br/>Asserts token/timing empty<br/>after _initialize()"]
CONTRACT_TEST["★ test_tools_recipe_contracts.py<br/>━━━━━━━━━━<br/>Asserts get_token_summary(<br/>absent from load_recipe docstring"]
TIM_TEST["● test_timings.py<br/>━━━━━━━━━━<br/>TestLoadFromLogDirCwdFilterTiming<br/>Verifies cwd_filter isolation"]
end
subgraph Reset ["State Reset"]
CLEAR["DefaultTokenLog.clear()<br/>━━━━━━━━━━<br/>_entries = {} (empty dict)<br/>Resets to INIT_ONLY state<br/>Writes telemetry_clear_marker"]
end
AUDIT_INIT -->|"populated from disk"| SINCE_GATE
TOK_INIT -. "starts empty<br/>no disk load" .-> NORM_GATE
TIM_INIT -. "starts empty<br/>no disk load" .-> NORM_GATE
SINCE_GATE -->|"passes timestamp"| CWD_GATE
CWD_GATE -->|"cwd match"| NORM_GATE
NORM_GATE -->|"canonical key"| DISK_REC
NORM_GATE -->|"canonical key"| DISK_TIM
NORM_GATE -->|"canonical key"| LIVE_REC
NORM_GATE -->|"canonical key"| LIVE_TIM
STATE_TEST -. "asserts empty<br/>after init" .-> TOK_INIT
STATE_TEST -. "asserts empty<br/>after init" .-> TIM_INIT
CONTRACT_TEST -. "asserts stale<br/>instruction absent" .-> DISK_REC
TIM_TEST -. "validates<br/>cwd_filter" .-> DISK_TIM
LIVE_REC -->|"clear() resets to {}"| CLEAR
CLEAR -. "INIT_ONLY<br/>boundary restored" .-> TOK_INIT
class AUDIT_INIT handler;
class TOK_INIT,TIM_INIT newComponent;
class NORM_GATE,CWD_GATE,SINCE_GATE stateNode;
class LIVE_REC,LIVE_TIM phase;
class DISK_REC,DISK_TIM output;
class STATE_TEST,TIM_TEST detector;
class CONTRACT_TEST gap;
class CLEAR cli;
```
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
START([SERVER START])
subgraph Startup ["● _initialize() — Server Startup (server/_state.py)"]
direction TB
RECOVER["recover_crashed_sessions()<br/>━━━━━━━━━━<br/>tmpfs trace cleanup only<br/>No token/timing involvement"]
CLEAR_MARKER{"read_telemetry_clear_marker()<br/>━━━━━━━━━━<br/>marker > since_dt?"}
SINCE_CALC["Compute since_dt<br/>━━━━━━━━━━<br/>now - 24h OR clear_marker<br/>(whichever is later)"]
AUDIT_LOAD["● ctx.audit.load_from_log_dir<br/>━━━━━━━━━━<br/>since=since_str (no cwd_filter)<br/>Cross-pipeline — correct"]
TOK_SKIP["● DefaultTokenLog starts empty {}<br/>━━━━━━━━━━<br/>load_from_log_dir REMOVED<br/>No cross-pipeline contamination"]
TIM_SKIP["● DefaultTimingLog starts empty {}<br/>━━━━━━━━━━<br/>load_from_log_dir REMOVED<br/>No cross-pipeline contamination"]
end
INIT_DONE([SERVER READY])
subgraph LiveFlow ["● Live Accumulation — record() path (tokens.py / timings.py)"]
direction TB
RUN_SKILL_CALL["run_skill() invocation completes<br/>━━━━━━━━━━<br/>Yields step_name + token_usage dict"]
STEP_EMPTY{"step_name empty<br/>or token_usage None?"}
CANON_LIVE["● canonical_step_name(step_name)<br/>━━━━━━━━━━<br/>strip trailing -N suffix<br/>'plan-30' → 'plan'"]
RECORD_ACC["● token_log.record() / timing_log.record()<br/>━━━━━━━━━━<br/>_entries[key] += counts<br/>invocation_count += 1"]
end
subgraph SkillRetrieve ["Skill Self-Retrieval — load_from_log_dir() (open-pr Step 0b)"]
direction TB
FRESH_LOG["new DefaultTokenLog()<br/>━━━━━━━━━━<br/>Fresh empty instance<br/>Bypasses contaminated singleton"]
ITER_SESSIONS["_iter_session_log_entries()<br/>━━━━━━━━━━<br/>reads sessions.jsonl index"]
SINCE_CHECK{"since= filter<br/>━━━━━━━━━━<br/>entry.timestamp >= since?"}
CWD_CHECK{"cwd_filter non-empty?<br/>━━━━━━━━━━<br/>entry.cwd == PIPELINE_CWD?"}
CANON_DISK["● canonical_step_name(raw_step)<br/>━━━━━━━━━━<br/>Same normalization as live path"]
MERGE_ENTRY["Merge into fresh _entries<br/>━━━━━━━━━━<br/>Accumulate by canonical key"]
TOKEN_FILE["temp/open-pr/token_summary.md<br/>━━━━━━━━━━<br/>Pipeline-scoped token table"]
end
NO_OP([SKIP — no-op])
SKIP_SESSION([SKIP session])
PR_BODY([PR BODY with clean token table])
START --> RECOVER
RECOVER --> CLEAR_MARKER
CLEAR_MARKER -->|"marker exists + newer"| SINCE_CALC
CLEAR_MARKER -->|"no marker / older"| SINCE_CALC
SINCE_CALC --> AUDIT_LOAD
SINCE_CALC --> TOK_SKIP
SINCE_CALC --> TIM_SKIP
AUDIT_LOAD --> INIT_DONE
TOK_SKIP --> INIT_DONE
TIM_SKIP --> INIT_DONE
INIT_DONE --> RUN_SKILL_CALL
RUN_SKILL_CALL --> STEP_EMPTY
STEP_EMPTY -->|"yes"| NO_OP
STEP_EMPTY -->|"no"| CANON_LIVE
CANON_LIVE --> RECORD_ACC
INIT_DONE --> FRESH_LOG
FRESH_LOG --> ITER_SESSIONS
ITER_SESSIONS --> SINCE_CHECK
SINCE_CHECK -->|"too old"| SKIP_SESSION
SINCE_CHECK -->|"passes"| CWD_CHECK
CWD_CHECK -->|"cwd mismatch"| SKIP_SESSION
CWD_CHECK -->|"cwd matches"| CANON_DISK
CANON_DISK --> MERGE_ENTRY
MERGE_ENTRY -->|"all sessions processed"| TOKEN_FILE
TOKEN_FILE --> PR_BODY
class START,INIT_DONE terminal;
class NO_OP,SKIP_SESSION,PR_BODY output;
class RECOVER,SINCE_CALC handler;
class CLEAR_MARKER,STEP_EMPTY,SINCE_CHECK,CWD_CHECK stateNode;
class AUDIT_LOAD phase;
class TOK_SKIP,TIM_SKIP newComponent;
class RUN_SKILL_CALL,RECORD_ACC,FRESH_LOG,ITER_SESSIONS,MERGE_ENTRY,TOKEN_FILE handler;
class CANON_LIVE,CANON_DISK detector;
```
Closes #466
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260321-113541-384854/temp/rectify/rectify_token-telemetry-contamination_2026-03-21_120000_part_a.md`
## Token Usage Summary
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
The `direct_merge` step in `implementation.yaml`, `remediation.yaml`, and
`implementation-groups.yaml` uses `gh pr merge --squash --auto`
unconditionally. On repos where `autoMergeAllowed=false` (no branch
protection rules), this command fails, and the pipeline silently falls
through to `confirm_cleanup` — leaving the PR unmerged. The same issue
affects `redirect_merge` and `merge-pr` SKILL.md Step 2.
**Fix:** Insert a new `check_auto_merge` detection step after
`check_merge_queue` in all three recipes, add a third route condition in
`route_queue_mode`, and introduce a complete `immediate_merge` path (5
new steps per recipe) that uses plain `gh pr merge --squash` when
`autoMergeAllowed=false`. Update `merge-pr` SKILL.md and `sous-chef`
SKILL.md to document the three-way routing.
## Requirements
### DETECT — Auto-Merge Availability Detection
- **REQ-DETECT-001:** The recipe must determine whether the target
repository has `autoMergeAllowed` enabled before reaching any step that
uses `gh pr merge --auto`.
- **REQ-DETECT-002:** The detection must use the GraphQL
`autoMergeAllowed` field on the repository object, captured into
`context.auto_merge_available`.
- **REQ-DETECT-003:** The detection must occur once per pipeline run
(combined with the existing `check_merge_queue` step or as an adjacent
step).
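Per REQ-DETECT-002, the detection step reads the GraphQL `autoMergeAllowed` field. A minimal sketch of parsing the response body (the function name and the defensive defaults are illustrative, not the recipe's actual helper):

```python
import json

def parse_auto_merge_available(graphql_response: str) -> bool:
    """Extract autoMergeAllowed from a GraphQL response body of the
    shape {"data": {"repository": {"autoMergeAllowed": ...}}}.
    Defaults to False when the field or repository is absent."""
    payload = json.loads(graphql_response)
    repo = payload.get("data", {}).get("repository") or {}
    return bool(repo.get("autoMergeAllowed", False))
```

The result would be captured into `context.auto_merge_available` for the router.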
### ROUTE — Three-Way Merge Routing
- **REQ-ROUTE-001:** `route_queue_mode` must branch into three paths:
queue available, auto-merge available (no queue), and neither available.
- **REQ-ROUTE-002:** When `queue_available == false` and
`auto_merge_available == false`, the recipe must route to a new
`immediate_merge` step that uses `gh pr merge --squash` without the
`--auto` flag.
- **REQ-ROUTE-003:** The `immediate_merge` step must have its own
conflict-fix and retry path analogous to `direct_merge_conflict_fix`.
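The three-way routing in REQ-ROUTE-001/002 (plus the existing opt-out input) can be sketched as a pure decision function; the step names mirror the recipe but this helper itself is hypothetical:

```python
def route_queue_mode(auto_merge: bool, queue_available: bool,
                     auto_merge_available: bool) -> str:
    """Pick the merge path. Step names mirror the recipe steps."""
    if not auto_merge:
        return "confirm_cleanup"    # operator opted out of merging
    if queue_available:
        return "enable_auto_merge"  # merge-queue path
    if auto_merge_available:
        return "direct_merge"       # gh pr merge --squash --auto
    return "immediate_merge"        # plain gh pr merge --squash
```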
### FIX — Affected Step Corrections
- **REQ-FIX-001:** The `direct_merge` step must only be reachable when
`auto_merge_available == true`.
- **REQ-FIX-002:** The `redirect_merge` step (retry after conflict fix)
must also respect auto-merge availability — use `--auto` only when
available, plain `--squash` otherwise.
- **REQ-FIX-003:** The `merge-pr` SKILL.md must detect
`autoMergeAllowed` before choosing between `--auto` and direct merge.
### GUIDE — Orchestrator Guidance Update
- **REQ-GUIDE-001:** The sous-chef MERGE PHASE section must document the
three-way routing including the `autoMergeAllowed` dimension.
- **REQ-GUIDE-002:** The `implementation.yaml` kitchen_rules MERGE
ROUTING entry must reference the auto-merge detection and the
`immediate_merge` path.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 38, 'rankSpacing': 52, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
CI_PASS([CI passed])
subgraph Detection ["Detection Phase"]
direction TB
CMQ["check_merge_queue<br/>━━━━━━━━━━<br/>GraphQL: mergeQueue exists?<br/>→ context.queue_available"]
CAM["★ check_auto_merge<br/>━━━━━━━━━━<br/>GraphQL: autoMergeAllowed?<br/>→ context.auto_merge_available"]
end
subgraph Routing ["● route_queue_mode (4-way router)"]
R1{"auto_merge<br/>input ≠ true?"}
R2{"queue_available<br/>= true?"}
R3{"★ auto_merge_available<br/>= true?"}
end
subgraph QueuePath ["Queue Path"]
EAM["enable_auto_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto<br/>→ merge queue"]
WFQ["wait_for_queue<br/>━━━━━━━━━━<br/>merge queue watcher"]
end
subgraph DirectPath ["Direct Merge Path"]
DM["direct_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto"]
WFDM["wait_for_direct_merge<br/>━━━━━━━━━━<br/>poll 90×10s"]
DMCF["direct_merge_conflict_fix<br/>━━━━━━━━━━<br/>resolve-merge-conflicts"]
REDM["redirect_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash --auto"]
end
subgraph ImmediatePath ["★ Immediate Merge Path (new)"]
IM["★ immediate_merge<br/>━━━━━━━━━━<br/>gh pr merge --squash<br/>(no --auto)"]
WFIM["★ wait_for_immediate_merge<br/>━━━━━━━━━━<br/>poll 30×10s"]
IMCF["★ immediate_merge_conflict_fix<br/>━━━━━━━━━━<br/>resolve-merge-conflicts"]
RPIF["★ re_push_immediate_fix<br/>━━━━━━━━━━<br/>push_to_remote"]
RMI["★ remerge_immediate<br/>━━━━━━━━━━<br/>gh pr merge --squash"]
end
CLEANUP["confirm_cleanup"]
SUCCESS([release_issue_success])
TIMEOUT([release_issue_timeout])
FAILURE([release_issue_failure])
CI_PASS --> CMQ
CMQ -->|"on_success"| CAM
CMQ -->|"on_failure"| SUCCESS
CAM -->|"on_success / on_failure"| R1
R1 -->|"auto_merge ≠ true"| CLEANUP
R1 -->|"else"| R2
R2 -->|"queue_available = true"| EAM
R2 -->|"else"| R3
R3 -->|"auto_merge_available = true"| DM
R3 -->|"default"| IM
EAM -->|"on_success"| WFQ
EAM -->|"on_failure"| CLEANUP
WFQ -->|"merged"| SUCCESS
WFQ -->|"ejected/stalled/timeout"| TIMEOUT
DM -->|"on_success"| WFDM
DM -->|"on_failure"| CLEANUP
WFDM -->|"merged"| SUCCESS
WFDM -->|"closed"| DMCF
WFDM -->|"timeout"| TIMEOUT
DMCF -->|"escalation=true"| FAILURE
DMCF -->|"resolved"| REDM
REDM -->|"on_success"| WFDM
REDM -->|"on_failure"| FAILURE
IM -->|"on_success"| WFIM
IM -->|"on_failure"| CLEANUP
WFIM -->|"merged"| SUCCESS
WFIM -->|"closed"| IMCF
WFIM -->|"timeout"| TIMEOUT
IMCF -->|"escalation=true"| FAILURE
IMCF -->|"resolved"| RPIF
RPIF -->|"on_success"| RMI
RPIF -->|"on_failure"| FAILURE
RMI -->|"on_success"| WFIM
RMI -->|"on_failure"| FAILURE
%% CLASS ASSIGNMENTS %%
class CI_PASS,SUCCESS,TIMEOUT,FAILURE terminal;
class CMQ,CAM detector;
class R1,R2,R3 stateNode;
class EAM,WFQ,DM,WFDM,DMCF,REDM handler;
class IM,WFIM,IMCF,RPIF,RMI newComponent;
class CLEANUP phase;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | CI passed, success, timeout, failure terminals |
| Red | Detector | Detection steps (check_merge_queue, ★ check_auto_merge) |
| Teal | State | ● route_queue_mode routing decision nodes |
| Orange | Handler | Existing queue and direct merge execution steps |
| Green | New Component | ★ New immediate_merge path steps (5 new steps) |
| Purple | Phase | confirm_cleanup (graceful exit) |
Closes #469
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-20260321-193442-670135/temp/make-plan/direct_merge_auto_merge_routing_plan_2026-03-21_194423.md`
## Token Usage Summary
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
The `init()` command in `app.py` placed the security gate
(`_check_secret_scanning`) **after**
user-input collection (`_prompt_test_command`) and file writes
(`atomic_write`). In a
non-interactive environment without `--test-command`, `input()` raised
`EOFError` before the
gate fired — bypassing it entirely with a generic crash exit. The fix is
structural: the
security gate now runs **before any user input or file I/O**.
Additionally, `_prompt_test_command()`
and all other CLI `input()` callers now use a shared
`_require_interactive_stdin()` TTY guard
that fails explicitly in non-interactive contexts, closing the EOFError
escape hatch regardless
of call-site ordering.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
START([START — autoskillit init])
ERROR([ERROR — SystemExit 1])
COMPLETE([COMPLETE — init done])
subgraph Setup ["Setup"]
ScopeCheck{"scope valid?<br/>━━━━━━━━━━<br/>user or project"}
MkDir["config_dir.mkdir()<br/>━━━━━━━━━━<br/>.autoskillit/ created"]
end
subgraph Gate ["● Security Gate — runs FIRST, before any I/O"]
SecGate["● _check_secret_scanning()<br/>━━━━━━━━━━<br/>returns ★ _ScanResult(passed, bypass_accepted)"]
ScannerCheck{"scanner detected?<br/>━━━━━━━━━━<br/>_detect_secret_scanner()"}
NonInterGate["print ERROR block<br/>━━━━━━━━━━<br/>non-interactive: no bypass"]
BypassPrompt["show warning + phrase<br/>━━━━━━━━━━<br/>interactive bypass path"]
PhraseCheck{"phrase matches?<br/>━━━━━━━━━━<br/>_SECRET_SCAN_BYPASS_PHRASE"}
GateFail{"gate.passed?"}
end
subgraph ConfigWrite ["Config Write (gate already cleared — no rollback needed)"]
ConfigExists{"config exists<br/>AND NOT force?"}
AlreadyMsg["print 'already exists'<br/>━━━━━━━━━━<br/>log bypass if accepted"]
TestCmdGiven{"test_command arg<br/>provided?"}
UseGiven["cmd_parts = split()<br/>━━━━━━━━━━<br/>non-interactive path"]
PromptCmd["● _prompt_test_command()<br/>━━━━━━━━━━<br/>★ _require_interactive_stdin() guard"]
TtyCheck{"★ sys.stdin.isatty()?<br/>━━━━━━━━━━<br/>inside _require_interactive_stdin"}
NiTtyFail["print 'requires interactive terminal'<br/>━━━━━━━━━━<br/>explicit SystemExit(1)"]
InputCall["input('Test command [...]:')<br/>━━━━━━━━━━<br/>interactive only"]
WriteConfig["atomic_write(config.yaml)<br/>━━━━━━━━━━<br/>no rollback flag needed"]
LogBypass["_log_secret_scan_bypass()<br/>━━━━━━━━━━<br/>if bypass_accepted"]
end
subgraph Register ["Registration Phase"]
RegAll["_register_all()<br/>━━━━━━━━━━<br/>hooks, MCP server, GitHub repo"]
end
START --> ScopeCheck
ScopeCheck -->|"invalid"| ERROR
ScopeCheck -->|"valid"| MkDir
MkDir --> SecGate
SecGate --> ScannerCheck
ScannerCheck -->|"scanner found"| GateFail
ScannerCheck -->|"not TTY, no scanner"| NonInterGate
NonInterGate --> GateFail
ScannerCheck -->|"TTY, no scanner"| BypassPrompt
BypassPrompt --> PhraseCheck
PhraseCheck -->|"wrong phrase"| GateFail
PhraseCheck -->|"phrase matched"| GateFail
GateFail -->|"passed=False"| ERROR
GateFail -->|"passed=True"| ConfigExists
ConfigExists -->|"yes — skip write"| AlreadyMsg
AlreadyMsg --> RegAll
ConfigExists -->|"no / force"| TestCmdGiven
TestCmdGiven -->|"provided"| UseGiven
TestCmdGiven -->|"not provided"| PromptCmd
PromptCmd --> TtyCheck
TtyCheck -->|"not TTY"| NiTtyFail
NiTtyFail --> ERROR
TtyCheck -->|"is TTY"| InputCall
InputCall --> WriteConfig
UseGiven --> WriteConfig
WriteConfig --> LogBypass
LogBypass --> RegAll
RegAll --> COMPLETE
class START,COMPLETE,ERROR terminal;
class ScopeCheck,ScannerCheck,ConfigExists,TestCmdGiven,PhraseCheck,GateFail stateNode;
class MkDir,AlreadyMsg,UseGiven,InputCall,WriteConfig handler;
class SecGate,NonInterGate,BypassPrompt detector;
class PromptCmd,LogBypass newComponent;
class TtyCheck stateNode;
class RegAll phase;
class NiTtyFail detector;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start, complete, and error terminals |
| Teal | State | Decision points and routing conditions |
| Orange | Handler | Unchanged processing nodes |
| Red | Detector | Security gates, TTY validation, error paths |
| Green | New/Modified | `★` new or `●` modified components |
| Purple | Phase | Registration phase |
### Security Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
START([CLI Invocation])
BLOCKED([BLOCKED — SystemExit 1])
ALLOWED([ALLOWED — proceed])
subgraph Layer1 ["TRUST BOUNDARY 1 — Secret Scanning Gate (autoskillit init)"]
InitEntry["● app.py: init()<br/>━━━━━━━━━━<br/>Entry point — scope validated"]
SecGate["● _check_secret_scanning()<br/>━━━━━━━━━━<br/>★ NOW RUNS FIRST — before any I/O<br/>returns ★ _ScanResult(passed, bypass_accepted)"]
ScannerCheck{"scanner in<br/>.pre-commit-config.yaml?<br/>━━━━━━━━━━<br/>_detect_secret_scanner()"}
TtyCheck1{"sys.stdin.isatty()?<br/>━━━━━━━━━━<br/>inside _check_secret_scanning"}
PhraseCheck{"exact bypass phrase<br/>━━━━━━━━━━<br/>typed by human?"}
BlockNonInter["ERROR: Non-interactive<br/>cannot bypass<br/>━━━━━━━━━━<br/>return _ScanResult(False)"]
BlockWrongPhrase["Aborted: phrase mismatch<br/>━━━━━━━━━━<br/>return _ScanResult(False)"]
BypassAccepted["bypass_accepted=True<br/>━━━━━━━━━━<br/>return _ScanResult(True, True)"]
ScannerPass["scanner found<br/>━━━━━━━━━━<br/>return _ScanResult(True)"]
end
subgraph Layer2 ["TRUST BOUNDARY 2 — Interactive Consent Enforcement"]
TTYGuard["★ _require_interactive_stdin()<br/>━━━━━━━━━━<br/>shared TTY contract enforcer<br/>called by ALL input() callers"]
TtyCheck2{"sys.stdin.isatty()?<br/>━━━━━━━━━━<br/>pre-condition for input()"}
BlockNonTTY["ERROR: requires interactive terminal<br/>━━━━━━━━━━<br/>SystemExit(1) — no bare EOFError"]
end
subgraph Layer3 ["COMMANDS PROTECTED BY TTY BOUNDARY"]
PromptTest["● _prompt_test_command()<br/>━━━━━━━━━━<br/>calls _require_interactive_stdin first"]
CookConfirm["● cook() launch confirm<br/>━━━━━━━━━━<br/>calls _require_interactive_stdin first"]
WorkspaceClean["● workspace clean confirm<br/>━━━━━━━━━━<br/>calls _require_interactive_stdin first"]
end
subgraph Layer4 ["AUDIT TRAIL"]
BypassLog["_log_secret_scan_bypass()<br/>━━━━━━━━━━<br/>timestamp → config.yaml<br/>called AFTER config write"]
ConfigYaml["★ .autoskillit/config.yaml<br/>━━━━━━━━━━<br/>safety.secret_scan_bypass_accepted<br/>ISO-8601 timestamp persisted"]
end
START --> InitEntry
InitEntry --> SecGate
SecGate --> ScannerCheck
ScannerCheck -->|"scanner found"| ScannerPass
ScannerCheck -->|"no scanner"| TtyCheck1
TtyCheck1 -->|"not TTY"| BlockNonInter
TtyCheck1 -->|"is TTY"| PhraseCheck
PhraseCheck -->|"wrong phrase"| BlockWrongPhrase
PhraseCheck -->|"exact match"| BypassAccepted
BlockNonInter --> BLOCKED
BlockWrongPhrase --> BLOCKED
ScannerPass --> ALLOWED
BypassAccepted --> ALLOWED
ALLOWED -->|"init path"| TTYGuard
TTYGuard --> TtyCheck2
TtyCheck2 -->|"not TTY"| BlockNonTTY
BlockNonTTY --> BLOCKED
TtyCheck2 -->|"is TTY"| PromptTest
TtyCheck2 -->|"is TTY"| CookConfirm
TtyCheck2 -->|"is TTY"| WorkspaceClean
BypassAccepted -->|"bypass_accepted=True"| BypassLog
BypassLog --> ConfigYaml
class START,BLOCKED,ALLOWED terminal;
class InitEntry cli;
class SecGate,PromptTest,CookConfirm,WorkspaceClean handler;
class ScannerCheck,TtyCheck1,TtyCheck2,PhraseCheck stateNode;
class BlockNonInter,BlockWrongPhrase,BlockNonTTY detector;
class ScannerPass phase;
class TTYGuard,BypassAccepted newComponent;
class BypassLog,ConfigYaml output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal/CLI | Start, allowed, blocked terminals and entry points |
| Teal | State | Decision/routing points |
| Orange | Handler | Commands and prompt functions |
| Red | Detector | Blocking gates — security violations |
| Purple | Phase | Passing validation states |
| Green | New/Modified | `★` new or `●` modified components |
| Dark Teal | Output | Audit artifacts persisted to disk |
Closes #470
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260321-193454-179265/temp/rectify/rectify_secret_scanning_gate_ordering_2026-03-21_000000_part_a.md`
## Token Usage Summary
No token data available
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
The `.autoskillit/` scope has a **complete, self-reinforcing protection
feedback loop**: a
canonical constant (`_AUTOSKILLIT_GITIGNORE_ENTRIES`) feeds an
idempotent writer
(`ensure_project_temp`), a runtime validator
(`_check_gitignore_completeness`), and a structural
integration test (`test_init_all_created_files_covered_by_gitignore`)
that makes it impossible to
add a new file to `.autoskillit/` without updating the constant — the
test fails immediately.
The root project scope had **none of this**. No constant. No
programmatic writer. No structural
test. The bug survived because there is no equivalent feedback loop for
the root scope.
This PR closes that gap by implementing both halves:
- **Part A (Write Path):** A new `_ROOT_GITIGNORE_ENTRIES` constant in
`core/io.py` and extended `ensure_project_temp()` that idempotently
writes root `.gitignore` entries. Structural tests enforce the invariant
permanently.
- **Part B (Validate Path):** Extended `_check_gitignore_completeness()`
in `_doctor.py` to also validate root `.gitignore` against
`_ROOT_GITIGNORE_ENTRIES`. Doctor now reports false-OK when root
gitignore is missing.
- **Clarified comment** in `.secrets.yaml` template: "gitignored — do
not commit" removes ambiguity about which `.gitignore` provides
protection.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
INIT_START(["autoskillit init"])
DOCTOR_START(["autoskillit doctor"])
INIT_END(["project initialized"])
DOCTOR_END(["doctor report emitted"])
subgraph Constants ["● core/io.py — Constants"]
direction LR
ASKE["_AUTOSKILLIT_GITIGNORE_ENTRIES<br/>━━━━━━━━━━<br/>temp/ · .secrets.yaml · .onboarded · sync_manifest.json"]
RSKE["● _ROOT_GITIGNORE_ENTRIES<br/>━━━━━━━━━━<br/>.autoskillit/.secrets.yaml<br/>.autoskillit/temp/<br/>.autoskillit/.onboarded<br/>.autoskillit/sync_manifest.json"]
end
subgraph InitFlow ["● ensure_project_temp() — core/io.py"]
direction TB
EPT["● ensure_project_temp(project_dir)<br/>━━━━━━━━━━<br/>create .autoskillit/temp/"]
CHECK_SUB{"● .autoskillit/.gitignore<br/>exists?"}
WRITE_SUB_NEW["write .autoskillit/.gitignore<br/>━━━━━━━━━━<br/>from _AUTOSKILLIT_GITIGNORE_ENTRIES"]
WRITE_SUB_APPEND["append missing entries<br/>━━━━━━━━━━<br/>to .autoskillit/.gitignore"]
CHECK_ROOT{"● root .gitignore<br/>exists?"}
WRITE_ROOT_NEW["● write {project_dir}/.gitignore<br/>━━━━━━━━━━<br/>from _ROOT_GITIGNORE_ENTRIES"]
WRITE_ROOT_APPEND["● append missing root entries<br/>━━━━━━━━━━<br/>to {project_dir}/.gitignore"]
end
subgraph InitTail ["● _init_helpers.py"]
CST["● _create_secrets_template()<br/>━━━━━━━━━━<br/>comment: gitignored — do not commit"]
end
subgraph DoctorFlow ["● _check_gitignore_completeness() — _doctor.py"]
direction TB
SCAN_SUB["scan .autoskillit/ entries<br/>━━━━━━━━━━<br/>check against .autoskillit/.gitignore"]
SCAN_ENTRIES["check _AUTOSKILLIT_GITIGNORE_ENTRIES<br/>━━━━━━━━━━<br/>all entries present?"]
CHECK_ROOT_EXISTS{"● root .gitignore<br/>exists?"}
SCAN_ROOT["● scan _ROOT_GITIGNORE_ENTRIES<br/>━━━━━━━━━━<br/>check against root .gitignore"]
VERDICT{"uncovered<br/>entries?"}
WARN["emit WARNING<br/>━━━━━━━━━━<br/>list missing entries"]
OK["emit OK<br/>━━━━━━━━━━<br/>all entries covered"]
end
INIT_START --> EPT
ASKE -.->|"sources"| WRITE_SUB_NEW
ASKE -.->|"sources"| WRITE_SUB_APPEND
RSKE -.->|"sources"| WRITE_ROOT_NEW
RSKE -.->|"sources"| WRITE_ROOT_APPEND
EPT --> CHECK_SUB
CHECK_SUB -->|"no"| WRITE_SUB_NEW
CHECK_SUB -->|"yes"| WRITE_SUB_APPEND
WRITE_SUB_NEW --> CHECK_ROOT
WRITE_SUB_APPEND --> CHECK_ROOT
CHECK_ROOT -->|"no"| WRITE_ROOT_NEW
CHECK_ROOT -->|"yes"| WRITE_ROOT_APPEND
WRITE_ROOT_NEW --> CST
WRITE_ROOT_APPEND --> CST
CST --> INIT_END
DOCTOR_START --> SCAN_SUB
ASKE -.->|"validates against"| SCAN_ENTRIES
RSKE -.->|"validates against"| SCAN_ROOT
SCAN_SUB --> SCAN_ENTRIES
SCAN_ENTRIES --> CHECK_ROOT_EXISTS
CHECK_ROOT_EXISTS -->|"exists"| SCAN_ROOT
CHECK_ROOT_EXISTS -->|"missing"| VERDICT
SCAN_ROOT --> VERDICT
VERDICT -->|"yes"| WARN
VERDICT -->|"no"| OK
WARN --> DOCTOR_END
OK --> DOCTOR_END
class INIT_START,INIT_END,DOCTOR_START,DOCTOR_END terminal;
class ASKE stateNode;
class RSKE newComponent;
class EPT,CST handler;
class CHECK_SUB,CHECK_ROOT,CHECK_ROOT_EXISTS,VERDICT stateNode;
class WRITE_SUB_NEW,WRITE_SUB_APPEND handler;
class WRITE_ROOT_NEW,WRITE_ROOT_APPEND,SCAN_ROOT newComponent;
class SCAN_SUB,SCAN_ENTRIES phase;
class WARN detector;
class OK output;
```
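The write-new vs. append-missing branches in the flow above can be sketched with a hypothetical helper; `ensure_gitignore` is not the real `ensure_project_temp` body, just a minimal illustration of the idempotent contract.

```python
from pathlib import Path

def ensure_gitignore(gitignore: Path, entries: tuple[str, ...]) -> None:
    """Create the file from the canonical entries, or append only what is missing."""
    if not gitignore.exists():
        # "no" branch: write the whole canonical list at once.
        gitignore.write_text("\n".join(entries) + "\n")
        return
    # "yes" branch: append missing entries only, so repeated init runs are no-ops.
    present = set(gitignore.read_text().splitlines())
    missing = [entry for entry in entries if entry not in present]
    if missing:
        with gitignore.open("a") as fh:
            fh.write("\n".join(missing) + "\n")
```

Running it twice with the same entries leaves the file byte-identical, which is what makes `autoskillit init` safe to re-run.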
### Operational Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph CLI ["CLI ENTRY POINTS"]
direction LR
INIT_CMD["autoskillit init<br/>━━━━━━━━━━<br/>project setup wizard"]
DOCTOR_CMD["● autoskillit doctor<br/>━━━━━━━━━━<br/>9 setup checks<br/>--json flag"]
end
subgraph Constants ["● core/io.py — Canonical Constants"]
direction LR
ASKE_CONST["_AUTOSKILLIT_GITIGNORE_ENTRIES<br/>━━━━━━━━━━<br/>temp/ · .secrets.yaml<br/>.onboarded · sync_manifest.json"]
RSKE_CONST["● _ROOT_GITIGNORE_ENTRIES<br/>━━━━━━━━━━<br/>.autoskillit/.secrets.yaml<br/>.autoskillit/temp/<br/>.autoskillit/.onboarded<br/>.autoskillit/sync_manifest.json"]
end
subgraph InitOutput ["● Init Outputs (ensure_project_temp)"]
direction TB
SUB_GI[".autoskillit/.gitignore<br/>━━━━━━━━━━<br/>idempotent write/append"]
ROOT_GI["● {project}/.gitignore<br/>━━━━━━━━━━<br/>idempotent write/append<br/>from _ROOT_GITIGNORE_ENTRIES"]
SECRETS[".autoskillit/.secrets.yaml<br/>━━━━━━━━━━<br/>● comment: gitignored — do not commit"]
CONFIG[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>project config"]
end
subgraph DoctorChecks ["● Doctor Checks (_doctor.py)"]
direction TB
CHECK1["stale_mcp_servers<br/>━━━━━━━━━━<br/>dead binary paths"]
CHECK2["mcp_server_registered<br/>━━━━━━━━━━<br/>plugin or mcpServers entry"]
CHECK3["hook_registration<br/>━━━━━━━━━━<br/>HOOK_REGISTRY scripts present"]
CHECK4["● gitignore_completeness<br/>━━━━━━━━━━<br/>.autoskillit/ coverage +<br/>● root .gitignore coverage"]
CHECK5["secret_scanning_hook<br/>━━━━━━━━━━<br/>.pre-commit-config.yaml"]
end
subgraph DoctorReport ["Doctor Report Outputs"]
direction LR
OK_REPORT["✓ OK results<br/>━━━━━━━━━━<br/>all checks passed"]
WARN_REPORT["⚠ WARNING results<br/>━━━━━━━━━━<br/>actionable guidance"]
ERR_REPORT["✗ ERROR results<br/>━━━━━━━━━━<br/>blocking issues"]
end
INIT_CMD -->|"calls"| SUB_GI
INIT_CMD -->|"calls"| ROOT_GI
INIT_CMD -->|"calls"| SECRETS
INIT_CMD -->|"calls"| CONFIG
ASKE_CONST -.->|"sources"| SUB_GI
RSKE_CONST -.->|"sources"| ROOT_GI
DOCTOR_CMD --> CHECK1
DOCTOR_CMD --> CHECK2
DOCTOR_CMD --> CHECK3
DOCTOR_CMD --> CHECK4
DOCTOR_CMD --> CHECK5
ASKE_CONST -.->|"validates"| CHECK4
RSKE_CONST -.->|"validates"| CHECK4
CHECK4 --> WARN_REPORT
CHECK1 --> ERR_REPORT
CHECK2 --> OK_REPORT
CHECK3 --> OK_REPORT
CHECK5 --> ERR_REPORT
class INIT_CMD,DOCTOR_CMD cli;
class ASKE_CONST stateNode;
class RSKE_CONST newComponent;
class SUB_GI,CONFIG,SECRETS handler;
class ROOT_GI newComponent;
class CHECK1,CHECK2,CHECK3,CHECK5 phase;
class CHECK4 newComponent;
class OK_REPORT output;
class WARN_REPORT detector;
class ERR_REPORT detector;
```
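The filesystem pass of check 9 (scan `.autoskillit/` against the gitignore plus the allowlist) might look like this sketch. `_COMMITTED_BY_DESIGN` matches the allowlist named in this PR; exempting the `.gitignore` file itself and the directory/trailing-slash matching are assumptions of the sketch.

```python
from pathlib import Path

# Allowlist of intentionally committed files, per core/io.py.
_COMMITTED_BY_DESIGN = frozenset({"config.yaml", "recipes"})

def uncovered_files(autoskillit_dir: Path) -> list[str]:
    """Files in .autoskillit/ that are neither gitignored nor allowlisted."""
    gi = autoskillit_dir / ".gitignore"
    ignored = set(gi.read_text().splitlines()) if gi.exists() else set()
    stray: list[str] = []
    for entry in sorted(autoskillit_dir.iterdir()):
        if entry.name == ".gitignore" or entry.name in _COMMITTED_BY_DESIGN:
            continue
        # Directories are matched by their trailing-slash gitignore form too.
        form = entry.name + "/" if entry.is_dir() else entry.name
        if form not in ignored and entry.name not in ignored:
            stray.append(entry.name)
    return stray
```

Any file added to the init flow without a matching gitignore entry or allowlist update shows up in the returned list, which the doctor turns into a warning.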
Closes #471
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260321-195422-226999/temp/rectify/rectify_init-secrets-root-gitignore_2026-03-21_195422_part_a.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
…ART A ONLY (#476)
## Summary
The config system had two crash-inducing architectural weaknesses:
`_log_secret_scan_bypass` wrote `safety.secret_scan_bypass_accepted`
into `config.yaml` — a key not in `SafetyConfig` — causing every
subsequent `load_config()` call to raise `ConfigSchemaError` after a
bypass-accepted `autoskillit init`. Additionally, when `github.token`
appeared in `config.yaml`, the error message provided no actionable fix
steps (no exact YAML to copy, no removal instruction). A third
structural gap: `_SECRETS_ONLY_KEYS` was manually maintained with no
test enforcing completeness against the config dataclasses.

The fixes are threefold:
1. Route bypass timestamps to `.autoskillit/.state.yaml` (never
schema-validated), eliminating the self-inflicted crash.
2. Make `validate_layer_keys` self-diagnosing for misplaced secrets —
the error now includes the exact YAML block and a removal instruction.
3. Add `test_secrets_only_keys_completeness` as a structural guard that
fails if any secret-typed field in `GitHubConfig` is absent from
`_SECRETS_ONLY_KEYS`.

Part B (write-time config validation gateway and `autoskillit doctor`
misplaced-secrets check) is included in this branch.
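The structural guard from the third fix could look like this sketch. `GitHubConfig` here is a hypothetical stand-in for the real dataclass, and the name-based secret markers are an assumption about how "secret-typed" fields are recognized.

```python
from dataclasses import dataclass, fields

# Hypothetical stand-in for the real config dataclass.
@dataclass
class GitHubConfig:
    token: str = ""
    api_url: str = "https://api.github.com"

# Allowlist of keys that may only live in .secrets.yaml.
_SECRETS_ONLY_KEYS = frozenset({"github.token"})

_SECRET_MARKERS = ("token", "key", "secret", "password")

def test_secrets_only_keys_completeness():
    # Any field whose name looks secret-typed must appear in the allowlist,
    # so adding a new secret field without updating it fails this test.
    for f in fields(GitHubConfig):
        if any(marker in f.name.lower() for marker in _SECRET_MARKERS):
            assert f"github.{f.name}" in _SECRETS_ONLY_KEYS, f.name
```

Because the test derives its expectations from the dataclass itself, the allowlist can never silently drift behind the schema.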
## Architecture Impact ### Error/Resilience Diagram ```mermaid %%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%% flowchart TB classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff; classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff; classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff; classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff; classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000; LOAD_CONFIG([load_config]) WRITE_CONFIG([● write_config_layer]) subgraph LayerPipeline ["LAYER LOAD PIPELINE (_make_dynaconf)"] direction TB DEFAULTS["defaults.yaml<br/>━━━━━━━━━━<br/>skip validation"] USER_CFG["~/.autoskillit/config.yaml<br/>━━━━━━━━━━<br/>validated, secrets disallowed"] PROJ_CFG[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>validated, secrets disallowed"] SECRETS_LAYER[".autoskillit/.secrets.yaml<br/>━━━━━━━━━━<br/>validated, secrets allowed"] end subgraph ValidationGate ["● VALIDATION GATE (validate_layer_keys)"] VLK{"validate<br/>layer keys"} ERR_TOP["unknown top key<br/>━━━━━━━━━━<br/>difflib suggestion"] ERR_SECRET["● secret in config.yaml<br/>━━━━━━━━━━<br/>exact YAML fix block<br/>removal instruction"] ERR_SUB["unknown sub-key<br/>━━━━━━━━━━<br/>difflib suggestion"] end subgraph WriteGate ["● WRITE-TIME GATE (write_config_layer)"] WCL["● write_config_layer<br/>━━━━━━━━━━<br/>validates BEFORE write"] WCL_FAIL["write blocked<br/>━━━━━━━━━━<br/>no file touched"] WCL_OK["atomic write<br/>━━━━━━━━━━<br/>schema-valid only"] end subgraph StateIsolation ["● STATE ISOLATION (_log_secret_scan_bypass)"] BYPASS["● _log_secret_scan_bypass<br/>━━━━━━━━━━<br/>bypass timestamp"] 
STATE_YAML[".autoskillit/.state.yaml<br/>━━━━━━━━━━<br/>never validated<br/>internal state"] end subgraph DoctorDiag ["● DOCTOR DIAGNOSTIC (_check_config_layers_for_secrets)"] DOCTOR["● _check_config_layers_for_secrets<br/>━━━━━━━━━━<br/>scans user + project config.yaml<br/>returns DoctorResult, never raises"] DOC_OK["DoctorResult.OK<br/>━━━━━━━━━━<br/>no secrets in config layers"] DOC_ERR["DoctorResult.ERROR<br/>━━━━━━━━━━<br/>actionable YAML fix<br/>+ removal step"] end T_SCHEMA_ERR([ConfigSchemaError]) T_LOADED([AutomationConfig loaded]) T_WRITTEN([file written]) LOAD_CONFIG --> DEFAULTS DEFAULTS --> USER_CFG USER_CFG --> VLK VLK -->|"pass"| PROJ_CFG PROJ_CFG --> VLK VLK -->|"pass"| SECRETS_LAYER SECRETS_LAYER --> VLK VLK -->|"pass — all layers valid"| T_LOADED VLK -->|"unknown top key"| ERR_TOP VLK -->|"secret in non-secrets layer"| ERR_SECRET VLK -->|"unknown sub-key"| ERR_SUB ERR_TOP --> T_SCHEMA_ERR ERR_SECRET --> T_SCHEMA_ERR ERR_SUB --> T_SCHEMA_ERR WRITE_CONFIG --> WCL WCL -->|"validation fails"| WCL_FAIL WCL -->|"schema valid"| WCL_OK WCL_FAIL --> T_SCHEMA_ERR WCL_OK --> T_WRITTEN BYPASS --> STATE_YAML DOCTOR -->|"clean"| DOC_OK DOCTOR -->|"violation found"| DOC_ERR class LOAD_CONFIG,WRITE_CONFIG,T_SCHEMA_ERR,T_LOADED,T_WRITTEN terminal; class DEFAULTS stateNode; class USER_CFG,PROJ_CFG,SECRETS_LAYER handler; class VLK,WCL detector; class ERR_TOP,ERR_SUB,WCL_FAIL,DOC_ERR gap; class ERR_SECRET,BYPASS,DOCTOR,WCL newComponent; class STATE_YAML,WCL_OK,DOC_OK output; ``` **Color Legend:** | Color | Category | Description | |-------|----------|-------------| | Dark Blue | Terminal | Entry points and final states | | Orange | Handler | Config layer paths (validated) | | Dark Teal | State/Output | Defaults layer, output states | | Red | Detector | Validation gates | | Yellow | Gap | Error paths: unknown keys, write blocked, doctor violation | | Green | New/Modified | `ERR_SECRET` (actionable message), `_log_secret_scan_bypass`, 
`_check_config_layers_for_secrets`, `write_config_layer` | ### State Lifecycle Diagram ```mermaid %%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%% flowchart TB classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff; classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff; classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff; classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff; classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000; subgraph StorageLocations ["STORAGE LOCATIONS — MUTATION RULES"] direction LR DEFAULTS_YAML["config/defaults.yaml<br/>━━━━━━━━━━<br/>DEFAULTS<br/>read-only baseline<br/>never validated"] CONFIG_YAML["● .autoskillit/config.yaml<br/>━━━━━━━━━━<br/>SCHEMA_LOCKED<br/>secrets forbidden<br/>validated on every load + write"] SECRETS_YAML[".autoskillit/.secrets.yaml<br/>━━━━━━━━━━<br/>SECRETS_ONLY<br/>secrets allowed here only<br/>validated on every load"] STATE_YAML["● .autoskillit/.state.yaml<br/>━━━━━━━━━━<br/>INTERNAL_STATE<br/>no schema enforcement<br/>never passed to validate_layer_keys"] end subgraph SecretContract ["● SECRET KEY CONTRACT (_SECRETS_ONLY_KEYS)"] SOK["● _SECRETS_ONLY_KEYS<br/>━━━━━━━━━━<br/>frozenset{'github.token'}<br/>re-exported from config/__init__.py<br/>used by validate_layer_keys + doctor"] SOK_TEST["● test_secrets_only_keys_completeness<br/>━━━━━━━━━━<br/>structural guard: asserts every<br/>token/key/secret field in GitHubConfig<br/>is in _SECRETS_ONLY_KEYS"] end subgraph LoadGate ["LOAD-TIME SCHEMA GATE (validate_layer_keys)"] VLK{"validate<br/>layer keys"} VLK_SCHEMA["● schema check<br/>━━━━━━━━━━<br/>unknown top/sub keys<br/>→ ConfigSchemaError"] VLK_SECRET["● 
secrets-placement check<br/>━━━━━━━━━━<br/>SECRETS_ONLY key in non-secrets layer<br/>→ actionable ConfigSchemaError"] end subgraph WriteGate2 ["● WRITE-TIME SCHEMA GATE (write_config_layer)"] WCL2["● write_config_layer<br/>━━━━━━━━━━<br/>validates before write<br/>schema-valid only"] WCL_BLOCK["write blocked<br/>━━━━━━━━━━<br/>no file modified"] end subgraph BypassRoute ["● BYPASS STATE ISOLATION (_log_secret_scan_bypass)"] BYPASS_FN["● _log_secret_scan_bypass<br/>━━━━━━━━━━<br/>writes bypass timestamp to .state.yaml<br/>never touches config.yaml"] end subgraph DoctorGate2 ["● DOCTOR GATE (_check_config_layers_for_secrets)"] DOCTOR2["● _check_config_layers_for_secrets<br/>━━━━━━━━━━<br/>reads _SECRETS_ONLY_KEYS<br/>scans user + project config.yaml"] DOC_CLEAN["DoctorResult.OK"] DOC_CORRUPT["DoctorResult.ERROR<br/>━━━━━━━━━━<br/>contract violation detected<br/>actionable fix guidance"] end T_SCHEMA_ERR2([ConfigSchemaError]) T_LOADED2([AutomationConfig loaded]) T_WRITTEN2([file written]) DEFAULTS_YAML -->|"merged first, no validation"| VLK CONFIG_YAML -->|"validated, secrets forbidden"| VLK SECRETS_YAML -->|"validated, is_secrets=True"| VLK VLK -->|"pass"| T_LOADED2 VLK -->|"unknown key"| VLK_SCHEMA VLK -->|"secret in config.yaml"| VLK_SECRET VLK_SCHEMA --> T_SCHEMA_ERR2 VLK_SECRET --> T_SCHEMA_ERR2 WCL2 -->|"schema valid"| T_WRITTEN2 WCL2 -->|"validation fails"| WCL_BLOCK WCL_BLOCK --> T_SCHEMA_ERR2 BYPASS_FN -->|"writes to"| STATE_YAML SOK -->|"drives"| VLK_SECRET SOK -->|"drives"| DOCTOR2 SOK_TEST -.->|"prevents drift"| SOK DOCTOR2 -->|"clean"| DOC_CLEAN DOCTOR2 -->|"violation"| DOC_CORRUPT class DEFAULTS_YAML stateNode; class CONFIG_YAML,SECRETS_YAML handler; class STATE_YAML,T_LOADED2,T_WRITTEN2 newComponent; class VLK detector; class VLK_SCHEMA gap; class VLK_SECRET,WCL2,BYPASS_FN,DOCTOR2,SOK,SOK_TEST newComponent; class WCL_BLOCK,DOC_CORRUPT gap; class DOC_CLEAN output; class T_SCHEMA_ERR2,T_LOADED2,T_WRITTEN2 terminal; ``` **Color Legend:** | Color | Category 
| Description | |-------|----------|-------------| | Dark Teal | Defaults | Baseline layer (never validated) | | Orange | Config/Secrets | Validated YAML storage locations | | Green | New/Modified | `write_config_layer`, `_log_secret_scan_bypass`, `_SECRETS_ONLY_KEYS`, doctor check, `.state.yaml` | | Red | Detector | `validate_layer_keys` gate | | Yellow | Gap | Error paths: unknown key, write blocked, doctor violation | | Dark Blue | Terminal | Entry/exit states and `ConfigSchemaError` | ### Operational Diagram ```mermaid %%{init: {'flowchart': {'nodeSpacing': 45, 'rankSpacing': 55, 'curve': 'basis'}}}%% flowchart TB classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff; classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff; classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff; classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff; classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000; subgraph InitCmd ["autoskillit init (● modified)"] direction TB INIT_CMD["autoskillit init<br/>━━━━━━━━━━<br/>interactive project setup"] SCAN_CHECK["_check_secret_scanning<br/>━━━━━━━━━━<br/>detect pre-commit scanner<br/>or prompt for bypass consent"] BYPASS_OK["bypass accepted<br/>━━━━━━━━━━<br/>user typed exact phrase"] NO_BYPASS["scanner found / no consent<br/>━━━━━━━━━━<br/>normal path / abort"] LOG_BYPASS["● _log_secret_scan_bypass<br/>━━━━━━━━━━<br/>writes bypass timestamp<br/>to .state.yaml ONLY"] WRITE_CFG["write_config_layer<br/>━━━━━━━━━━<br/>writes user config to<br/>.autoskillit/config.yaml<br/>(schema-validated)"] REGISTER["_register_mcp_server<br/>━━━━━━━━━━<br/>writes ~/.claude.json"] end subgraph DoctorCmd ["autoskillit doctor (● modified)"] direction TB 
DOCTOR_CMD["autoskillit doctor<br/>━━━━━━━━━━<br/>runs 9 health checks<br/>always all, no early exit"] CHK1["Check 1–4: version, scanner,<br/>config existence, MCP server<br/>━━━━━━━━━━<br/>existing checks"] CHK4B["● Check 4b: _check_config_layers_for_secrets<br/>━━━━━━━━━━<br/>scans ~/.autoskillit/config.yaml<br/>+ .autoskillit/config.yaml<br/>for _SECRETS_ONLY_KEYS violations"] CHK5N["Check 5–9: hooks, git,<br/>worktree, recipe, branch<br/>━━━━━━━━━━<br/>existing checks"] DOCTOR_OK["All checks OK<br/>━━━━━━━━━━<br/>Severity.OK for each"] DOCTOR_ERR["● Severity.ERROR<br/>━━━━━━━━━━<br/>Shows: dotted key path,<br/>exact YAML fix block,<br/>removal instruction"] end subgraph ConfigFiles ["CONFIGURATION FILES"] direction TB USER_CFG2["~/.autoskillit/config.yaml<br/>━━━━━━━━━━<br/>user-level config<br/>SCHEMA_LOCKED<br/>secrets forbidden"] PROJ_CFG2[".autoskillit/config.yaml<br/>━━━━━━━━━━<br/>project-level config<br/>SCHEMA_LOCKED<br/>secrets forbidden"] SECRETS_FILE2[".autoskillit/.secrets.yaml<br/>━━━━━━━━━━<br/>secret keys here only<br/>e.g. 
github.token"] STATE_FILE2["● .autoskillit/.state.yaml<br/>━━━━━━━━━━<br/>internal operational state<br/>bypass timestamp<br/>never schema-validated"] end subgraph LoadConfig2 ["load_config (called by all commands)"] LC2["● load_config<br/>━━━━━━━━━━<br/>merges all layers<br/>validates each (except defaults)"] LC_ERR2["● ConfigSchemaError<br/>━━━━━━━━━━<br/>actionable: exact YAML fix<br/>+ removal instruction"] end INIT_CMD --> SCAN_CHECK SCAN_CHECK -->|"bypass phrase matched"| BYPASS_OK SCAN_CHECK -->|"scanner OK / no consent"| NO_BYPASS BYPASS_OK --> LOG_BYPASS LOG_BYPASS -->|"writes bypass timestamp"| STATE_FILE2 NO_BYPASS --> WRITE_CFG WRITE_CFG -->|"schema-validated write"| PROJ_CFG2 WRITE_CFG --> REGISTER DOCTOR_CMD --> CHK1 CHK1 --> CHK4B CHK4B -->|"reads"| USER_CFG2 CHK4B -->|"reads"| PROJ_CFG2 CHK4B -->|"clean"| DOCTOR_OK CHK4B -->|"violation found"| DOCTOR_ERR CHK4B --> CHK5N LC2 -->|"validated layers"| USER_CFG2 LC2 -->|"validated layers"| PROJ_CFG2 LC2 -->|"validated layers"| SECRETS_FILE2 LC2 -->|"secret in config.yaml"| LC_ERR2 class INIT_CMD,DOCTOR_CMD cli; class SCAN_CHECK,NO_BYPASS,WRITE_CFG,REGISTER,CHK1,CHK5N handler; class BYPASS_OK,LOG_BYPASS,CHK4B,DOCTOR_ERR,LC_ERR2,LC2 newComponent; class USER_CFG2,PROJ_CFG2,SECRETS_FILE2 phase; class STATE_FILE2 newComponent; class DOCTOR_OK,LC2 output; class LC2 detector; ``` **Color Legend:** | Color | Category | Description | |-------|----------|-------------| | Dark Blue | CLI | Entry-point commands | | Orange | Handler | Existing processing steps | | Green | New/Modified | `● _log_secret_scan_bypass`, `● _check_config_layers_for_secrets`, `● .state.yaml`, `● ConfigSchemaError` (actionable) | | Purple | Config | Configuration file layers | | Red | Detector | `load_config` schema validation gate | | Dark Teal | Output | Success results | ## Implementation Plan Plan file: 
`/home/talon/projects/autoskillit-runs/remediation-20260322-002721-721157/temp/rectify/rectify_config-schema-immunity_2026-03-22_000000_part_a.md` 🤖 Generated with [Claude Code](https://claude.com/claude-code) via AutoSkillit --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
… ONLY (#478)
## Summary
`ingredients_to_terminal` in `cli/_ansi.py` received a GFM markdown
string from `format_ingredients_table`, parsed it back into rows, then
computed column widths with `max(len(...))` — no ceiling. For
`implementation.yaml`, the `run_mode` description collapsed to 220+
characters, forcing every terminal line to 245–254 characters.

The architectural weakness is not just the missing cap: it is that the
terminal rendering path took a roundabout route through a GFM serialized
string when the structured `Recipe` object was already available at the
call site. This creates implicit coupling, duplicated width
computations, and a rendering contract that is enforced nowhere.

Part A introduces a `TerminalColumn` abstraction that makes width bounds
**structural** — part of the column specification itself — and routes
the terminal path directly from structured data, eliminating the fragile
GFM round-trip. `TerminalColumn` and `_render_terminal_table` are placed
in `core/` (L0) to fix a pre-existing L1→L3 layer violation where
`pipeline/telemetry_fmt.py` was importing from `cli/_ansi.py`.
## Architecture Impact ### Module Dependency Diagram ```mermaid %%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 70, 'curve': 'basis'}}}%% graph TB classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff; classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff; classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff; classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff; classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff; classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; subgraph L0 ["L0 — CORE (Foundation · stdlib only)"] direction LR CORE_TT["★ core/_terminal_table.py<br/>━━━━━━━━━━<br/>TerminalColumn NamedTuple<br/>_render_terminal_table()<br/>color-agnostic · pure stdlib"] CORE_INIT["● core/__init__.py<br/>━━━━━━━━━━<br/>re-exports TerminalColumn<br/>re-exports _render_terminal_table"] end subgraph L1 ["L1 — PIPELINE (Service Layer)"] direction LR TELEM["● pipeline/telemetry_fmt.py<br/>━━━━━━━━━━<br/>imports TerminalColumn from core<br/>format_token_table_terminal()<br/>format_timing_table_terminal()"] end subgraph L2 ["L2 — RECIPE (Domain Layer)"] direction LR API["● recipe/_api.py<br/>━━━━━━━━━━<br/>_build_ingredient_rows()<br/>format_ingredients_table()<br/>load_and_validate()"] RECIPE_INIT["● recipe/__init__.py<br/>━━━━━━━━━━<br/>re-exports _build_ingredient_rows"] end subgraph L3 ["L3 — CLI (Application Layer)"] direction LR CLI_TT["★ cli/_terminal_table.py<br/>━━━━━━━━━━<br/>re-export shim<br/>← imports from core/__init__"] ANSI["● cli/_ansi.py<br/>━━━━━━━━━━<br/>TerminalColumn (own copy·colored)<br/>_render_terminal_table (colored)<br/>ingredients_to_terminal()"] PROMPTS["● 
cli/_prompts.py<br/>━━━━━━━━━━<br/>show_cook_preview()<br/>← TYPE_CHECKING import<br/>_build_ingredient_rows"] end CORE_TT -->|"defines · re-exported by"| CORE_INIT CORE_INIT -->|"imports TerminalColumn<br/>_render_terminal_table"| TELEM API -->|"_build_ingredient_rows re-exported"| RECIPE_INIT CORE_INIT -->|"imports TerminalColumn<br/>_render_terminal_table"| CLI_TT RECIPE_INIT -->|"TYPE_CHECKING import<br/>_build_ingredient_rows"| PROMPTS ANSI -.->|"defines own copy<br/>(color-enhanced variant)"| ANSI class CORE_TT newComponent; class CORE_INIT stateNode; class TELEM phase; class API,RECIPE_INIT handler; class CLI_TT newComponent; class ANSI,PROMPTS cli; ``` **Color Legend:** | Color | Category | Description | |-------|----------|-------------| | Green | New Component | New files introduced by this PR (★) | | Teal | Core Export | `core/__init__.py` — high fan-in re-export hub | | Purple | Pipeline | `pipeline/telemetry_fmt.py` (L1) — now correctly imports from L0 | | Orange | Recipe | `recipe/_api.py` and `recipe/__init__.py` (L2) | | Dark Blue | CLI | `cli/_terminal_table.py`, `cli/_ansi.py`, `cli/_prompts.py` (L3) | | Solid arrows | Valid | Downward dependencies (higher → lower layer) | ### Data Lineage Diagram ```mermaid %%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%% flowchart LR classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff; classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff; classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff; classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff; classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; subgraph Origin ["Data Origin"] 
YAML["implementation.yaml<br/>━━━━━━━━━━<br/>run_mode: 220-char desc<br/>structured YAML ingredients"] RECIPE["Recipe dataclass<br/>━━━━━━━━━━<br/>RecipeIngredient dict<br/>structured source of truth"] end subgraph TermPath ["★ Terminal Path (new structured route)"] direction TB ROWS["★ _build_ingredient_rows()<br/>━━━━━━━━━━<br/>recipe/_api.py<br/>list[tuple[str, str, str]]<br/>full-length descriptions"] COLS["● TerminalColumn spec<br/>━━━━━━━━━━<br/>core/_terminal_table.py<br/>max_desc=60, align=left"] RENDER["● ingredients_to_terminal()<br/>━━━━━━━━━━<br/>cli/_ansi.py<br/>accepts structured rows<br/>truncates at 60 chars + …"] ANSI["ANSI Terminal Output<br/>━━━━━━━━━━<br/>≤90 chars/line<br/>run_mode: 'Execution mode…'"] end subgraph LLMPath ["LLM Path (unchanged)"] direction TB GFM["format_ingredients_table()<br/>━━━━━━━━━━<br/>recipe/_api.py<br/>GFM markdown string<br/>full 220-char descriptions"] LOAD["load_and_validate()<br/>━━━━━━━━━━<br/>ingredients_table field<br/>in LoadRecipeResult"] LLM["Claude / LLM UI<br/>━━━━━━━━━━<br/>markdown renderer<br/>displays full text"] end YAML -->|"load_yaml() + parse"| RECIPE RECIPE -->|"★ _build_ingredient_rows()<br/>structured rows"| ROWS ROWS -->|"column spec applied"| COLS COLS -->|"_render_terminal_table()"| RENDER RENDER -->|"bounded ANSI output"| ANSI RECIPE -->|"format_ingredients_table()<br/>GFM string"| GFM GFM -->|"LoadRecipeResult"| LOAD LOAD -->|"MCP response"| LLM class YAML cli; class RECIPE stateNode; class ROWS,COLS newComponent; class RENDER handler; class ANSI output; class GFM phase; class LOAD integration; class LLM integration; ``` **Color Legend:** | Color | Category | Description | |-------|----------|-------------| | Dark Blue | Origin | YAML source data | | Teal | Source of Truth | `Recipe` dataclass — structured ingredient data | | Green | New/Modified | `_build_ingredient_rows()`, `TerminalColumn` spec (★/●) | | Orange | Transformer | `ingredients_to_terminal()` — structured-rows renderer | | 
Dark Teal | Terminal Output | Bounded ANSI output (≤90 chars/line) | | Purple | GFM Stage | `format_ingredients_table()` — LLM path, unchanged | | Red | External Consumer | `load_and_validate()` MCP response + Claude UI | ### Operational Diagram ```mermaid %%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%% flowchart TB classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff; classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff; classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff; classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff; classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff; classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff; subgraph Entrypoint ["CLI ENTRY POINT"] ORDER["autoskillit order<br/>━━━━━━━━━━<br/>cli/app.py:order()<br/>recipe selection TUI"] end subgraph Preview ["★ PARAMETER PREVIEW (changed rendering)"] direction TB PREVIEW["● show_cook_preview()<br/>━━━━━━━━━━<br/>cli/_prompts.py<br/>calls _build_ingredient_rows()<br/>NOT format_ingredients_table()"] BUILD["★ _build_ingredient_rows()<br/>━━━━━━━━━━<br/>recipe/_api.py (re-exported)<br/>list[tuple[str,str,str]]<br/>full-length descriptions"] RENDER["● ingredients_to_terminal()<br/>━━━━━━━━━━<br/>cli/_ansi.py<br/>accepts structured rows<br/>max_desc_width=60"] COL_SPEC["★ TerminalColumn spec<br/>━━━━━━━━━━<br/>core/_terminal_table.py<br/>Name≤30 Desc≤60 Default≤20"] end subgraph Output ["TERMINAL OUTPUT"] TERM["● Bounded ANSI Table<br/>━━━━━━━━━━<br/>≤90 chars/line<br/>long descriptions → 'Execution…'<br/>columns stay aligned"] end subgraph Telemetry ["● TELEMETRY DISPLAY (also fixed)"] direction TB TOKEN["● format_token_table_terminal()<br/>━━━━━━━━━━<br/>pipeline/telemetry_fmt.py<br/>imports TerminalColumn from core/"] TIMING["● 
format_timing_table_terminal()<br/>━━━━━━━━━━<br/>pipeline/telemetry_fmt.py<br/>imports TerminalColumn from core/"] TEL_OUT["Telemetry Table Output<br/>━━━━━━━━━━<br/>step name ≤40 chars<br/>all columns bounded"] end ORDER -->|"recipe selected"| PREVIEW PREVIEW -->|"calls"| BUILD BUILD -->|"rows passed to"| RENDER COL_SPEC -->|"width constraints"| RENDER RENDER -->|"print()"| TERM ORDER -.->|"post-run telemetry"| TOKEN ORDER -.->|"post-run telemetry"| TIMING TOKEN -->|"print()"| TEL_OUT TIMING -->|"print()"| TEL_OUT class ORDER cli; class PREVIEW,RENDER handler; class BUILD,COL_SPEC newComponent; class TERM,TEL_OUT output; class TOKEN,TIMING phase; ``` **Color Legend:** | Color | Category | Description | |-------|----------|-------------| | Dark Blue | CLI Entry | `autoskillit order` command | | Orange | Renderer | `show_cook_preview()`, `ingredients_to_terminal()` (●) | | Green | New Component | `_build_ingredient_rows()`, `TerminalColumn` spec (★/●) | | Dark Teal | Terminal Output | Bounded ANSI output — operator-visible result | | Purple | Telemetry | `format_token_table_terminal()`, `format_timing_table_terminal()` (●) | Closes #475 ## Implementation Plan Plan file: `/home/talon/projects/autoskillit-runs/remediation-20260322-003904-444622/temp/rectify/rectify_order-parameter-table-formatting_2026-03-22_000000_part_a.md` 🤖 Generated with [Claude Code](https://claude.com/claude-code) via AutoSkillit --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
This PR completes the relocation of all temp directory references from
bare `temp/` paths to `.autoskillit/temp/` throughout the codebase,
addressing issue #468. The changes span skill SKILL.md instructions,
recipe YAML files, Python source strings, tests, and documentation, with
the final gap being a one-line fix to a prose `note:` field in
`remediation.yaml` that still referenced `temp/diagnose-ci/`. A
regression test is added to `tests/recipe/test_bundled_recipes.py` to
guard against bare `temp/` paths re-appearing in bundled recipe note
fields.
<details>
<summary>Individual Group Plans</summary>

### Group 1: Implementation Plan: Relocate Temp Directory from `temp/` to `.autoskillit/temp/`
`ensure_project_temp()` in `core/io.py` already creates and returns
`.autoskillit/temp/` — the runtime is already correct. The remaining
work is a systematic text-replacement across skill SKILL.md
instructions, recipe YAML files, Python source strings, tests, and
documentation: every reference to `temp/skill-name/` must become
`.autoskillit/temp/skill-name/`.

### Group 2: Implementation Plan: Relocate temp/ Path in remediation.yaml diagnose-ci Note
A single note field in `src/autoskillit/recipes/remediation.yaml` (line
864) still references `temp/diagnose-ci/` instead of
`.autoskillit/temp/diagnose-ci/`. This is the sole remaining gap from
the `relocate-temp-directory` implementation (issue #468), as identified
by the `audit-impl` remediation report. The fix is a one-line change to
the prose `note:` field documenting where the `diagnose_ci` step writes
its output. A regression test is also added to
`tests/recipe/test_bundled_recipes.py` to ensure no bundled recipe YAML
note fields carry bare `temp/` paths in the future.
</details>

## Requirements
### CORE
- **REQ-CORE-001:** The `ensure_project_temp` function must return a path rooted at `.autoskillit/temp/` relative to the project root, not `temp/`.
- **REQ-CORE-002:** The `.autoskillit/temp/` directory must be created automatically if it does not exist when `ensure_project_temp` is called.
- **REQ-CORE-003:** All production code that constructs temp file paths must use `ensure_project_temp` as the single source of truth — no hardcoded `temp/` literals.
### GIT
- **REQ-GIT-001:** The project `.gitignore` must include a pattern that excludes `.autoskillit/temp/` from version control.
- **REQ-GIT-002:** The legacy `temp/` gitignore entry may be retained for backward compatibility but must not be the primary mechanism.
### RECIPE
- **REQ-RECIPE-001:** All bundled recipe YAML files that reference `temp/` paths must be updated to reference `.autoskillit/temp/`.
- **REQ-RECIPE-002:** All bundled skill SKILL.md files that reference `temp/` output paths must be updated to reference `.autoskillit/temp/`.
- **REQ-RECIPE-003:** Recipe validation and semantic rules that check temp directory paths must recognize `.autoskillit/temp/` as the canonical location.
### DOCS
- **REQ-DOCS-001:** The CLAUDE.md "Temporary Files" rule must reference `.autoskillit/temp/` as the required destination for temp files.
### TEST
- **REQ-TEST-001:** All existing tests that assert on temp directory paths must be updated to expect `.autoskillit/temp/`.
- **REQ-TEST-002:** A test must verify that `ensure_project_temp` returns a path under `.autoskillit/temp/`.
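REQ-CORE-001 and REQ-CORE-002 together pin down the observable contract of `ensure_project_temp`. A minimal sketch of that contract, for illustration only (the real function in `core/io.py` also manages `.autoskillit/.gitignore` entries):

```python
from pathlib import Path


def ensure_project_temp(project_root: Path) -> Path:
    """Return the canonical temp directory, creating it if missing.

    Illustrative sketch of the REQ-CORE-001/002 behavior; the real
    implementation in core/io.py does more (gitignore management).
    """
    temp_dir = project_root / ".autoskillit" / "temp"
    temp_dir.mkdir(parents=True, exist_ok=True)  # REQ-CORE-002: auto-create
    return temp_dir  # REQ-CORE-001: rooted at .autoskillit/temp/
```

REQ-TEST-002 then reduces to asserting that the returned path is under `.autoskillit/temp/` and exists on disk.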
## Architecture Impact
### Operational Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
subgraph Entry ["CLI ENTRY POINTS"]
COOK["autoskillit cook<br/>━━━━━━━━━━<br/>Interactive skill session<br/>skill execution"]
ORDER["autoskillit order<br/>━━━━━━━━━━<br/>Recipe pipeline<br/>orchestration"]
end
subgraph Validation ["● PATH VALIDATION HOOK"]
HOOK["● skill_cmd_check.py<br/>━━━━━━━━━━<br/>_PATH_PREFIXES:<br/>('/', './', '.autoskillit/')"]
REJECT["bare temp/ rejected<br/>━━━━━━━━━━<br/>no longer a valid<br/>path prefix"]
end
subgraph Config ["● RECIPE CONFIGURATION"]
IMPL["● implementation.yaml<br/>━━━━━━━━━━<br/>diagnose-ci paths<br/>updated"]
MERGE["● merge-prs.yaml<br/>━━━━━━━━━━<br/>plans_dir default:<br/>.autoskillit/temp/merge-prs"]
REMED["● remediation.yaml<br/>━━━━━━━━━━<br/>diagnose-ci note<br/>path corrected"]
IMPLG["● implementation-groups.yaml<br/>━━━━━━━━━━<br/>group plan paths<br/>updated"]
end
subgraph Foundation ["CORE FOUNDATION (unchanged)"]
EPT["ensure_project_temp()<br/>━━━━━━━━━━<br/>creates .autoskillit/temp/<br/>manages .gitignore entries"]
end
subgraph Skills ["● SKILL OUTPUT PATHS (58 SKILL.md files)"]
SKILLOUT["● SKILL.md instructions<br/>━━━━━━━━━━<br/>output → .autoskillit/temp/skill-name/<br/>(was: temp/skill-name/)"]
end
subgraph Artifacts ["ARTIFACT STORAGE"]
TEMPDIR[".autoskillit/temp/<br/>━━━━━━━━━━<br/>.autoskillit/temp/make-plan/<br/>.autoskillit/temp/investigate/<br/>.autoskillit/temp/open-pr/<br/>..."]
GITIGNORE[".autoskillit/.gitignore<br/>━━━━━━━━━━<br/>entry: temp/<br/>(relative — correct as-is)"]
end
COOK -->|"run_skill"| HOOK
ORDER -->|"run_skill"| HOOK
HOOK -->|"valid .autoskillit/ path"| SKILLOUT
HOOK -->|"invalid bare temp/"| REJECT
SKILLOUT -->|"writes to"| TEMPDIR
EPT -->|"creates"| TEMPDIR
TEMPDIR --> GITIGNORE
MERGE -->|"default plans_dir"| TEMPDIR
IMPL -->|"diagnose path"| TEMPDIR
REMED -->|"note corrected"| TEMPDIR
IMPLG -->|"group paths"| TEMPDIR
class COOK,ORDER cli;
class HOOK,REJECT detector;
class EPT stateNode;
class SKILLOUT phase;
class TEMPDIR,GITIGNORE output;
class IMPL,MERGE,REMED,IMPLG handler;
```
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([PreToolUse: run_skill invoked])
ALLOW([EXIT 0 — allowed])
DENY([EXIT 0 + JSON deny — blocked])
subgraph Parse ["PARSE PHASE"]
direction TB
READ["Read stdin JSON<br/>━━━━━━━━━━<br/>extract tool_input<br/>extract skill_command"]
PARSESKILL{"skill_command<br/>present?"}
PARSENAME{"skill in<br/>PATH_ARG_SKILLS?<br/>━━━━━━━━━━<br/>(implement-worktree,<br/>implement-worktree-no-merge,<br/>retry-worktree,<br/>resolve-failures)"}
end
subgraph Validate ["● PATH VALIDATION PHASE"]
direction TB
SPLIT["Split args into tokens"]
FIRSTPATH{"first token<br/>path-like?<br/>━━━━━━━━━━<br/>● _PATH_PREFIXES:<br/>('/', './', '.autoskillit/')<br/>(was: + 'temp/')"}
LATERPATH{"any later token<br/>path-like?"}
BUILDFIX["Build correction:<br/>skill_name + path_token"]
end
%% FLOW %%
START --> READ
READ --> PARSESKILL
PARSESKILL -->|"no"| ALLOW
PARSESKILL -->|"yes"| PARSENAME
PARSENAME -->|"not a path-arg skill"| ALLOW
PARSENAME -->|"is path-arg skill"| SPLIT
SPLIT --> FIRSTPATH
FIRSTPATH -->|"yes — correct format"| ALLOW
FIRSTPATH -->|"no — first token not path-like"| LATERPATH
LATERPATH -->|"no path found anywhere<br/>(allow — skill Step 0 handles)"| ALLOW
LATERPATH -->|"path found but not first<br/>(anti-pattern detected)"| BUILDFIX
BUILDFIX --> DENY
%% CLASS ASSIGNMENTS %%
class START,ALLOW,DENY terminal;
class READ,SPLIT handler;
class PARSESKILL,PARSENAME stateNode;
class FIRSTPATH,LATERPATH detector;
class BUILDFIX phase;
```
Closes #468
## Implementation Plan
Plan files:
- `/home/talon/projects/autoskillit-runs/impl-20260322-120405-819376/temp/make-plan/relocate_temp_directory_plan_2026-03-22_120000.md`
- `/home/talon/projects/autoskillit-runs/impl-20260322-120405-819376/temp/make-plan/relocate_temp_directory_remediation_plan_2026-03-22_125200.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
… B ONLY (#483)
## Summary
Part A added the `advisory-step-missing-context-limit` semantic rule and
fixed the `review` step in three recipes. After Part A, the rule still
emitted WARNINGs for three other advisory steps (`audit_impl`,
`open_pr_step`, `ci_conflict_fix`) in each of the three main recipes.
Part B resolves those remaining WARNINGs by adding explicit
`on_context_limit` routes, then tightens the test gate so that zero
advisory-step WARNINGs are permitted in bundled recipes going forward.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
START([START])
ESCALATE([ESCALATE_STOP])
RELEASE_FAIL([RELEASE_ISSUE_FAILURE])
subgraph AdvisoryGate ["Advisory Step Gate (skip_when_false)"]
direction TB
REVIEW["● review<br/>━━━━━━━━━━<br/>skip_when_false: review_approach<br/>retries: 1<br/>on_context_limit: dry_walkthrough"]
AUDIT["● audit_impl<br/>━━━━━━━━━━<br/>skip_when_false: inputs.audit<br/>on_context_limit: escalate_stop"]
OPEN_PR["● open_pr_step<br/>━━━━━━━━━━<br/>skip_when_false: inputs.open_pr<br/>on_context_limit: release_issue_failure"]
CI_FIX["● ci_conflict_fix<br/>━━━━━━━━━━<br/>skip_when_false: detect_ci_conflicts<br/>on_context_limit: release_issue_failure"]
end
subgraph SemanticRule ["● advisory-step-missing-context-limit Rule"]
direction LR
RULE["advisory-step-missing-context-limit<br/>━━━━━━━━━━<br/>WARNING: run_skill + skip_when_false<br/>but no on_context_limit"]
RULECHECK{"on_context_limit<br/>set?"}
end
subgraph NormalFlow ["Normal Execution Path"]
direction TB
DRY["dry_walkthrough<br/>━━━━━━━━━━<br/>retries: 3"]
TEST["test"]
MERGE["merge"]
PUSH["push"]
CI_WATCH["ci_watch"]
end
subgraph AuditVerdict ["Audit Verdict Routing"]
direction LR
VERDICT{"audit verdict"}
REMEDIATE["remediate → make_plan"]
end
START --> REVIEW
REVIEW -->|"on_success"| DRY
REVIEW -->|"on_context_limit (●new)"| DRY
REVIEW -->|"on_failure"| RELEASE_FAIL
DRY --> TEST
TEST --> AUDIT
AUDIT -->|"on_result: GO"| MERGE
AUDIT --> VERDICT
VERDICT -->|"NO GO"| REMEDIATE
VERDICT -->|"error"| ESCALATE
AUDIT -->|"on_context_limit (●new)"| ESCALATE
AUDIT -->|"on_failure"| ESCALATE
AUDIT -->|"skipped"| MERGE
REMEDIATE --> DRY
MERGE --> PUSH
PUSH --> OPEN_PR
OPEN_PR -->|"on_success"| CI_WATCH
OPEN_PR -->|"on_context_limit (●new)"| RELEASE_FAIL
OPEN_PR -->|"on_failure"| RELEASE_FAIL
OPEN_PR -->|"skipped"| CI_WATCH
CI_WATCH -->|"on_success"| MERGE
CI_WATCH -->|"on_failure → detect_ci_conflict"| CI_FIX
CI_FIX -->|"on_success"| PUSH
CI_FIX -->|"on_context_limit (●new)"| RELEASE_FAIL
CI_FIX -->|"on_failure"| RELEASE_FAIL
RULE --> RULECHECK
RULECHECK -->|"missing"| RULE
RULECHECK -->|"present"| RULE
%% CLASS ASSIGNMENTS %%
class START terminal;
class ESCALATE,RELEASE_FAIL terminal;
class REVIEW,AUDIT,OPEN_PR,CI_FIX handler;
class DRY,TEST,MERGE,PUSH,CI_WATCH phase;
class VERDICT stateNode;
class REMEDIATE detector;
class RULE,RULECHECK output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | START, ESCALATE_STOP, RELEASE_ISSUE_FAILURE |
| Orange | Handler | ● Modified advisory steps (review, audit_impl, open_pr_step, ci_conflict_fix) |
| Purple | Phase | Normal execution steps (dry_walkthrough, test, merge, push, ci_watch) |
| Teal | State | Decision/routing nodes |
| Red | Detector | Remediation loop |
| Dark Teal | Output | Semantic rule nodes |
### Error/Resilience Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
subgraph SemanticGate ["● Semantic Validation Gate (rules_worktree.py)"]
direction LR
RULE["● advisory-step-missing-context-limit<br/>━━━━━━━━━━<br/>run_skill + skip_when_false<br/>but no on_context_limit → WARNING"]
RCHECK{"on_context_limit<br/>declared?"}
RULE_OK["Rule passes<br/>━━━━━━━━━━<br/>step is compliant"]
end
subgraph ReviewResilience ["● review step (skip_when_false: review_approach, retries: 1)"]
direction TB
REVIEW_RUN["run_skill: review-approach<br/>━━━━━━━━━━<br/>advisory; can be skipped"]
REVIEW_STALE["context exhausted / stale"]
REVIEW_RETRY{"retries<br/>remaining?"}
REVIEW_SKIP["● on_context_limit: dry_walkthrough<br/>━━━━━━━━━━<br/>skip to next step (safe)"]
end
subgraph AuditResilience ["● audit_impl step (skip_when_false: inputs.audit) — merge gate"]
direction TB
AUDIT_RUN["run_skill: audit-impl<br/>━━━━━━━━━━<br/>merge gate; GO/NO GO verdict"]
AUDIT_FAIL["context exhausted<br/>━━━━━━━━━━<br/>verdict unavailable"]
AUDIT_LIMIT["● on_context_limit: escalate_stop<br/>━━━━━━━━━━<br/>abort — unapproved merge unsafe"]
end
subgraph PRResilience ["● open_pr_step + ci_conflict_fix (skip_when_false guarded)"]
direction TB
OPENPR_RUN["run_skill: open-pr<br/>━━━━━━━━━━<br/>skip_when_false: inputs.open_pr"]
OPENPR_LIMIT["● on_context_limit: release_issue_failure<br/>━━━━━━━━━━<br/>PR state unknown; safe release"]
CIFIX_RUN["run_skill: ci-conflict-fix<br/>━━━━━━━━━━<br/>skip_when_false: detect_ci_conflicts<br/>retries: 1"]
CIFIX_LIMIT["● on_context_limit: release_issue_failure<br/>━━━━━━━━━━<br/>incomplete fix; push unsafe"]
end
T_SKIP([dry_walkthrough — continue])
T_ABORT([ESCALATE_STOP])
T_RELEASE([RELEASE_ISSUE_FAILURE])
%% SEMANTIC GATE %%
RULE --> RCHECK
RCHECK -->|"missing"| RULE
RCHECK -->|"present"| RULE_OK
%% REVIEW RESILIENCE %%
REVIEW_RUN -->|"stale / context limit"| REVIEW_STALE
REVIEW_STALE --> REVIEW_RETRY
REVIEW_RETRY -->|"yes (retries: 1)"| REVIEW_RUN
REVIEW_RETRY -->|"exhausted"| REVIEW_SKIP
REVIEW_SKIP --> T_SKIP
%% AUDIT RESILIENCE %%
AUDIT_RUN -->|"context exhausted"| AUDIT_FAIL
AUDIT_FAIL --> AUDIT_LIMIT
AUDIT_LIMIT --> T_ABORT
%% PR RESILIENCE %%
OPENPR_RUN -->|"context exhausted"| OPENPR_LIMIT
OPENPR_LIMIT --> T_RELEASE
CIFIX_RUN -->|"context exhausted<br/>(after retries)"| CIFIX_LIMIT
CIFIX_LIMIT --> T_RELEASE
%% CLASS ASSIGNMENTS %%
class RULE,RCHECK detector;
class RULE_OK output;
class REVIEW_RUN,AUDIT_RUN,OPENPR_RUN,CIFIX_RUN handler;
class REVIEW_STALE,AUDIT_FAIL gap;
class REVIEW_RETRY stateNode;
class REVIEW_SKIP,AUDIT_LIMIT,OPENPR_LIMIT,CIFIX_LIMIT output;
class T_SKIP,T_ABORT,T_RELEASE terminal;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Red | Semantic Gate | Validation rule that detects missing on_context_limit |
| Dark Teal | Output | Recovery routing destinations (● new on_context_limit routes) |
| Orange | Handler | Advisory run_skill steps that were modified |
| Yellow | Failed | Context-exhaustion failure states |
| Teal | Circuit | Retry-remaining decision |
| Dark Blue | Terminal | Final routing destinations |
Closes #481
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260322-120142-791819/temp/rectify/rectify_review-step-context-limit-routing_2026-03-22_120142_part_b.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
Part A introduced `BackgroundTaskSupervisor` to supervise the
`report_bug` background task. Part B hardens the broader architecture so
the class of bugs — **unmonitored fire-and-forget tasks** and
**unenforced "Never raises" contracts** — is impossible to reintroduce
without tests failing.
Two structural safeguards:
1. **Extend the anyio migration arch test to cover `server/`** — adds
`asyncio.create_task` to the list of banned primitives in server code,
so any future fire-and-forget introduction breaks CI immediately.
2. **Structural "Never raises" contract enforcement** — adds an arch
test that finds all functions claiming "Never raises" in their
docstrings and asserts they have a top-level `try:/except Exception:`
block. Applies the missing guard to `_file_or_update_github_issue`, the
other unenforced "Never raises" function in `tools_github.py`.
## Architecture Impact
### Concurrency Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
START([report_bug called])
subgraph EventLoop ["ASYNCIO EVENT LOOP (Single Thread)"]
direction TB
GATE["● Gate Check<br/>━━━━━━━━━━<br/>_require_enabled()"]
SEV{"severity?"}
subgraph BlockingPath ["BLOCKING PATH"]
direction TB
BLOCK_AWAIT["await _run_report_session()<br/>━━━━━━━━━━<br/>Runs inline, caller waits"]
end
subgraph NonBlockingPath ["NON-BLOCKING PATH"]
direction TB
STATUS_FILE["Write status.json<br/>━━━━━━━━━━<br/>status: pending"]
SUBMIT["● tool_ctx.background.submit()<br/>━━━━━━━━━━<br/>Schedules coroutine as Task"]
RETURN_DISP["Return 'dispatched'<br/>━━━━━━━━━━<br/>Caller unblocked immediately"]
end
end
subgraph Supervisor ["★ BackgroundTaskSupervisor (pipeline/background.py)"]
direction TB
CREATE_TASK["asyncio.create_task(_supervise_task)<br/>━━━━━━━━━━<br/>Task added to _tasks set"]
DONE_CB["task.add_done_callback<br/>━━━━━━━━━━<br/>_tasks.discard — auto-cleanup"]
SUPERVISE["_supervise_task(coro)<br/>━━━━━━━━━━<br/>Wraps coro in try/except"]
end
subgraph BackgroundTask ["BACKGROUND TASK (asyncio.Task — runs concurrently)"]
direction TB
RUN_CORO["await _run_report_session()<br/>━━━━━━━━━━<br/>Headless Claude session"]
EXC_CHECK{"outcome?"}
SUCCESS_PATH["Return result dict<br/>━━━━━━━━━━<br/>Normal completion"]
CANCEL_PATH["Write 'cancelled'<br/>━━━━━━━━━━<br/>Re-raise CancelledError"]
FAIL_PATH["Log error<br/>━━━━━━━━━━<br/>Write 'failed' status.json<br/>Record to AuditStore<br/>Call on_exception callback"]
end
subgraph SharedState ["SHARED STATE (thread-safe via asyncio)"]
direction TB
TASKS_SET["★ _tasks: set[asyncio.Task]<br/>━━━━━━━━━━<br/>Protected by event loop GIL"]
AUDIT_LOG["● AuditStore<br/>━━━━━━━━━━<br/>record_failure() on exception"]
STATUS_JSON["★ status.json files<br/>━━━━━━━━━━<br/>atomic_write — never races"]
end
DRAIN["drain() — asyncio.gather(*_tasks)<br/>━━━━━━━━━━<br/>Await all → tests / shutdown"]
START --> GATE
GATE --> SEV
SEV -->|"blocking"| BLOCK_AWAIT
SEV -->|"non_blocking"| STATUS_FILE
STATUS_FILE --> SUBMIT
SUBMIT --> RETURN_DISP
SUBMIT --> CREATE_TASK
CREATE_TASK --> DONE_CB
CREATE_TASK --> SUPERVISE
DONE_CB -.->|"on done"| TASKS_SET
SUPERVISE --> RUN_CORO
RUN_CORO --> EXC_CHECK
EXC_CHECK -->|"success"| SUCCESS_PATH
EXC_CHECK -->|"CancelledError"| CANCEL_PATH
EXC_CHECK -->|"Exception"| FAIL_PATH
CREATE_TASK --> TASKS_SET
FAIL_PATH --> AUDIT_LOG
FAIL_PATH --> STATUS_JSON
CANCEL_PATH --> STATUS_JSON
BLOCK_AWAIT --> STATUS_JSON
TASKS_SET --> DRAIN
class START terminal;
class GATE handler;
class SEV stateNode;
class BLOCK_AWAIT phase;
class STATUS_FILE,RETURN_DISP output;
class SUBMIT handler;
class CREATE_TASK,DONE_CB,SUPERVISE newComponent;
class RUN_CORO phase;
class EXC_CHECK detector;
class SUCCESS_PATH output;
class CANCEL_PATH,FAIL_PATH detector;
class TASKS_SET,AUDIT_LOG,STATUS_JSON newComponent;
class DRAIN stateNode;
```
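The supervision pattern in the diagram (task set, done-callback cleanup, drain via `gather`) can be sketched as follows. This is a minimal illustration with a simplified API; the real `BackgroundTaskSupervisor` in `pipeline/background.py` also writes `status.json`, re-raises `CancelledError`, and records failures to the `AuditStore`.

```python
import asyncio


class BackgroundTaskSupervisor:
    """Minimal sketch: every submitted coroutine runs inside a wrapper
    that catches exceptions, and live tasks are tracked for drain()."""

    def __init__(self) -> None:
        self._tasks: set[asyncio.Task] = set()

    def submit(self, coro, on_exception=None) -> asyncio.Task:
        task = asyncio.create_task(self._supervise(coro, on_exception))
        self._tasks.add(task)
        task.add_done_callback(self._tasks.discard)  # auto-cleanup on done
        return task

    async def _supervise(self, coro, on_exception):
        try:
            return await coro
        except asyncio.CancelledError:
            raise  # cancellation propagates normally
        except Exception as exc:
            if on_exception is not None:
                on_exception(exc)  # failure is observed, never silently lost

    async def drain(self) -> None:
        """Await all in-flight tasks (used by tests and shutdown)."""
        await asyncio.gather(*self._tasks, return_exceptions=True)
```

The done-callback keeps `_tasks` from growing without bound, while the wrapper guarantees a crashing background coroutine is reported instead of dying as an unobserved task exception.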
### Error/Resilience Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
CALL_START([report_bug tool called])
subgraph ArchGuards ["★ ARCH TEST GUARDS (CI Prevention Layer)"]
direction LR
ARCH1["★ test_never_raises_contracts<br/>━━━━━━━━━━<br/>AST-scans server/ for 'Never raises'<br/>functions without top-level try/except<br/>→ fails CI on violation"]
ARCH2["● test_server_has_no_asyncio_create_task<br/>━━━━━━━━━━<br/>Bans asyncio.create_task in server/<br/>→ fails CI if fire-and-forget reintroduced"]
end
subgraph SessionLayer ["_run_report_session() — EXISTING GUARD"]
direction TB
RUN_TRY["try: (top-level)<br/>━━━━━━━━━━<br/>Entire session execution wrapped"]
SESSION_OK["Return result dict<br/>━━━━━━━━━━<br/>success=True"]
SESSION_EXC["except Exception<br/>━━━━━━━━━━<br/>logger.error() + write status 'failed'<br/>Return {success: False}"]
end
subgraph GitHubLayer ["● _file_or_update_github_issue() — NEW GUARD"]
direction TB
GH_TRY["try: (top-level — ● NEW)<br/>━━━━━━━━━━<br/>All GitHub API calls wrapped"]
GH_CONFIG["Check default_repo config<br/>━━━━━━━━━━<br/>Returns {skipped} if not set"]
GH_SEARCH["search_issues()<br/>━━━━━━━━━━<br/>GitHubFetcher — never raises"]
GH_ACT{"duplicate?"}
GH_COMMENT["add_comment()"]
GH_CREATE["create_issue()"]
GH_EXC["except Exception (● NEW)<br/>━━━━━━━━━━<br/>logger.error()<br/>Return {skipped: True, reason: ...}"]
end
subgraph SupervisorLayer ["★ BackgroundTaskSupervisor._supervise_task()"]
direction TB
SUP_AWAIT["await coro (_run_report_session)<br/>━━━━━━━━━━<br/>Background execution"]
SUP_CANCEL["except CancelledError<br/>━━━━━━━━━━<br/>Write 'cancelled' status.json<br/>Re-raise (propagates normally)"]
SUP_EXC["except Exception<br/>━━━━━━━━━━<br/>logger.error()<br/>Write 'failed' status.json<br/>AuditStore.record_failure()<br/>on_exception callback"]
end
subgraph StatusFiles ["STATUS FILE OUTPUTS"]
direction LR
STATUS_CANCELLED["status.json<br/>{'status': 'cancelled'}"]
STATUS_FAILED["status.json<br/>{'status': 'failed', 'error': ...}"]
STATUS_SUCCESS["status.json<br/>{'status': 'success'}"]
end
AUDIT["● AuditStore<br/>━━━━━━━━━━<br/>record_failure(subtype='background_exception')"]
CALL_START --> RUN_TRY
RUN_TRY --> SESSION_OK
RUN_TRY --> GH_TRY
RUN_TRY -->|"Exception"| SESSION_EXC
SESSION_EXC --> STATUS_FAILED
GH_TRY --> GH_CONFIG
GH_CONFIG --> GH_SEARCH
GH_SEARCH --> GH_ACT
GH_ACT -->|"yes"| GH_COMMENT
GH_ACT -->|"no"| GH_CREATE
GH_TRY -->|"Exception (NEW)"| GH_EXC
SUP_AWAIT --> SESSION_OK
SUP_AWAIT -->|"CancelledError"| SUP_CANCEL
SUP_AWAIT -->|"Exception"| SUP_EXC
SUP_CANCEL --> STATUS_CANCELLED
SUP_EXC --> STATUS_FAILED
SUP_EXC --> AUDIT
RUN_TRY -.->|"called by"| SUP_AWAIT
class CALL_START terminal;
class ARCH1 newComponent;
class ARCH2 handler;
class RUN_TRY,SESSION_OK phase;
class SESSION_EXC detector;
class GH_TRY,GH_CONFIG,GH_SEARCH phase;
class GH_ACT stateNode;
class GH_COMMENT,GH_CREATE handler;
class GH_EXC detector;
class SUP_AWAIT phase;
class SUP_CANCEL,SUP_EXC detector;
class STATUS_CANCELLED,STATUS_FAILED,STATUS_SUCCESS output;
class AUDIT newComponent;
```
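The guard added to `_file_or_update_github_issue` follows the top-level try/except shape shown in the diagram: every failure mode degrades to a `{skipped: ...}` result instead of propagating. A hedged sketch, with the fetcher API and return shapes invented for illustration:

```python
import logging

logger = logging.getLogger(__name__)


async def file_or_update_issue(fetcher, fingerprint: str, body: str) -> dict:
    """File or update a GitHub issue for a bug report. Never raises.

    Illustrative sketch only: the real function, fetcher methods, and
    result dict differ; what matters is the top-level guard shape.
    """
    try:
        existing = await fetcher.search_issues(fingerprint)
        if existing:
            await fetcher.add_comment(existing[0], body)
            return {"updated": existing[0]}
        return {"created": await fetcher.create_issue(fingerprint, body)}
    except Exception as exc:  # top-level guard: degrade, never propagate
        logger.error("issue filing failed: %s", exc)
        return {"skipped": True, "reason": str(exc)}
```

Because the guard wraps the entire body, the AST-based arch test can verify the "Never raises" docstring claim structurally rather than by convention.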
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
START([report_bug called])
DONE_BLOCK([COMPLETE — blocking])
DONE_NBCK([DISPATCHED — non_blocking])
DONE_GATE([ERROR — kitchen closed])
subgraph Validation ["ENTRY & VALIDATION"]
direction TB
GATE["_require_enabled()<br/>━━━━━━━━━━<br/>Kitchen gate check"]
EXEC_CHK{"executor<br/>configured?"}
CTX_SETUP["Resolve config + paths<br/>━━━━━━━━━━<br/>report_dir, report_path,<br/>skill_command, write_spec"]
end
subgraph Routing ["SEVERITY ROUTING"]
direction TB
SEV{"severity?"}
end
subgraph BlockingPath ["BLOCKING PATH"]
direction TB
B_AWAIT["await _run_report_session()<br/>━━━━━━━━━━<br/>Caller waits for full result"]
B_NOTIFY{"success?"}
end
subgraph NonBlockingPath ["● NON-BLOCKING PATH"]
direction TB
NB_STATUS["Write status.json<br/>━━━━━━━━━━<br/>{status: 'pending',<br/> dispatched_at: ...}"]
NB_SUBMIT["● tool_ctx.background.submit()<br/>━━━━━━━━━━<br/>Schedules coroutine as supervised Task<br/>status_path, label='report_bug'"]
NB_RETURN["Return immediately<br/>━━━━━━━━━━<br/>{success: true,<br/> status: 'dispatched',<br/> report_path: ...}"]
end
subgraph SessionExec ["_run_report_session() — ASYNC COROUTINE"]
direction TB
EXEC_RUN["executor.run(skill_command, cwd)<br/>━━━━━━━━━━<br/>Headless Claude session"]
READ_RPT["Read report.md + parse fingerprint<br/>━━━━━━━━━━<br/>Extract dedup fingerprint"]
GH_CHK{"github_filing<br/>+ has_token?"}
GH_FILE["await _file_or_update_github_issue()"]
SKIP_GH["github = {skipped: True, reason: no_token}"]
end
subgraph GitHubFiling ["● _file_or_update_github_issue()"]
direction TB
CFG_CHK{"default_repo<br/>configured?"}
SEARCH["search_issues(fingerprint)"]
DUP_CHK{"duplicate<br/>found?"}
DUP_BODY{"error_context<br/>already in body?"}
ADD_CMT["add_comment()"]
CRT_ISS["create_issue()"]
SKIP_CFG["Return {skipped: True}"]
GH_OUT["Return result dict"]
end
START --> GATE
GATE -->|"disabled"| DONE_GATE
GATE -->|"enabled"| EXEC_CHK
EXEC_CHK -->|"no"| DONE_GATE
EXEC_CHK -->|"yes"| CTX_SETUP
CTX_SETUP --> SEV
SEV -->|"blocking"| B_AWAIT
SEV -->|"non_blocking"| NB_STATUS
B_AWAIT --> B_NOTIFY
B_NOTIFY -->|"yes"| DONE_BLOCK
B_NOTIFY -->|"no"| DONE_BLOCK
NB_STATUS --> NB_SUBMIT
NB_SUBMIT --> NB_RETURN
NB_RETURN --> DONE_NBCK
NB_SUBMIT -.->|"async (background)"| EXEC_RUN
EXEC_RUN --> READ_RPT
READ_RPT --> GH_CHK
GH_CHK -->|"yes"| GH_FILE
GH_CHK -->|"no"| SKIP_GH
GH_FILE --> CFG_CHK
CFG_CHK -->|"not set"| SKIP_CFG
CFG_CHK -->|"set"| SEARCH
SEARCH --> DUP_CHK
DUP_CHK -->|"yes"| DUP_BODY
DUP_CHK -->|"no"| CRT_ISS
DUP_BODY -->|"already present"| GH_OUT
DUP_BODY -->|"new occurrence"| ADD_CMT
ADD_CMT --> GH_OUT
CRT_ISS --> GH_OUT
SKIP_CFG --> GH_OUT
class START terminal;
class DONE_BLOCK,DONE_NBCK,DONE_GATE terminal;
class GATE detector;
class EXEC_CHK stateNode;
class CTX_SETUP phase;
class SEV stateNode;
class B_AWAIT phase;
class B_NOTIFY stateNode;
class NB_STATUS output;
class NB_SUBMIT newComponent;
class NB_RETURN output;
class EXEC_RUN handler;
class READ_RPT phase;
class GH_CHK stateNode;
class GH_FILE handler;
class SKIP_GH output;
class CFG_CHK stateNode;
class SEARCH handler;
class DUP_CHK,DUP_BODY stateNode;
class ADD_CMT,CRT_ISS handler;
class GH_OUT,SKIP_CFG output;
```
Closes #480
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260322-120142-276410/temp/rectify/rectify_non_blocking_dispatch_immunity_2026-03-22_122500_part_b.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
Part A addressed the execution engine: when a skill writes a file but
omits the structured output token, the system can now recover using tool
call evidence (`_synthesize_from_write_artifacts`) and promotes the
result to `RETRIABLE(CONTRACT_RECOVERY)` instead of abandoning with a
terminal failure. Part B addresses the source: SKILL.md instruction
quality across 20+ path-capture skills. Two compounding defects caused
models to intermittently omit the structured output token — late
instruction positioning (token requirement only in `## Output`, not `##
Critical Constraints`) and a relative/absolute path contradiction
between the save instruction and the contract regex. This PR establishes
the "Concrete Token Instruction" canonical pattern in Critical
Constraints for every affected skill and adds a static CI test that
prevents regression as new skills are added. Together, Part A (recovery
when the model still fails) and Part B (reduced failure rate from
improved instructions) provide defense-in-depth for structured output
compliance.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([run_skill invoked])
SUCCEEDED([SkillResult: SUCCEEDED])
RETRIABLE([SkillResult: RETRIABLE])
FAILED([SkillResult: FAILED])
subgraph Parsing ["Phase 1 — NDJSON Parsing"]
direction TB
Parse["● parse_session_result<br/>━━━━━━━━━━<br/>Scan stdout NDJSON<br/>Accumulate tool_uses + messages"]
CSR["ClaudeSessionResult<br/>━━━━━━━━━━<br/>result, subtype, is_error<br/>tool_uses, assistant_messages<br/>write_call_count"]
end
subgraph Recovery ["Phase 2 — Recovery Chain"]
direction TB
RecA["Recovery A<br/>━━━━━━━━━━<br/>_recover_from_separate_marker<br/>Standalone %%ORDER_UP%% → join messages"]
RecB["● Recovery B<br/>━━━━━━━━━━<br/>_recover_block_from_assistant_messages<br/>Channel confirmed + patterns missing?<br/>Scan assistant_messages for tokens"]
RecC["● Recovery C (NEW)<br/>━━━━━━━━━━<br/>_synthesize_from_write_artifacts<br/>write_count≥1 + patterns still missing?<br/>Synthesize token from Write tool_use file_path"]
end
subgraph Outcome ["Phase 3 — Outcome Computation"]
direction TB
CompS["● _compute_success<br/>━━━━━━━━━━<br/>CHANNEL_B bypass gate<br/>TerminationReason dispatch<br/>_check_session_content"]
CompR["_compute_retry<br/>━━━━━━━━━━<br/>context_exhausted → RESUME<br/>kill_anomaly → RESUME<br/>marker absent → EARLY_STOP"]
ContradictionGuard{"Contradiction<br/>Guard<br/>success ∧ retry?"}
DeadEnd{"Dead-End<br/>Guard<br/>¬success ∧ ¬retry<br/>+ channel confirmed?"}
ContentEval["● _evaluate_content_state<br/>━━━━━━━━━━<br/>COMPLETE / ABSENT<br/>CONTRACT_VIOLATION / SESSION_ERROR"]
end
subgraph PostProcess ["Phase 4 — Post-Processing"]
direction TB
NormSub["● _normalize_subtype<br/>━━━━━━━━━━<br/>Resolve CLI vs adjudicated contradiction<br/>→ adjudicated_failure / empty_result / etc."]
BudgetG1["_apply_budget_guard (pass 1)<br/>━━━━━━━━━━<br/>Consecutive failures > max?<br/>Override needs_retry=False"]
CRGate["● CONTRACT_RECOVERY gate<br/>━━━━━━━━━━<br/>adjudicated_failure + write_count≥1?<br/>Promote to RETRIABLE(CONTRACT_RECOVERY)"]
BudgetG2["_apply_budget_guard (pass 2)<br/>━━━━━━━━━━<br/>Cap CONTRACT_RECOVERY retries<br/>→ BUDGET_EXHAUSTED"]
ZeroWrite["Zero-Write Gate<br/>━━━━━━━━━━<br/>success + write=0 + expected?<br/>Demote to RETRIABLE(ZERO_WRITES)"]
end
%% MAIN FLOW %%
START --> Parse
Parse --> CSR
CSR --> RecA
RecA -->|"completion_marker configured"| RecB
RecA -->|"no marker config"| RecB
RecB -->|"channel confirmed + patterns found in messages"| RecC
RecB -->|"patterns not in messages"| RecC
RecC -->|"write evidence + path-token patterns → synthesize tokens"| CompS
RecC -->|"no write evidence or non-path patterns"| CompS
CompS --> CompR
CompR --> ContradictionGuard
ContradictionGuard -->|"success=True AND retry=True<br/>demote success"| DeadEnd
ContradictionGuard -->|"no contradiction"| DeadEnd
DeadEnd -->|"¬success ∧ ¬retry ∧ channel confirmed"| ContentEval
DeadEnd -->|"otherwise"| NormSub
ContentEval -->|"ABSENT → DRAIN_RACE<br/>promote to RETRIABLE"| NormSub
ContentEval -->|"CONTRACT_VIOLATION<br/>SESSION_ERROR → FAILED"| NormSub
NormSub --> BudgetG1
BudgetG1 -->|"needs_retry=True → budget check"| CRGate
BudgetG1 -->|"budget exhausted → BUDGET_EXHAUSTED"| FAILED
CRGate -->|"adjudicated_failure + write_count≥1<br/>→ needs_retry=True, CONTRACT_RECOVERY"| BudgetG2
CRGate -->|"conditions not met"| ZeroWrite
BudgetG2 -->|"budget not exhausted"| ZeroWrite
BudgetG2 -->|"budget exhausted"| FAILED
ZeroWrite -->|"success + write=0 + expected"| RETRIABLE
ZeroWrite -->|"all gates passed"| SUCCEEDED
ZeroWrite -->|"success=False, no retry"| FAILED
ZeroWrite -->|"needs_retry=True"| RETRIABLE
%% CLASS ASSIGNMENTS %%
class START terminal;
class SUCCEEDED,RETRIABLE,FAILED terminal;
class Parse,RecA handler;
class CSR stateNode;
class RecB,RecC newComponent;
class CompS,CompR phase;
class ContradictionGuard,DeadEnd,ContentEval detector;
class NormSub,BudgetG1,BudgetG2,ZeroWrite handler;
class CRGate newComponent;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start and result states (SUCCEEDED / RETRIABLE / FAILED) |
| Orange | Handler | Processing nodes (parse, normalize, budget guard) |
| Teal | State | ClaudeSessionResult data container |
| Purple | Phase | Outcome computation nodes |
| Green | New/Modified | Nodes changed in this PR (● Recovery B, ● Recovery C, ● CONTRACT_RECOVERY gate) |
| Red | Detector | Validation and guard nodes (dead-end guard, content evaluation) |
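The Phase 4 gate sequence above can be illustrated with a minimal sketch. The `Outcome` dataclass, function names, and the `max_retries=3` threshold are hypothetical simplifications of the diagram, not the project's actual API:

```python
from dataclasses import dataclass

@dataclass
class Outcome:
    success: bool
    needs_retry: bool
    subtype: str
    write_count: int
    consecutive_failures: int

def apply_budget_guard(o: Outcome, max_retries: int = 3) -> Outcome:
    # Circuit breaker: too many consecutive failures ends the retry loop.
    if o.needs_retry and o.consecutive_failures > max_retries:
        o.needs_retry = False
        o.subtype = "BUDGET_EXHAUSTED"
    return o

def contract_recovery_gate(o: Outcome) -> Outcome:
    # Promote adjudicated failures with write evidence to a retriable state.
    if o.subtype == "adjudicated_failure" and o.write_count >= 1:
        o.needs_retry = True
        o.subtype = "CONTRACT_RECOVERY"
    return o

o = Outcome(False, False, "adjudicated_failure", write_count=2, consecutive_failures=1)
o = apply_budget_guard(o)       # pass 1: no retry pending, no-op
o = contract_recovery_gate(o)   # promotes to CONTRACT_RECOVERY
o = apply_budget_guard(o)       # pass 2: caps CONTRACT_RECOVERY retries
print(o.subtype, o.needs_retry) # CONTRACT_RECOVERY True
```

The two budget-guard passes bracket the gate so that a promotion in the gate is still subject to the retry cap.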
### Error Resilience Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
T_COMPLETE([SUCCEEDED])
T_RETRIABLE([RETRIABLE])
T_FAILED([FAILED — terminal])
subgraph Prevention ["PREVENTION — Part B: SKILL.md Instruction Hardening"]
direction TB
SKILLMd["● SKILL.md<br/>━━━━━━━━━━<br/>Token instruction moved to<br/>Critical Constraints section<br/>Absolute path example given"]
StaticTest["● test_skill_output_compliance.py<br/>━━━━━━━━━━<br/>Static regex: token instruction<br/>must appear inside ## Critical Constraints<br/>Catches regression as new skills added"]
Contracts["● skill_contracts.yaml<br/>━━━━━━━━━━<br/>setup-project contract removed<br/>(no emit instruction existed)"]
end
subgraph Detection ["DETECTION — Contract Violation Recognition"]
direction TB
PatternCheck["● _check_expected_patterns<br/>━━━━━━━━━━<br/>Normalize bold markdown<br/>AND-match all regex patterns<br/>vs session.result"]
ContentEval["● _evaluate_content_state<br/>━━━━━━━━━━<br/>COMPLETE / ABSENT<br/>CONTRACT_VIOLATION / SESSION_ERROR"]
DeadEnd{"Dead-End Guard<br/>━━━━━━━━━━<br/>¬success ∧ ¬retry<br/>+ channel confirmed?"}
end
subgraph RecoveryChain ["RECOVERY CHAIN — Three-Stage Fallback"]
direction TB
RecA["Recovery A: Separate Marker<br/>━━━━━━━━━━<br/>Standalone %%ORDER_UP%% message<br/>→ join assistant_messages"]
RecB["● Recovery B: Assistant Messages<br/>━━━━━━━━━━<br/>Channel confirmed + patterns missing<br/>→ scan all assistant_messages<br/>(drain-race artifact fix)"]
RecC["● Recovery C: Artifact Synthesis (NEW)<br/>━━━━━━━━━━<br/>write_count≥1 + patterns still absent<br/>→ scan tool_uses for Write file_path<br/>→ synthesize token = /abs/path"]
end
subgraph CircuitBreakers ["CIRCUIT BREAKERS — Retry Caps"]
direction TB
BudgetG1["_apply_budget_guard (pass 1)<br/>━━━━━━━━━━<br/>consecutive failures > max_consecutive_retries<br/>→ BUDGET_EXHAUSTED, needs_retry=False"]
CRGate["● CONTRACT_RECOVERY Gate (NEW)<br/>━━━━━━━━━━<br/>adjudicated_failure + write_count≥1<br/>→ promote to RETRIABLE(CONTRACT_RECOVERY)"]
BudgetG2["_apply_budget_guard (pass 2)<br/>━━━━━━━━━━<br/>Caps CONTRACT_RECOVERY retries<br/>(prevents infinite loop)"]
DrainRace["Dead-End Guard → DRAIN_RACE<br/>━━━━━━━━━━<br/>ABSENT state: channel confirmed completion<br/>but result empty → transient → retry"]
end
%% PREVENTION → DETECTION %%
SKILLMd -->|"reduced omission rate"| PatternCheck
StaticTest -->|"regression guard"| SKILLMd
Contracts -->|"removes false positives"| PatternCheck
%% DETECTION %%
PatternCheck -->|"patterns match"| T_COMPLETE
PatternCheck -->|"patterns absent"| ContentEval
ContentEval --> DeadEnd
%% RECOVERY CHAIN (pre-detection) %%
RecA -->|"token found in messages"| PatternCheck
RecA -->|"not found"| RecB
RecB -->|"token found in assistant_messages"| PatternCheck
RecB -->|"not found"| RecC
RecC -->|"synthesized token → updated result"| PatternCheck
RecC -->|"no write evidence"| PatternCheck
%% DEAD-END ROUTING %%
DeadEnd -->|"ABSENT → drain-race"| DrainRace
DeadEnd -->|"CONTRACT_VIOLATION"| BudgetG1
DeadEnd -->|"SESSION_ERROR"| T_FAILED
%% CIRCUIT BREAKERS %%
DrainRace -->|"RETRIABLE(DRAIN_RACE)"| T_RETRIABLE
BudgetG1 -->|"budget not exhausted"| CRGate
BudgetG1 -->|"budget exhausted"| T_FAILED
CRGate -->|"write_count≥1 → RETRIABLE(CONTRACT_RECOVERY)"| BudgetG2
CRGate -->|"no write evidence → terminal"| T_FAILED
BudgetG2 -->|"budget not exhausted"| T_RETRIABLE
BudgetG2 -->|"budget exhausted → BUDGET_EXHAUSTED"| T_FAILED
%% CLASS ASSIGNMENTS %%
class T_COMPLETE,T_RETRIABLE,T_FAILED terminal;
class SKILLMd,Contracts newComponent;
class StaticTest newComponent;
class PatternCheck,ContentEval detector;
class DeadEnd stateNode;
class RecA handler;
class RecB,RecC newComponent;
class BudgetG1,BudgetG2 phase;
class CRGate newComponent;
class DrainRace output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Final states: SUCCEEDED, RETRIABLE, FAILED |
| Green | New/Modified | Components changed in this PR |
| Red | Detector | Pattern matching and content state evaluation |
| Teal | State | Dead-end guard decision node |
| Orange | Handler | Recovery A (existing) |
| Purple | Phase | Budget guard passes |
| Dark Teal | Recovery | Drain-race promotion |
### State Lifecycle Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
T_OK([Contract Satisfied])
T_RETRY([Contract: Retry Eligible])
T_VIOLATED([Contract Violated — Terminal])
subgraph ContractDef ["CONTRACT DEFINITION LAYER"]
direction LR
SkillContracts["● skill_contracts.yaml<br/>━━━━━━━━━━<br/>expected_output_patterns<br/>completion_marker<br/>write_behavior<br/>setup-project contract removed"]
SkillMD["● SKILL.md<br/>━━━━━━━━━━<br/>Critical Constraints section<br/>Concrete token instruction<br/>Absolute path example<br/>(20+ skills updated)"]
StaticTest["● test_skill_output_compliance.py<br/>━━━━━━━━━━<br/>CI gate: token instruction<br/>must be in ## Critical Constraints<br/>Regex: r'## Critical Constraints.*plan_path\\s*='<br/>Covers all path-capture skills"]
end
subgraph ModelExecution ["MODEL EXECUTION — Headless Session"]
direction TB
WriteArtifact["Model writes artifact<br/>━━━━━━━━━━<br/>Write tool call<br/>file_path → disk<br/>write_call_count += 1"]
EmitToken{"● Emits structured token?<br/>━━━━━━━━━━<br/>plan_path = /abs/path<br/>or investigation_path = ...<br/>or diagram_path = ..."}
end
subgraph RuntimeRecovery ["RUNTIME RECOVERY — Three-Stage Chain"]
direction TB
RecovB["● Recovery B<br/>━━━━━━━━━━<br/>Scan assistant_messages<br/>for token in JSONL stream<br/>(drain-race: stdout not flushed)"]
RecovC["● Recovery C (NEW)<br/>━━━━━━━━━━<br/>Scan tool_uses for Write.file_path<br/>Synthesize: token_name = file_path<br/>Only for path-capture patterns"]
Synthesized["● Synthesized contract token<br/>━━━━━━━━━━<br/>plan_path = /abs/path/plan.md<br/>(from Write tool_use metadata)<br/>Prepended to session.result"]
end
subgraph ContentStateEval ["CONTENT STATE EVALUATION — session.py"]
direction TB
MarkerCheck{"Completion marker<br/>━━━━━━━━━━<br/>%%ORDER_UP%% present<br/>in session.result?"}
PatternCheck{"● Patterns match?<br/>━━━━━━━━━━<br/>_check_expected_patterns<br/>AND-match all regexes<br/>normalize bold markup"}
StateDecide["● _evaluate_content_state<br/>━━━━━━━━━━<br/>COMPLETE / ABSENT<br/>CONTRACT_VIOLATION<br/>SESSION_ERROR"]
end
subgraph ContractGates ["CONTRACT GATES — Dead-End Guard"]
direction TB
AbsentGate{"ContentState<br/>ABSENT?<br/>━━━━━━━━━━<br/>result empty or<br/>marker missing"}
CVGate{"ContentState<br/>CONTRACT_VIOLATION?<br/>━━━━━━━━━━<br/>marker present<br/>patterns failed"}
WriteEvidence{"● Write evidence?<br/>━━━━━━━━━━<br/>write_call_count ≥ 1<br/>AND adjudicated_failure"}
end
%% CONTRACT DEFINITION FLOW %%
StaticTest -->|"CI enforces"| SkillMD
SkillMD -->|"instructs model"| EmitToken
SkillContracts -->|"defines patterns"| PatternCheck
%% MODEL EXECUTION %%
WriteArtifact --> EmitToken
EmitToken -->|"YES — token emitted"| PatternCheck
EmitToken -->|"NO — token omitted"| RecovB
%% RECOVERY %%
RecovB -->|"found in messages"| PatternCheck
RecovB -->|"not found"| RecovC
RecovC -->|"Write.file_path found"| Synthesized
RecovC -->|"no write evidence"| PatternCheck
Synthesized --> PatternCheck
%% CONTENT STATE EVALUATION %%
PatternCheck -->|"all patterns match"| MarkerCheck
PatternCheck -->|"patterns absent"| StateDecide
MarkerCheck -->|"present"| T_OK
MarkerCheck -->|"absent"| StateDecide
StateDecide --> AbsentGate
AbsentGate -->|"ABSENT"| T_RETRY
AbsentGate -->|"not ABSENT"| CVGate
CVGate -->|"SESSION_ERROR"| T_VIOLATED
CVGate -->|"CONTRACT_VIOLATION"| WriteEvidence
WriteEvidence -->|"write evidence present<br/>CONTRACT_RECOVERY gate"| T_RETRY
WriteEvidence -->|"no write evidence<br/>terminal violation"| T_VIOLATED
%% OUTCOMES %%
T_OK -->|"subtype=success"| T_OK
T_RETRY -->|"DRAIN_RACE or CONTRACT_RECOVERY<br/>budget-capped by _apply_budget_guard"| T_RETRY
%% CLASS ASSIGNMENTS %%
class T_OK,T_RETRY,T_VIOLATED terminal;
class SkillContracts,SkillMD newComponent;
class StaticTest newComponent;
class WriteArtifact handler;
class EmitToken stateNode;
class RecovB,RecovC,Synthesized newComponent;
class PatternCheck,MarkerCheck detector;
class StateDecide phase;
class AbsentGate,CVGate,WriteEvidence stateNode;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Contract outcomes: Satisfied, Retry Eligible, Violated |
| Green | New/Modified | Components changed in this PR |
| Red | Detector | Pattern matching gates and marker checks |
| Purple | Phase | ContentState evaluation dispatcher |
| Teal | State | Decision nodes (marker check, content state, write evidence) |
| Orange | Handler | Model Write tool call execution |
Closes #477
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260322-120141-753065/temp/rectify/rectify_artifact-aware-contract-recovery_2026-03-22_120141_part_b.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
`format_ingredients_table` (the GFM/MCP rendering path for recipe
ingredients) computes column widths via raw `max(len(...))` with floors
but no ceilings. The result: a 220-character `run_mode` description in
`implementation.yaml` forces the GFM table into rows over 220 characters
wide, bloating every MCP response that loads that recipe.
The immunity solution: extend `core/_terminal_table.py` with
`_render_gfm_table`, accepting
the same `TerminalColumn` specs already used by the terminal path. Both
rendering paths now
share the same L0 primitive and the same column-spec source of truth.
Width capping becomes
structurally implicit — any new GFM renderer must declare
`TerminalColumn` specs and
automatically inherits the cap.
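A minimal sketch of the bounded-width idea, assuming a simplified `TerminalColumn` with only `label` and `max_width` fields (the real spec and `_render_gfm_table` signature may differ):

```python
from typing import NamedTuple

class TerminalColumn(NamedTuple):  # simplified stand-in for the real spec
    label: str
    max_width: int

def render_gfm_table(columns: list[TerminalColumn], rows: list[tuple[str, ...]]) -> str:
    # Width = longest cell or label, but never above the column's ceiling.
    widths = [
        min(max([len(r[i]) for r in rows] + [len(c.label)]), c.max_width)
        for i, c in enumerate(columns)
    ]

    def cell(text: str, w: int) -> str:
        # Over-long cells are truncated with an ellipsis instead of
        # stretching the whole column.
        return text if len(text) <= w else text[: w - 1] + "…"

    lines = [
        "| " + " | ".join(cell(c.label, w).ljust(w) for c, w in zip(columns, widths)) + " |",
        "| " + " | ".join("-" * w for w in widths) + " |",
    ]
    for r in rows:
        lines.append("| " + " | ".join(cell(v, w).ljust(w) for v, w in zip(r, widths)) + " |")
    return "\n".join(lines)

table = render_gfm_table(
    [TerminalColumn("Name", 30), TerminalColumn("Description", 60)],
    [("run_mode", "x" * 220)],  # a 220-char description no longer blows up the row
)
```

Because every GFM renderer must pass `TerminalColumn` specs, the `min(..., max_width)` cap applies structurally rather than by convention.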
## Architecture Impact
### Module Dependency Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 70, 'curve': 'basis'}}}%%
graph TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph L0 ["L0 — core/ (stdlib only, zero autoskillit imports)"]
direction LR
TT["● core/_terminal_table.py<br/>━━━━━━━━━━<br/>TerminalColumn (NamedTuple)<br/>_render_terminal_table()<br/>★ _render_gfm_table() NEW"]
INIT["● core/__init__.py<br/>━━━━━━━━━━<br/>Re-exports TerminalColumn<br/>Re-exports _render_terminal_table<br/>★ Re-exports _render_gfm_table NEW"]
end
subgraph L1 ["L1 — pipeline/"]
direction LR
TFM["pipeline/telemetry_fmt.py<br/>━━━━━━━━━━<br/>TelemetryFormatter<br/>imports: TerminalColumn<br/>imports: _render_terminal_table"]
end
subgraph L2 ["L2 — recipe/"]
direction LR
API["● recipe/_api.py<br/>━━━━━━━━━━<br/>format_ingredients_table()<br/>_GFM_INGREDIENT_COLUMNS<br/>imports: TerminalColumn<br/>★ imports: _render_gfm_table NEW"]
end
subgraph L3 ["L3 — cli/"]
direction LR
ANSI["cli/_ansi.py<br/>━━━━━━━━━━<br/>ingredients_to_terminal()<br/>imports: TerminalColumn only<br/>(inline _render_terminal_table)"]
SHIM["cli/_terminal_table.py<br/>━━━━━━━━━━<br/>Re-export shim<br/>TerminalColumn<br/>_render_terminal_table"]
end
subgraph TESTS ["Tests"]
direction LR
GUARD["★ tests/arch/test_gfm_rendering_guard.py<br/>━━━━━━━━━━<br/>Arch guard: asserts delegation<br/>and bounded column specs"]
TAPI["● tests/recipe/test_api.py<br/>━━━━━━━━━━<br/>GFM width cap behavioral tests<br/>Integration test vs real recipe"]
end
TT -->|"defines"| INIT
TFM -->|"imports TerminalColumn<br/>_render_terminal_table"| INIT
API -->|"imports TerminalColumn<br/>★ + _render_gfm_table"| INIT
ANSI -->|"imports TerminalColumn"| INIT
SHIM -->|"imports TerminalColumn<br/>_render_terminal_table"| INIT
GUARD -->|"asserts delegation<br/>+ bounded max_width"| API
GUARD -->|"verifies export surface"| INIT
TAPI -->|"behavioral tests<br/>width cap + truncation"| API
class TT,INIT stateNode;
class TFM handler;
class API phase;
class ANSI,SHIM cli;
class GUARD newComponent;
class TAPI output;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Teal | Core (L0) | `core/_terminal_table.py` and `core/__init__.py` — high fan-in primitives |
| Orange | Pipeline (L1) | `pipeline/telemetry_fmt.py` — unchanged consumer |
| Purple | Recipe (L2) | `recipe/_api.py` — key change: now imports `_render_gfm_table` |
| Dark Blue | CLI (L3) | `cli/_ansi.py` and re-export shim — unchanged consumers |
| Green (★) | New | `test_gfm_rendering_guard.py` — new arch guard |
| Dark Teal | Modified Test | `test_api.py` — new behavioral tests added |
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 55, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
START([Caller: load_and_validate<br/>or MCP tools])
subgraph FIT ["● recipe/_api.py — format_ingredients_table()"]
direction TB
G1{"ingredients<br/>non-empty?"}
BIR["build_ingredient_rows()<br/>━━━━━━━━━━<br/>Full-length tuples<br/>(name, desc, default)<br/>No truncation at data layer"]
G2{"rows<br/>non-empty?"}
DELEGATE["● delegate to _render_gfm_table()<br/>━━━━━━━━━━<br/>_GFM_INGREDIENT_COLUMNS spec:<br/>Name: max_width=30<br/>Description: max_width=60<br/>Default: max_width=20"]
end
subgraph RGT ["● core/_terminal_table.py — _render_gfm_table()"]
direction TB
WIDTHS["● Compute column widths<br/>━━━━━━━━━━<br/>col_w = min(<br/> max(cell_lengths, label_width),<br/> max_width ← BOUNDED"]
HEADER["Render header row<br/>━━━━━━━━━━<br/>| Name | Description | Default |"]
SEP["Render separator row<br/>━━━━━━━━━━<br/>| ---: | :--- | ---: |<br/>(alignment from TerminalColumn.align)"]
ROWLOOP{"For each<br/>data row"}
TRUNC{"cell length<br/>> col_w?"}
PAD["Pad cell to col_w<br/>━━━━━━━━━━<br/>f'{cell:<{col_w}}'<br/>or right-aligned"]
ELLIPSIS["Truncate + append '…'<br/>━━━━━━━━━━<br/>cell[:col_w-1] + '…'"]
EMIT["Emit row:<br/>| cell | cell | cell |"]
JOIN["Join all rows with newline<br/>━━━━━━━━━━<br/>Return GFM table string"]
end
ELIMINATED["✗ ELIMINATED: inline ad-hoc width math<br/>━━━━━━━━━━<br/>dw = max(len(r[1])...) — no ceiling<br/>f'| {desc:<{dw}} |' — uncapped padding<br/>220-char description → 220-wide column"]
NONE([Return None])
RESULT([Return GFM table string<br/>All rows ≤ 120 chars wide])
START --> G1
G1 -->|"empty"| NONE
G1 -->|"non-empty"| BIR
BIR --> G2
G2 -->|"empty"| NONE
G2 -->|"non-empty"| DELEGATE
DELEGATE --> WIDTHS
WIDTHS --> HEADER
HEADER --> SEP
SEP --> ROWLOOP
ROWLOOP -->|"next row"| TRUNC
TRUNC -->|"yes"| ELLIPSIS
TRUNC -->|"no"| PAD
ELLIPSIS --> EMIT
PAD --> EMIT
EMIT -->|"more rows"| ROWLOOP
EMIT -->|"done"| JOIN
JOIN --> RESULT
ELIMINATED -.->|"replaced by DELEGATE"| DELEGATE
class START terminal;
class NONE terminal;
class RESULT terminal;
class G1,G2 stateNode;
class BIR handler;
class DELEGATE phase;
class WIDTHS,HEADER,SEP newComponent;
class ROWLOOP stateNode;
class TRUNC stateNode;
class PAD newComponent;
class ELLIPSIS newComponent;
class EMIT newComponent;
class JOIN newComponent;
class ELIMINATED gap;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Entry caller, return paths |
| Teal | Decision | Empty guards, per-row loop, truncation decision |
| Orange | Handler | `build_ingredient_rows` — unchanged data producer |
| Purple | Delegate | Delegation call to `_render_gfm_table` via column spec |
| Green (●) | New Logic | Width computation, truncation, table emission — all now in L0 primitive |
| Amber | Eliminated | Inline ad-hoc width math removed by this PR |
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260322-173128-741993/.autoskillit/temp/rectify/rectify_format_ingredients_table_gfm_width_cap_2026-03-22_000000.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
## Summary
The clone isolation contract (`origin=file://`, `upstream=real-url`) is
enforced at the Python execution layer (`remote_resolver.py`,
`REMOTE_PRECEDENCE = ("upstream", "origin")`) but no mechanism
propagated this contract to the bash layer of SKILL.md files — skills
could freely write `git fetch origin` or `git rebase origin/{branch}`,
silently operating against the stale `file://` clone path rather than
the real GitHub remote.
Part A establishes the **architectural immunity guard**: a new semantic
rule `hardcoded-origin-remote` in `rules_skill_content.py` that fires
during `validate_recipe` / `run_semantic_rules` whenever any
recipe-referenced skill uses a literal `origin` remote name in a
bash-level git command that contacts a remote. Part B (landed in the
same branch) fixes the immediate violations in
`resolve-merge-conflicts`, `retry-worktree`, and `implement-worktree`
SKILL.md files by introducing the `REMOTE=$(git remote get-url upstream
>/dev/null 2>&1 && echo upstream || echo origin)` pattern, which
suppresses the URL output so only the remote name is captured.
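The rule's core check can be sketched like this. The regexes below are illustrative approximations; the real `_GIT_REMOTE_COMMAND_RE` and `_LITERAL_ORIGIN_RE` in `rules_skill_content.py` are presumably stricter (for instance, excluding variable expansions that merely contain the word):

```python
import re

# Illustrative approximations of the rule's two regexes.
GIT_REMOTE_COMMAND_RE = re.compile(r"\bgit\s+(fetch|rebase|log|show|rev-parse)\b")
LITERAL_ORIGIN_RE = re.compile(r"\borigin\b")

def check_hardcoded_origin(bash_block: str) -> bool:
    # Fires when a remote-contacting git command names 'origin' literally.
    for line in bash_block.splitlines():
        if GIT_REMOTE_COMMAND_RE.search(line) and LITERAL_ORIGIN_RE.search(line):
            return True
    return False

print(check_hardcoded_origin("git fetch origin"))       # True
print(check_hardcoded_origin('git fetch "$REMOTE"'))    # False
```

In the validation pipeline this runs per extracted bash block, and a hit becomes a `RuleFinding(WARNING)` with `rule=hardcoded-origin-remote`.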
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
START([START: validate_recipe])
END_PASS([PASS: no hardcoded-origin findings])
END_WARN([WARN: RuleFinding emitted])
subgraph Validation ["● VALIDATION PIPELINE (rules_skill_content.py)"]
direction TB
RSR["run_semantic_rules()<br/>━━━━━━━━━━<br/>iterates recipe steps"]
SKIP1{"step.tool<br/>== run_skill?"}
RESOLVE["resolve_skill_name()<br/>━━━━━━━━━━<br/>extract skill name<br/>from skill_command"]
READMD["_resolve_skill_md()<br/>━━━━━━━━━━<br/>locate SKILL.md"]
EXTRACT["extract_bash_blocks()<br/>━━━━━━━━━━<br/>_skill_placeholder_parser.py"]
CHKGIT{"_GIT_REMOTE_COMMAND_RE<br/>━━━━━━━━━━<br/>git fetch/rebase/<br/>log/show/rev-parse?"}
CHKLIT{"_LITERAL_ORIGIN_RE<br/>━━━━━━━━━━<br/>literal 'origin'<br/>(not $REMOTE)?"}
EMIT["● _check_hardcoded_origin_remote<br/>━━━━━━━━━━<br/>RuleFinding(WARNING)<br/>rule=hardcoded-origin-remote"]
end
subgraph Runtime ["● RUNTIME SKILL EXECUTION (SKILL.md bash blocks)"]
direction TB
INVOKE["skill invoked<br/>━━━━━━━━━━<br/>resolve-merge-conflicts<br/>retry-worktree<br/>implement-worktree"]
REMOTE_DETECT{"git remote get-url upstream<br/>━━━━━━━━━━<br/>upstream reachable?"}
USE_UP["REMOTE=upstream<br/>━━━━━━━━━━<br/>real GitHub URL"]
USE_OR["REMOTE=origin<br/>━━━━━━━━━━<br/>fallback (non-clone env)"]
FETCH["git fetch $REMOTE<br/>━━━━━━━━━━<br/>fetches from correct remote"]
REBASE["git rebase $REMOTE/{base_branch}<br/>━━━━━━━━━━<br/>rebases against live state"]
end
START --> RSR
RSR --> SKIP1
SKIP1 -->|"yes"| RESOLVE
SKIP1 -->|"no: skip step"| END_PASS
RESOLVE --> READMD
READMD --> EXTRACT
EXTRACT --> CHKGIT
CHKGIT -->|"no git remote cmd"| END_PASS
CHKGIT -->|"git remote cmd found"| CHKLIT
CHKLIT -->|"no literal origin"| END_PASS
CHKLIT -->|"literal origin detected"| EMIT
EMIT --> END_WARN
INVOKE --> REMOTE_DETECT
REMOTE_DETECT -->|"yes"| USE_UP
REMOTE_DETECT -->|"no"| USE_OR
USE_UP --> FETCH
USE_OR --> FETCH
FETCH --> REBASE
%% CLASS ASSIGNMENTS %%
class START,END_PASS,END_WARN terminal;
class RSR,RESOLVE,READMD,EXTRACT phase;
class CHKGIT,CHKLIT,REMOTE_DETECT stateNode;
class EMIT detector;
class INVOKE handler;
class USE_UP,USE_OR,FETCH,REBASE newComponent;
class SKIP1 stateNode;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start, pass, and warn terminal states |
| Purple | Phase | Validation pipeline processing steps |
| Teal | Decision | Guard conditions and remote detection |
| Red | Detector | Rule finding emission on violation |
| Orange | Handler | Skill invocation entry point |
| Green | New/Modified | Updated remote resolution pattern in SKILL.md files |
Closes #487
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260322-173128-278959/.autoskillit/temp/rectify/rectify_hardcoded-origin-remote_2026-03-22_180100_part_a.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
…Title (#492)
## Summary
The `open-pr` skill's Step 2 extracts the first `# ` heading from the plan file verbatim and uses it as the PR title. Because `make-plan` and `rectify` mandate that multi-part plan files include `— PART X ONLY` in their heading (e.g., `# Implementation Plan: Foo — PART A ONLY`), this internal scope marker leaks directly into the PR title — making PRs appear partial when all parts are implemented.
The fix is two-part: (1) update the bash block at Step 2 to pipe through a `sed` that strips the suffix from the extracted heading, and (2) update the Step 2 prose to explicitly instruct stripping the suffix before passing headings to the multi-plan subagent synthesis path. A contract test is added to guard against regression.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
START([START])
END([END])
subgraph Extract ["● Step 2: Title Extraction (open-pr/SKILL.md)"]
direction TB
HeadExtract["Extract heading<br/>━━━━━━━━━━<br/>head -1 {plan_path}<br/>sed 's/^# //'"]
StripSuffix["● Strip PART X ONLY suffix<br/>━━━━━━━━━━<br/>sed 's/ *— *PART [A-Z] ONLY$//'<br/>guards against scope-marker leakage"]
end
subgraph PlanRoute ["Plan Count Routing"]
PlanCount{"single or<br/>multiple plans?"}
SingleUse["Use directly<br/>━━━━━━━━━━<br/>BASE_TITLE = stripped heading"]
MultiSynth["Subagent synthesis<br/>━━━━━━━━━━<br/>sonnet subagent<br/>synthesizes clean title"]
end
subgraph PrefixApply ["Step 2b: run_name Prefix"]
RunNameCheck{"run_name<br/>switch"}
FeaturePrefix["[FEATURE] BASE_TITLE<br/>━━━━━━━━━━<br/>run_name starts with 'feature'"]
FixPrefix["[FIX] BASE_TITLE<br/>━━━━━━━━━━<br/>run_name starts with 'fix'"]
NoPrefix["BASE_TITLE unchanged<br/>━━━━━━━━━━<br/>any other run_name (e.g. 'impl')"]
end
PRCreate["gh pr create<br/>━━━━━━━━━━<br/>--title TITLE"]
ContractTests["● Contract Tests<br/>━━━━━━━━━━<br/>test_part_suffix_stripped_in_bash_block<br/>test_step2_prose_instructs_suffix_stripping"]
START --> HeadExtract
HeadExtract --> StripSuffix
StripSuffix --> PlanCount
PlanCount -->|"single"| SingleUse
PlanCount -->|"multiple"| MultiSynth
SingleUse --> RunNameCheck
MultiSynth --> RunNameCheck
RunNameCheck -->|"feature*"| FeaturePrefix
RunNameCheck -->|"fix*"| FixPrefix
RunNameCheck -->|"other"| NoPrefix
FeaturePrefix --> PRCreate
FixPrefix --> PRCreate
NoPrefix --> PRCreate
PRCreate --> END
ContractTests -.->|"guards"| StripSuffix
class START,END terminal;
class HeadExtract handler;
class StripSuffix newComponent;
class PlanCount,RunNameCheck detector;
class SingleUse,MultiSynth phase;
class FeaturePrefix,FixPrefix,NoPrefix stateNode;
class PRCreate output;
class ContractTests newComponent;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | Start and end points |
| Orange | Handler | Heading extraction processing |
| Green | Modified | ● Nodes modified by this PR (suffix strip, contract tests) |
| Red | Detector | Decision points (plan count, run_name switch) |
| Purple | Phase | Synthesis paths (direct use, subagent) |
| Teal | State | Prefix variant state nodes |
| Dark Teal | Output | gh pr create invocation |
Closes #488
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-20260322-203748-209804/temp/make-plan/open_pr_part_suffix_plan_2026-03-22_000000.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
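The suffix-stripping transform in the open-pr fix is, in Python terms, a single anchored substitution; the `sed` in the skill's bash block is the shell equivalent. This sketch is illustrative, not the project's code:

```python
import re

# Matches the internal multi-part scope marker at the end of a heading.
PART_SUFFIX_RE = re.compile(r"\s*—\s*PART [A-Z] ONLY$")

def strip_part_suffix(heading: str) -> str:
    # Remove the marker before using the heading as a PR title.
    return PART_SUFFIX_RE.sub("", heading)

print(strip_part_suffix("Implementation Plan: Foo — PART A ONLY"))
# Implementation Plan: Foo
```

Anchoring with `$` ensures only a trailing marker is removed, so a heading that legitimately mentions "PART" mid-sentence is untouched.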
## Summary
When multiple pipelines run in parallel, each one independently reaches
`confirm_cleanup`
(an interactive `action: confirm` step) and prompts the user before
calling `remove_clone`.
This interactive prompt blocks the shared execution context and stalls
all sibling
pipelines that are still executing.
The fix introduces a **deferred cleanup mode** controlled by a
`defer_cleanup` ingredient.
When enabled, each individual pipeline skips the interactive cleanup
gate and instead
registers its clone path and completion status to a shared file-based
registry. After
all parallel pipelines complete, the orchestrator runs a single batch
cleanup phase —
one confirmation prompt, one bulk delete of successful clones, all error
clones preserved
automatically.
Changes span: a new `workspace/clone_registry.py` module, two new MCP
tools
(`register_clone_status`, `batch_cleanup_clones`), a new routing utility
in
`smoke_utils.py`, and updated terminal steps in four recipe YAML files.
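A minimal sketch of the file-based registry's atomic write. Field names and the locking story are assumptions; the real `workspace/clone_registry.py` presumably also serializes concurrent writers, which this sketch omits for brevity:

```python
import json
import os
import tempfile
from pathlib import Path

def register_clone_status(registry: Path, clone_path: str, status: str) -> None:
    # Read-modify-write with an atomic rename so readers never observe
    # a half-written registry file. (Real module would also need file
    # locking for fully safe parallel writers.)
    entries = json.loads(registry.read_text()) if registry.exists() else []
    entries.append({"path": clone_path, "status": status})
    fd, tmp = tempfile.mkstemp(dir=registry.parent)
    with os.fdopen(fd, "w") as f:
        json.dump(entries, f)
    os.replace(tmp, registry)  # atomic on POSIX

def successful_clones(registry: Path) -> list[str]:
    # Batch cleanup offers deletion only for successful clones (REQ-CLONE-004);
    # error clones stay on disk for investigation (REQ-CLONE-003).
    return [e["path"] for e in json.loads(registry.read_text()) if e["status"] == "success"]

reg = Path(tempfile.mkdtemp()) / "clone-cleanup-registry.json"
register_clone_status(reg, "/tmp/clone-a", "success")
register_clone_status(reg, "/tmp/clone-b", "error")
print(successful_clones(reg))  # ['/tmp/clone-a']
```

The orchestrator's batch phase then reads the registry once, prompts once, and bulk-deletes only the successful entries.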
## Requirements
### Clone Lifecycle Management
- **REQ-CLONE-001:** The system must suppress clone deletion prompts
while any sibling pipeline in the same batch is still executing.
- **REQ-CLONE-002:** The system must defer clone cleanup to a single
batch operation that runs only after all pipelines in the batch have
completed.
- **REQ-CLONE-003:** The system must exclude from cleanup any clone
whose pipeline encountered an error, preserving it for investigation.
- **REQ-CLONE-004:** The system must only offer deletion for clones
whose pipelines completed successfully and without errors.
### Orchestration Coordination
- **REQ-ORCH-001:** The orchestrator must track completion state of all
parallel pipelines before initiating any cleanup phase.
- **REQ-ORCH-002:** The orchestrator must treat clone cleanup as a
distinct terminal phase that cannot overlap with pipeline execution.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
%% TERMINALS %%
DONE([done])
ESC([escalate_stop])
subgraph IssueRelease ["Issue Release (per-pipeline)"]
direction LR
RIS["● release_issue_success<br/>━━━━━━━━━━<br/>release_issue tool<br/>target_branch passed"]
RIT["● release_issue_timeout<br/>━━━━━━━━━━<br/>release_issue tool<br/>no target_branch"]
RIF["● release_issue_failure<br/>━━━━━━━━━━<br/>release_issue tool<br/>releases claim"]
end
subgraph CleanupRouting ["★ Cleanup Mode Routing (per-pipeline)"]
direction TB
CDC{"★ check_defer_cleanup<br/>━━━━━━━━━━<br/>check_cleanup_mode()<br/>success path"}
CDOF{"★ check_defer_on_failure<br/>━━━━━━━━━━<br/>check_cleanup_mode()<br/>failure path"}
end
subgraph ImmediateCleanup ["Immediate Cleanup (defer_cleanup=false)"]
direction TB
CC["● confirm_cleanup<br/>━━━━━━━━━━<br/>action: confirm<br/>user gate"]
DC["delete_clone<br/>━━━━━━━━━━<br/>remove_clone keep=false"]
CF["cleanup_failure<br/>━━━━━━━━━━<br/>remove_clone keep=true<br/>preserves on error"]
end
subgraph DeferredReg ["★ Deferred Registration (defer_cleanup=true)"]
direction TB
REG["★ register_success_deferred<br/>━━━━━━━━━━<br/>register_clone_status<br/>status=success"]
REGE["★ register_error_deferred<br/>━━━━━━━━━━<br/>register_clone_status<br/>status=error"]
RF[("★ clone-cleanup-registry.json<br/>━━━━━━━━━━<br/>atomic JSON writes<br/>{path, status} entries")]
end
subgraph BatchPhase ["★ Batch Cleanup Phase (orchestrator — runs ONCE after all pipelines)"]
direction TB
BCC["★ batch_confirm_cleanup<br/>━━━━━━━━━━<br/>action: confirm<br/>single user prompt"]
BDC["★ batch_delete_clones<br/>━━━━━━━━━━<br/>batch_cleanup_clones tool<br/>reads registry"]
end
%% SUCCESS PATH %%
RIS -->|"on_success / on_failure"| CDC
RIT -->|"on_success / on_failure"| CDC
CDC -->|"deferred == 'false'"| CC
CDC -->|"deferred == 'true'"| REG
CC -->|"user: yes"| DC
CC -->|"user: no"| DONE
DC -->|"on_success / on_failure"| DONE
REG -->|"on_success / on_failure"| DONE
REG -->|"writes"| RF
%% FAILURE PATH %%
RIF -->|"on_success / on_failure"| CDOF
CDOF -->|"deferred == 'false'"| CF
CDOF -->|"deferred == 'true'"| REGE
CF -->|"on_success / on_failure"| ESC
REGE -->|"on_success / on_failure"| ESC
REGE -->|"writes"| RF
%% BATCH PATH (orchestrator-level) %%
RF -->|"reads all entries"| BDC
BCC -->|"user: yes"| BDC
BCC -->|"user: no"| DONE
BDC -->|"on_success / on_failure"| DONE
%% CLASS ASSIGNMENTS %%
class DONE,ESC terminal;
class RIS,RIT,RIF handler;
class DC,CF handler;
class CC handler;
class CDC,CDOF stateNode;
class REG,REGE,BCC,BDC newComponent;
class RF newComponent;
```
### Module Dependency Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 70, 'curve': 'basis'}}}%%
graph TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff;
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
subgraph L3 ["LAYER 3 — SERVER"]
direction LR
TOOLS_CLONE["● server/tools_clone.py<br/>━━━━━━━━━━<br/>+register_clone_status<br/>+batch_cleanup_clones"]
HELPERS["● server/helpers.py<br/>━━━━━━━━━━<br/>re-exports clone_registry<br/>module object"]
SRV_INIT["● server/__init__.py<br/>━━━━━━━━━━<br/>40 kitchen tools<br/>(was 38)"]
end
subgraph L1 ["LAYER 1 — WORKSPACE"]
direction LR
CLONE_REG["★ workspace/clone_registry.py<br/>━━━━━━━━━━<br/>register_clone()<br/>read_registry()<br/>cleanup_candidates()<br/>batch_delete()"]
WS_INIT["● workspace/__init__.py<br/>━━━━━━━━━━<br/>+register_clone<br/>+read_registry<br/>+cleanup_candidates"]
end
subgraph L0C ["LAYER 0 — CORE"]
direction LR
TYPE_CONST["● core/_type_constants.py<br/>━━━━━━━━━━<br/>+register_clone_status<br/>+batch_cleanup_clones<br/>in GATED_TOOLS,<br/>TOOL_SUBSET_TAGS,<br/>TOOL_CATEGORIES"]
CORE_IO["core/io.py<br/>━━━━━━━━━━<br/>atomic_write()"]
CORE_LOG["core/logging.py<br/>━━━━━━━━━━<br/>get_logger()"]
end
subgraph L0S ["LAYER 0 — STANDALONE UTILS"]
SMOKE["● smoke_utils.py<br/>━━━━━━━━━━<br/>+check_cleanup_mode()<br/>stdlib only — no autoskillit deps"]
end
subgraph EXT ["EXTERNAL"]
direction LR
FASTMCP["fastmcp<br/>━━━━━━━━━━<br/>Context, CurrentContext<br/>@mcp.tool decorator"]
STDLIB["stdlib<br/>━━━━━━━━━━<br/>json, pathlib, asyncio<br/>typing, collections.abc"]
end
%% L3 → L1 (valid downward) %%
HELPERS -->|"imports module<br/>clone_registry"| CLONE_REG
TOOLS_CLONE -->|"uses clone_registry<br/>via helpers"| HELPERS
%% L3 → L0 (valid downward) %%
TOOLS_CLONE -->|"imports get_logger"| CORE_LOG
SRV_INIT -->|"reads GATED_TOOLS<br/>tool counts"| TYPE_CONST
%% L1 → L1 (intra-layer, package init) %%
WS_INIT -->|"re-exports 3 functions"| CLONE_REG
%% L1 → L0 (valid downward) %%
CLONE_REG -->|"atomic_write()"| CORE_IO
CLONE_REG -->|"get_logger()"| CORE_LOG
%% External %%
TOOLS_CLONE -->|"@mcp.tool<br/>Context"| FASTMCP
CLONE_REG --> STDLIB
SMOKE --> STDLIB
%% CLASS ASSIGNMENTS %%
class TOOLS_CLONE,SRV_INIT cli;
class HELPERS phase;
class WS_INIT handler;
class CLONE_REG newComponent;
class TYPE_CONST,CORE_IO,CORE_LOG stateNode;
class SMOKE handler;
class FASTMCP,STDLIB integration;
```
### C4 Container Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 50, 'rankSpacing': 60, 'curve': 'basis'}}}%%
graph TB
%% CLASS DEFINITIONS %%
classDef cli fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef integration fill:#c62828,stroke:#ef9a9a,stroke-width:2px,color:#fff;
%% CALLERS %%
PIPE1(["Parallel Pipeline 1<br/>━━━━━━━━━━<br/>headless session<br/>calls on completion"])
PIPE2(["Parallel Pipeline N<br/>━━━━━━━━━━<br/>headless session<br/>calls on completion"])
ORCH(["Orchestrator<br/>━━━━━━━━━━<br/>interactive session<br/>runs batch phase once"])
subgraph Server ["● FastMCP Server (42 tools, 40 kitchen-tagged)"]
direction TB
subgraph CloneTools ["● tools_clone.py — Clone & Remote category"]
direction LR
CLONE_REPO["clone_repo<br/>━━━━━━━━━━<br/>clone GitHub repos<br/>for pipeline isolation"]
REMOVE_CLONE["remove_clone<br/>━━━━━━━━━━<br/>delete clone dir<br/>(immediate path)"]
PUSH["push_to_remote<br/>━━━━━━━━━━<br/>push branch to remote"]
REG_TOOL["★ register_clone_status<br/>━━━━━━━━━━<br/>status: success | error<br/>clone_path, registry_path"]
BATCH_TOOL["★ batch_cleanup_clones<br/>━━━━━━━━━━<br/>reads registry<br/>deletes success clones"]
end
TYPE_CONST["● core/_type_constants.py<br/>━━━━━━━━━━<br/>GATED_TOOLS: +2 tools<br/>TOOL_SUBSET_TAGS: clone tag<br/>TOOL_CATEGORIES: Clone & Remote"]
end
subgraph WorkspaceLib ["Workspace Library (L1)"]
direction TB
CLONE_REG_MOD["★ workspace/clone_registry.py<br/>━━━━━━━━━━<br/>register_clone() — atomic append<br/>read_registry() — safe read<br/>cleanup_candidates() — partition<br/>batch_delete() — bulk remove"]
end
subgraph Storage ["Storage"]
REGISTRY[("★ clone-cleanup-registry.json<br/>━━━━━━━━━━<br/>JSON on disk<br/>[ {path, status}, … ]<br/>atomic writes — parallel-safe")]
end
%% CONNECTIONS %%
PIPE1 -->|"MCP: register_clone_status<br/>status=success|error"| REG_TOOL
PIPE2 -->|"MCP: register_clone_status<br/>status=success|error"| REG_TOOL
ORCH -->|"MCP: batch_cleanup_clones<br/>(after all pipelines done)"| BATCH_TOOL
REG_TOOL -->|"calls register_clone()"| CLONE_REG_MOD
BATCH_TOOL -->|"calls batch_delete()"| CLONE_REG_MOD
BATCH_TOOL -->|"calls remove_clone (sync)"| REMOVE_CLONE
CLONE_REG_MOD -->|"atomic writes"| REGISTRY
CLONE_REG_MOD -->|"reads entries"| REGISTRY
Server -.->|"gating lookup"| TYPE_CONST
%% CLASS ASSIGNMENTS %%
class PIPE1,PIPE2,ORCH cli;
class CLONE_REPO,REMOVE_CLONE,PUSH handler;
class REG_TOOL,BATCH_TOOL,CLONE_REG_MOD newComponent;
class REGISTRY stateNode;
class TYPE_CONST phase;
```
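As the storage node above states, the registry is a JSON list of `{path, status}` entries written atomically so parallel pipelines never leave a partial file on disk. A minimal sketch of that pattern, using the function names from the diagram (`register_clone`, `read_registry`, `cleanup_candidates`) with a simplified write-temp-then-rename in place of the project's `core.io.atomic_write()` — an illustration of the technique, not the actual module:

```python
import json
import os
import tempfile
from pathlib import Path

def _atomic_write(path: Path, text: str) -> None:
    # Write to a temp file in the same directory, then rename over the
    # target: readers see either the old or the new file, never a torn write.
    fd, tmp = tempfile.mkstemp(dir=path.parent)
    try:
        with os.fdopen(fd, "w") as f:
            f.write(text)
        os.replace(tmp, path)
    except BaseException:
        os.unlink(tmp)
        raise

def read_registry(registry: Path) -> list[dict]:
    if not registry.exists():
        return []
    return json.loads(registry.read_text())

def register_clone(registry: Path, clone_path: str, status: str) -> None:
    entries = read_registry(registry)
    entries.append({"path": clone_path, "status": status})
    _atomic_write(registry, json.dumps(entries, indent=2))

def cleanup_candidates(registry: Path) -> tuple[list[str], list[str]]:
    # Partition: success clones are deletable, error clones are preserved.
    entries = read_registry(registry)
    deletable = [e["path"] for e in entries if e["status"] == "success"]
    preserved = [e["path"] for e in entries if e["status"] == "error"]
    return deletable, preserved
```

Note that the rename makes each individual write atomic; fully parallel-safe appends would also need a lock or retry around the read-modify-write cycle, which this sketch omits.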
Closes #486
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/impl-20260322-203748-529080/temp/make-plan/defer_clone_cleanup_plan_2026-03-22_205207.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
## Summary
`_format_response()` in `pretty_output.py` conflated two independent
concerns: **envelope detection** (did the Claude Code `{"result":
"..."}` wrapper contain a JSON dict or plain text?) and **formatter
dispatch** (which formatter handles this tool?). When a tool returned
plain text, an `except: pass` silently left `data` as the raw envelope
dict, causing named formatters to receive the wrong shape — producing
truncated near-empty output (`## token_summary`) with no error. The fix
extracts `_resolve_payload()` which produces a typed `_DictPayload` or
`_PlainTextPayload` before any dispatch, making it structurally
impossible for a named dict-formatter to receive a plain-text envelope.
A secondary defect — `_UNFORMATTED_TOOLS` was declared but never
consulted — is also corrected by making it an active behavioral gate.
## Architecture Impact
### Process Flow Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
%% TERMINALS %%
START([PostToolUse Event])
PASSTHRU([Pass-Through: exit 0])
FORMATTED([Formatted Output: exit 0])
subgraph Entry ["Hook Entry — main()"]
direction TB
ParseStdin["ParseStdin<br/>━━━━━━━━━━<br/>json.loads(stdin)"]
ParseErr{"JSONDecodeError<br/>or non-dict?"}
ExtractFields["ExtractFields<br/>━━━━━━━━━━<br/>tool_name + tool_response"]
MissingFields{"Missing<br/>fields?"}
PipelineMode["_is_pipeline_mode()<br/>━━━━━━━━━━<br/>Read hook config JSON"]
end
subgraph Resolution ["★ _resolve_payload() — NEW typed resolver"]
direction TB
ParseOuter["ParseOuter<br/>━━━━━━━━━━<br/>json.loads(tool_response)"]
OuterErr{"parse error<br/>or non-dict?"}
EnvCheck{"mcp__ prefix +<br/>single result key +<br/>str value?"}
InnerParse["★ InnerParse<br/>━━━━━━━━━━<br/>json.loads(data[result])"]
InnerDict{"inner is<br/>dict?"}
DictPayload["★ _DictPayload(data=inner)<br/>━━━━━━━━━━<br/>Unwrapped JSON object"]
PlainText["★ _PlainTextPayload(text)<br/>━━━━━━━━━━<br/>Pre-formatted string"]
BareDict["_DictPayload(data)<br/>━━━━━━━━━━<br/>No envelope to strip"]
end
subgraph PlainDispatch ["★ Plain-Text Dispatch — NEW branch"]
direction TB
PlainLookup{"★ short_name in<br/>_PLAIN_TEXT_FORMATTERS?"}
PlainHandler["★ _fmt_open_kitchen_plain_text()<br/>━━━━━━━━━━<br/>Custom plain-text render"]
PlainPassThru["PassThru<br/>━━━━━━━━━━<br/>Return text unchanged"]
end
subgraph DictDispatch ["● Dict Dispatch — _UNFORMATTED_TOOLS now active gate"]
direction TB
GateErr{"subtype ==<br/>gate_error?"}
ToolExc{"subtype ==<br/>tool_exception?"}
UnformCheck{"● short_name in<br/>_UNFORMATTED_TOOLS?"}
FormatLookup{"short_name in<br/>_FORMATTERS?"}
GateFmt["_fmt_gate_error()<br/>━━━━━━━━━━<br/>Gate error renderer"]
ExcFmt["_fmt_tool_exception()<br/>━━━━━━━━━━<br/>Exception renderer"]
NamedFmt["_FORMATTERS[short_name]<br/>━━━━━━━━━━<br/>Named formatter dispatch"]
Generic["_fmt_generic()<br/>━━━━━━━━━━<br/>Fallback KV renderer"]
end
%% FLOW %%
START --> ParseStdin
ParseStdin --> ParseErr
ParseErr -->|"yes"| PASSTHRU
ParseErr -->|"no"| ExtractFields
ExtractFields --> MissingFields
MissingFields -->|"yes"| PASSTHRU
MissingFields -->|"no"| PipelineMode
PipelineMode --> ParseOuter
ParseOuter --> OuterErr
OuterErr -->|"yes"| PASSTHRU
OuterErr -->|"no"| EnvCheck
EnvCheck -->|"bare dict"| BareDict
EnvCheck -->|"envelope detected"| InnerParse
InnerParse -->|"success"| InnerDict
InnerParse -->|"JSONDecodeError"| PlainText
InnerDict -->|"true"| DictPayload
InnerDict -->|"false: list/str"| PlainText
DictPayload --> GateErr
BareDict --> GateErr
PlainText --> PlainLookup
PlainLookup -->|"found"| PlainHandler
PlainLookup -->|"not found"| PlainPassThru
PlainHandler --> FORMATTED
PlainPassThru --> FORMATTED
GateErr -->|"yes"| GateFmt
GateErr -->|"no"| ToolExc
ToolExc -->|"yes"| ExcFmt
ToolExc -->|"no"| UnformCheck
UnformCheck -->|"yes"| Generic
UnformCheck -->|"no"| FormatLookup
FormatLookup -->|"found"| NamedFmt
FormatLookup -->|"not found"| Generic
GateFmt --> FORMATTED
ExcFmt --> FORMATTED
NamedFmt --> FORMATTED
Generic --> FORMATTED
%% CLASS ASSIGNMENTS %%
class START,PASSTHRU,FORMATTED terminal;
class ParseErr,MissingFields,OuterErr,EnvCheck,InnerDict,PlainLookup,GateErr,ToolExc,FormatLookup stateNode;
class ParseStdin,ExtractFields,PipelineMode,InnerParse,BareDict,GateFmt,ExcFmt,NamedFmt,Generic handler;
class UnformCheck detector;
class DictPayload,PlainText,PlainHandler,PlainPassThru,PlainDispatch newComponent;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | PostToolUse event, pass-through, formatted output |
| Teal | State | Decision points and routing conditions |
| Orange | Handler | Processing nodes (parse, extract, format) |
| Green | New Component | ★ New: `_resolve_payload`, `_PlainTextPayload`, `_DictPayload`, `_PLAIN_TEXT_FORMATTERS` |
| Red | Detector | ● `_UNFORMATTED_TOOLS` behavioral gate (previously dead code, now active) |
### Error Resilience Diagram
```mermaid
%%{init: {'flowchart': {'nodeSpacing': 40, 'rankSpacing': 50, 'curve': 'basis'}}}%%
flowchart TB
%% CLASS DEFINITIONS %%
classDef terminal fill:#1a237e,stroke:#7986cb,stroke-width:2px,color:#fff;
classDef stateNode fill:#004d40,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef handler fill:#e65100,stroke:#ffb74d,stroke-width:2px,color:#fff;
classDef phase fill:#6a1b9a,stroke:#ba68c8,stroke-width:2px,color:#fff;
classDef newComponent fill:#2e7d32,stroke:#81c784,stroke-width:2px,color:#fff;
classDef detector fill:#b71c1c,stroke:#ef5350,stroke-width:2px,color:#fff;
classDef output fill:#00695c,stroke:#4db6ac,stroke-width:2px,color:#fff;
classDef gap fill:#ff6f00,stroke:#ffa726,stroke-width:2px,color:#000;
%% TERMINALS %%
HOOK_START([PostToolUse Event])
PASSTHRU([Pass-Through: exit 0<br/>Claude Code unaffected])
FORMATTED([Formatted Output: exit 0])
subgraph FailOpen ["Fail-Open Sentinels — main()"]
direction TB
OuterParse["json.loads(stdin)<br/>━━━━━━━━━━<br/>Parse hook event"]
ParseFail{"JSONDecodeError?"}
ExtractOK["Extract tool_name<br/>+ tool_response"]
FieldsMissing{"Fields<br/>missing?"}
FormatCall["● _format_response()<br/>━━━━━━━━━━<br/>Format + dispatch"]
AnyException{"Exception<br/>raised?"}
NullCheck{"formatted<br/>is None?"}
end
subgraph PayloadGates ["★ _resolve_payload() — Validation Gates"]
direction TB
ParseTR["json.loads(tool_response)<br/>━━━━━━━━━━<br/>Outer parse"]
TRFail{"JSONDecodeError<br/>or non-dict?"}
EnvGate{"MCP envelope<br/>detected?"}
InnerAttempt["★ json.loads(inner)<br/>━━━━━━━━━━<br/>Inner parse attempt"]
InnerFail{"JSONDecodeError<br/>or non-dict inner?"}
end
subgraph InnerFixGap ["Inner Parse: Error becomes Signal (★ Fixed)"]
direction TB
OldBroken["BROKEN (pre-fix)<br/>━━━━━━━━━━<br/>except: pass → wrong shape<br/>→ empty ## token_summary"]
NewFixed["★ _PlainTextPayload(text)<br/>━━━━━━━━━━<br/>JSONDecodeError is the signal<br/>→ correct plain-text dispatch"]
end
subgraph SafeGuards ["● _UNFORMATTED_TOOLS — Structural Safeguard (now active)"]
direction TB
UnformGate{"● short_name in<br/>_UNFORMATTED_TOOLS?"}
SafeRoute["_fmt_generic()<br/>━━━━━━━━━━<br/>Guaranteed safe fallback"]
NamedDispatch["_FORMATTERS[short_name]<br/>━━━━━━━━━━<br/>Named formatter (dict only)"]
end
%% FLOW %%
HOOK_START --> OuterParse
OuterParse --> ParseFail
ParseFail -->|"yes"| PASSTHRU
ParseFail -->|"no"| ExtractOK
ExtractOK --> FieldsMissing
FieldsMissing -->|"yes"| PASSTHRU
FieldsMissing -->|"no"| FormatCall
FormatCall --> AnyException
AnyException -->|"yes"| PASSTHRU
AnyException -->|"no"| NullCheck
NullCheck -->|"yes: None"| PASSTHRU
NullCheck -->|"no: str"| FORMATTED
FormatCall --> ParseTR
ParseTR --> TRFail
TRFail -->|"yes → None"| PASSTHRU
TRFail -->|"no"| EnvGate
EnvGate -->|"yes: MCP envelope"| InnerAttempt
EnvGate -->|"no: bare dict"| UnformGate
InnerAttempt --> InnerFail
InnerFail -->|"yes: JSONDecodeError"| NewFixed
InnerFail -->|"no: inner dict"| UnformGate
NewFixed --> FORMATTED
OldBroken -.->|"replaced by ★"| NewFixed
UnformGate -->|"yes → _fmt_generic"| SafeRoute
UnformGate -->|"no"| NamedDispatch
SafeRoute --> FORMATTED
NamedDispatch --> FORMATTED
%% CLASS ASSIGNMENTS %%
class HOOK_START,PASSTHRU,FORMATTED terminal;
class ParseFail,FieldsMissing,AnyException,NullCheck,TRFail,EnvGate,InnerFail stateNode;
class OuterParse,ExtractOK,ParseTR,InnerAttempt,SafeRoute,NamedDispatch handler;
class FormatCall phase;
class UnformGate detector;
class NewFixed,PayloadGates newComponent;
class OldBroken gap;
```
**Color Legend:**
| Color | Category | Description |
|-------|----------|-------------|
| Dark Blue | Terminal | PostToolUse event, pass-through, formatted output |
| Teal | State | Decision points and failure checks |
| Orange | Handler | Parse and processing nodes |
| Purple | Phase | `_format_response` entry point |
| Green | New Component | ★ Fixed: `_PlainTextPayload` captures inner JSONDecodeError correctly |
| Red | Detector | ● `_UNFORMATTED_TOOLS` behavioral gate (structural safeguard) |
| Yellow/Amber | Gap | Pre-fix broken behavior: silent data corruption via `except: pass` |
Closes #494
## Implementation Plan
Plan file:
`/home/talon/projects/autoskillit-runs/remediation-20260323-081448-474525/.autoskillit/temp/rectify/rectify_pretty_output_hook_payload_dispatch_2026-03-23_084500.md`
🤖 Generated with [Claude Code](https://claude.com/claude-code) via
AutoSkillit
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Remove the generic open-pr-main skill from skills_extended/ and replace it with a comprehensive project-local promote-to-main skill in .claude/skills/. The new skill adds pre-flight checks, commit categorization, per-domain risk scoring, breaking change audit, test coverage delta, regression risk analysis, release notes generation, traceability matrix, migration detection, cross-domain dependency analysis, and dry-run mode. Subagents are granted autonomy to spawn their own sub-subagents at discretion. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
When >= 20 PRs have been squash-merged into integration since divergence from main, the skill bumps the minor version (X.Y+1.0) on integration before creating the promotion PR. Below 20, the CI workflow's automatic patch bump suffices. In dry-run mode, the bump is reported but not applied. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
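That threshold rule can be sketched as a single decision function. The name `decide_bump` and the semver parsing are illustrative assumptions; the 20-PR cutoff and the X.Y+1.0 behavior come from the description above:

```python
def decide_bump(squash_merged_prs: int, current: str, threshold: int = 20) -> str:
    """Return the version the promotion PR should carry.

    At or above the threshold of squash-merged PRs since divergence
    from main, bump the minor version and reset patch (X.Y+1.0).
    Below it, leave the version alone and let the CI workflow's
    automatic patch bump suffice. In dry-run mode the caller would
    report this result without applying it.
    """
    major, minor, patch = (int(p) for p in current.split("."))
    if squash_merged_prs >= threshold:
        return f"{major}.{minor + 1}.0"
    return current
```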
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The test name claimed to verify on_failure routing but only asserted path_contamination appeared in the prompt. Now mirrors the drain_race test pattern with segment proximity check. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…tchen rule tests Split combined assert with logical-and into separate assertions for clear failure identification. Scope gh-pr-merge prohibition check to the specific rule containing the phrase rather than all rules. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Replace relative Path("src/autoskillit/cli") with __file__-based
resolution to prevent vacuous pass when pytest runs from a non-root
directory.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
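The fix follows the standard pattern of anchoring test paths to the test module rather than the working directory. A sketch, where `repo_root` and the depth of 2 are illustrative assumptions for a test living at `tests/cli/`:

```python
from pathlib import Path

def repo_root(test_file: str, depth: int = 2) -> Path:
    """Walk up `depth` directories from the test module
    (tests/cli/test_x.py -> repo root), independent of the
    process working directory."""
    return Path(test_file).resolve().parents[depth]

# Fragile original: Path("src/autoskillit/cli") resolves against the
# cwd, so pytest run outside the repo root finds nothing and any
# "for each file in cli_dir" assertion passes vacuously.
# Robust replacement (inside the test module):
#   cli_dir = repo_root(__file__) / "src" / "autoskillit" / "cli"
```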
Log OSError in is_first_run() and gather_intel timeout/failure at debug level instead of silently swallowing. Capture intel_future result value for debug visibility. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The kitchen rule uses "Never" (title case) which was missed by the exact-case keyword list. Switch to .lower() comparison. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
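The same class of bug in miniature: an exact-case membership test misses title-case wording. A generic sketch of the casefolded comparison (the keyword set and function name are hypothetical, not the project's actual test code):

```python
PROHIBITION_KEYWORDS = {"never", "must not", "do not"}

def contains_prohibition(rule_text: str) -> bool:
    # Lowercase once, then substring-match: "Never use gh pr merge"
    # now trips the check that an exact-case keyword list missed.
    lowered = rule_text.lower()
    return any(kw in lowered for kw in PROHIBITION_KEYWORDS)
```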
Promote integration to main (28 PRs, 25 issues, 16 fixes, 16 features)
```python
            " before proceeding.\n"
        )
        print("  To bypass, type exactly:\n")
        print(f"  {_D}{_SECRET_SCAN_BYPASS_PHRASE}{_R}\n")
```
**Check failure (Code scanning / CodeQL):** Clear-text logging of sensitive information (High)

**Copilot Autofix**
In general, to fix clear-text logging of sensitive information, avoid printing or logging secret or password-like values directly. Instead, describe to the user what to do without echoing the sensitive value, or handle the interaction in a way that doesn’t expose the value in logs (e.g., non-echoed input, partial redaction, or interactive-only presentation).
For this specific case, we should stop printing _SECRET_SCAN_BYPASS_PHRASE directly. The simplest fix that preserves behavior is:
- Keep the internal constant `_SECRET_SCAN_BYPASS_PHRASE` unchanged so the comparison logic works as-is.
- Change the instructions printed to the user so they no longer embed the full phrase in clear text. Instead, we can:
  - Either describe the phrase (e.g., "Type the exact bypass phrase shown above in your documentation").
  - Or print a partially redacted version (e.g., with some characters replaced by `*`), so the rule no longer detects a full secret/password being logged, but the user still sees most of it.
- Leave the `input` and comparison logic unchanged; the user still has to type the full phrase correctly, but we are not logging it.
Because we can only change the shown snippet, the best minimal change is to modify line 255 so it no longer directly interpolates _SECRET_SCAN_BYPASS_PHRASE. For example, we can construct a masked version of the phrase in code right at that point and print the masked version instead of the full phrase. This keeps the same UI style, requires no new imports, and doesn’t affect any other behavior.
Concretely in src/autoskillit/cli/_init_helpers.py:
- Right before printing the bypass phrase, introduce a local variable `masked_phrase` that redacts some of the characters of `_SECRET_SCAN_BYPASS_PHRASE` (for instance, replace the middle part with `***`).
- Change the `print(f"  {_D}{_SECRET_SCAN_BYPASS_PHRASE}{_R}\n")` line to print `masked_phrase` instead.
- No new imports, methods, or global definitions are required.
```diff
@@ -252,7 +252,12 @@
             " before proceeding.\n"
         )
         print("  To bypass, type exactly:\n")
-        print(f"  {_D}{_SECRET_SCAN_BYPASS_PHRASE}{_R}\n")
+        masked_phrase = (
+            _SECRET_SCAN_BYPASS_PHRASE[:10]
+            + "..."
+            + _SECRET_SCAN_BYPASS_PHRASE[-10:]
+        )
+        print(f"  {_D}{masked_phrase}{_R}\n")
         response = input("  > ").strip()
         if response != _SECRET_SCAN_BYPASS_PHRASE:
             print(f"\n  {_B}Aborted.{_R} Phrase did not match.")
```
Investigated — this is intentional. Line 41 defines `_SECRET_SCAN_BYPASS_PHRASE` as the fixed string `"I accept the risk of leaking secrets without pre-commit scanning"`. Line 255 prints this constant to the terminal inside `_check_secret_scanning()` as an interactive consent prompt — it is not logging any user-supplied secret or password. The scanner flagged the words "secret"/"secrets" in the variable name and string value, but no sensitive data (credentials, keys, passwords) is involved.