Claude Code skills for high-stakes backend features — codebase-aware task breakdown + 4-round AI verification
🇨🇳 中文用户推荐阅读 中文文档 — 包含完整故事背景、设计原理和使用指南。
A payment refund API passed two code reviews — including the author's own — and still shipped with a concurrency race condition. Users got double-refunded within two hours of launch.
The root cause wasn't carelessness. Code review has a structural blind spot: every reviewer shares the author's design assumptions. The more context you give reviewers, the harder it is for them to spot holes in your belief system.
This workflow fixes that with 4 independent AI verification passes:
- 🔍 Catches 5–15 concurrency & idempotency bugs per review — by an AI that sees only the code, not the design doc
- ⚡ Parallel subagents for phases 4.3–4.5 — behavior diff, cross-repo scan, and business invariant check run simultaneously
- 🏗️ No extra plugins required — every phase has a built-in fallback using Claude Code's native tools
Does the feature involve any of the following?
├── 💰 Financial transactions, payments, refunds, settlements? → YES → Use this workflow
├── 🔄 Order / inventory state machines with status transitions? → YES → Use this workflow
├── 🔒 Distributed locks, concurrency control, idempotent retry? → YES → Use this workflow
├── 🔗 Cross-service MQ/RPC contracts or shared proto/model change? → YES → Use this workflow
├── 🗄️ Online schema migration or dual-write strategy? → YES → Use this workflow
└── ⏱️ Estimated effort ≥ 3 person-days with high cost-of-failure? → YES → Use this workflow
None of the above? → Standard workflow is fine ✓
7-phase Claude Code workflow for high-stakes backend feature development:
① Requirements & Design → brainstorming skill
①.5 Architecture Review → ADR (required for high-risk features)
② Implementation Plan → writing-plans skill
③ Implementation → subagent-driven-development skill
④ 🔥 4 Cross-Verification Passes ← the core innovation
⑤ Fix Iteration → writing-plans + subagent-driven-development
⑥ Careful Simplification → skeptical optimization with anti-pattern checklist
⑦ Doc Sync → backfill evolution log + notify downstream
The 4 AI verification passes:
| Pass | Perspective | Typical yield |
|---|---|---|
| 4.1 Systematic self-review | Author as bug hunter | 1–3 bugs |
| 4.2 Cold-context review ⭐ | AI reviewer with no design docs | 5–15 concurrency / idempotency bugs |
| 4.3 Behavior-preservation diff | Side-effect comparison vs master | 2–5 regressions |
| 4.4 Cross-repo impact scan | External services that may break | 0–3 impact points |
| 4.5 Business invariant matrix | Hard constraints: money / state / inventory | 0–2 violations |
Phase 4.2 is the highest-value step. The AI reviewer sees only the code — not the design doc assumptions baked in by the author. That's exactly where systematic concurrency and idempotency bugs hide.
Passes 4.3–4.5 are dispatched in parallel via subagents.
| Skill | What it does | When to use |
|---|---|---|
prd-to-tasks |
PRD → codebase-aware task breakdown with risk-driven workflow routing | Any PRD or requirements doc that needs to be translated into concrete engineering tasks |
cross-verified-feature-development |
7-phase workflow with 4 independent AI verification passes | Payments, state machines, distributed locks, cross-service contracts, schema migrations |
Skills in this repo use the Open Agent Skills standard format (SKILL.md + YAML frontmatter) — natively compatible with Claude Code, Codex CLI, and OpenClaw. OpenCode requires manual adaptation.
Step 1: Clone the repo
git clone https://github.com/MageByte-Zero/magebyte-power.git
export SKILLS_REPO="$PWD/magebyte-power"Step 2: Install for your platform
| Platform | Format compatibility | Skills directory |
|---|---|---|
| Claude Code | ✅ Native | ~/.claude/skills/ |
| Codex CLI | ✅ Native | ~/.agents/skills/ |
| OpenClaw | ✅ Native | ~/.openclaw/skills/ |
| OpenCode | ~/.config/opencode/agents/ |
# Claude Code
mkdir -p ~/.claude/skills
ln -sf "$SKILLS_REPO/skills/cross-verified-feature-development" \
~/.claude/skills/cross-verified-feature-development
ln -sf "$SKILLS_REPO/skills/prd-to-tasks" \
~/.claude/skills/prd-to-tasks
# Codex CLI
mkdir -p ~/.agents/skills
ln -sf "$SKILLS_REPO/skills/cross-verified-feature-development" \
~/.agents/skills/cross-verified-feature-development
ln -sf "$SKILLS_REPO/skills/prd-to-tasks" \
~/.agents/skills/prd-to-tasks
# OpenClaw
mkdir -p ~/.openclaw/skills
ln -sf "$SKILLS_REPO/skills/cross-verified-feature-development" \
~/.openclaw/skills/cross-verified-feature-development
ln -sf "$SKILLS_REPO/skills/prd-to-tasks" \
~/.openclaw/skills/prd-to-tasksCross-platform tip:
~/.agents/skills/is the standard user-scoped directory in the Open Agent Skills ecosystem. Claude Code, Codex CLI, and OpenClaw all scan it automatically — install once, works across all three.
Step 3 (recommended): Install Superpowers
Both skills in this repo route to Superpowers sub-skills for downstream phases:
prd-to-tasks— Phase 5 routes 🟡 High / 🟢 Standard tasks tosuperpowers:brainstorming/writing-plans/subagent-driven-developmentcross-verified-feature-development— Phases ①②③ orchestrate the same Superpowers sub-skills
Install once (Claude Code):
claude mcp add --transport http superpowers https://superpowers.anthropic.com/mcpSkipping Superpowers is fine — every phase has a native-tool fallback documented inside each SKILL.md. You only lose the polished prompts that Superpowers provides.
OpenCode adaptation (click to expand)
OpenCode uses its own agent file format and does not read SKILL.md directly.
Copy the skill content as a system prompt into an OpenCode agent file:
mkdir -p ~/.config/opencode/agents
# Create ~/.config/opencode/agents/cross-verified.md
# Example frontmatter:
# ---
# description: 7-phase high-stakes feature workflow with 4 independent AI verification passes
# mode: primary
# ---
# (paste SKILL.md body as the system prompt)See opencode.ai/docs/agents for the full format spec.
Verify:
# Claude Code
ls ~/.claude/skills/cross-verified-feature-development/SKILL.md
ls ~/.claude/skills/prd-to-tasks/SKILL.md
# Codex CLI / OpenClaw (shared path)
ls ~/.agents/skills/cross-verified-feature-development/SKILL.md
ls ~/.agents/skills/prd-to-tasks/SKILL.mdUsage:
# prd-to-tasks — paste a PRD link or describe the requirement
"help me break down this PRD into tasks"
"turn this requirements doc into engineering tasks"
# cross-verified-feature-development
/cross-verified-workflow implement idempotent refund API with distributed lockPRD documents describe what to build. They don't tell you where in the codebase to build it, which services are affected, or what patterns are required. The gap between "PM handed me a doc" and "I have executable tasks with file paths and verify commands" is where hours disappear.
This skill closes that gap with a 6-phase structured pipeline + a user-evolved knowledge base:
Phase 0: PRD Ingestion → Lark / pasted text / local file
Phase 1: Scope + PM Audit → 10-item PM scope checklist + 🔴/🟡/🟢 risk classification with audit trail (HARD-GATE)
Phase 2: Codebase Scan → LSP-first; confidence-tagged affected scope; propose new KB entries
Phase 3: Spec Generation → YAML frontmatter + Boundaries (Always/AskFirst/Never) + Invariants + Open Questions table (HARD-GATE)
Phase 4: Task Breakdown → XS–XL sizing + spec-refs back-references + Mermaid DAG + plan frontmatter (HARD-GATE)
Phase 5: Workflow Routing → Plan-header `routed-to:` machine-readable contract → downstream skill
+ Knowledge Base → 3-tier loading: project / user / skill-seed (~/.claude/prd-to-tasks/)
- HARD-GATE at Phase 1 / 3 / 4 — phases advance only on explicit approval ("approve Phase N" / "确认 Phase N"); conversational "ok" doesn't count.
- 10-item PM scope-clarity checklist drives Phase 1 — JTBD, Not Doing, Deferred-to-v2, worst-case failure, rollback trigger, success metrics, …
- Boundaries triad in Spec — Always-do / Ask-first / Never-do entries with stable IDs (B-Always-N, B-Never-N), back-referenced by tasks.
- Spec-refs on every task —
spec-refs:maps tasks back to the spec sections / Boundaries / Invariants they implement; Phase 4 gate self-check enforces completeness. - XS–XL sizing + Mermaid task DAG — critical path and parallelizable batches visible at a glance; XL must justify its rejection of further splitting.
- Plan frontmatter routing —
routed-to: cross-verified-feature-development | superpowers:writing-plans | directis a machine-readable hand-off contract, not a verbal recommendation. - Evolvable Knowledge Base —
repo-map.md/service-patterns.mdlive in~/.claude/prd-to-tasks/(per-user) or<repo>/.claude/prd-to-tasks/(team-shared via git). Phase 2 proposes new mappings with explicit user approval — never silent writes.
Generic spec tools produce: "implement user login → modify UserService"
This skill produces:
In the
CreateBookingmethod atorder-service/internal/service/booking/booking_service.go:142, add a feature flag check (key from your project's flag conventions), generate the primary key with your project's centralized ID helper (documented in your KB'sservice-patterns.md), write to the DB, then follow your project's cache invalidation discipline — finally run your project'sbuild+testcommands.
The skill knows about "your project's centralized ID helper" and "your project's cache invalidation discipline" because you've documented them once in ~/.claude/prd-to-tasks/service-patterns.md. Every future task inherits that knowledge automatically.
Phase 5 writes the decision to the plan file's frontmatter routed-to: field — downstream skills consume by fixed path, not by verbal hand-off.
| Risk level | Criteria | routed-to: |
|---|---|---|
| 🔴 Critical | Financial flows, state machines, distributed locks, MQ contracts, schema migration | cross-verified-feature-development |
| 🟡 High | ≥ 3 person-days, multi-repo, core path | superpowers:writing-plans → subagent-driven-development |
| 🟢 Standard | New endpoints, stateless, single repo, < 3 days | direct (you read spec + tasks and implement) |
The bundled references/*.md files are industry-neutral seeds. Your actual codebase mappings should live in your local KB:
mkdir -p ~/.claude/prd-to-tasks
cp ~/.claude/skills/prd-to-tasks/references/*.md ~/.claude/prd-to-tasks/
# Then edit to reflect your stack: services, ID helpers, cache discipline, MQ topology, idempotency conventionsThe skill auto-detects this directory on Phase 0 and prefers it over the bundled seed. Team-shared mappings can live in <repo>/.claude/prd-to-tasks/ (project-level — takes priority over user-level).
| File | Content |
|---|---|
references/repo-map.md |
Generic e-commerce service map — replace with your real service names |
references/service-patterns.md |
6 principle templates (ID generation / cache invalidation / MQ topic registration / distributed locks / idempotency / shared contracts) — fill in your project's actual helpers |
Code Review has a structural blind spot: every reviewer shares the same design assumptions as the author. The more context you give reviewers, the harder it becomes for them to spot holes in your belief system. This is why the concurrency race in a payment refund API passed two reviewers — including the author himself.
The fix: multiple independent perspectives. N reviewers who don't share context produce a bug set that approaches a union, not a duplicate. This workflow operationalizes that insight:
Design → Implement → 4 independent AI verification passes → Fix → Simplify → Sync docs
① Requirements & Design → superpowers:brainstorming
①.5 Architecture Review → ADR (required for high-risk features)
② Implementation Plan → superpowers:writing-plans
③ Implementation → superpowers:subagent-driven-development
④ 🔥 4 Cross-Verification Passes ← the core innovation
⑤ Fix Iteration → writing-plans + subagent-driven-development
⑥ Careful Simplification → skeptical optimization with anti-pattern checklist
⑦ Doc Sync → backfill evolution log + notify downstream
Each phase has explicit Exit Criteria — a checklist that must pass before moving forward.
| Metric | Standard workflow | This workflow |
|---|---|---|
| Extra time | — | +40–50% |
| Critical bug detection rate | ~40% | ~95% |
| Phase 4.2 typical yield | 0 | 5–15 High/Critical bugs |
40% more time for a 55-point improvement in bug detection rate. In payment and core state machine features, that trade-off is worth it.
| File | When to read |
|---|---|
references/cross-verification-techniques.md |
Before Phase 4 — agent prompt templates for all 4 passes |
references/anti-patterns.md |
Before Phase 6 — 12 high-frequency trap patterns |
references/doc-sync-playbook.md |
Before Phase 7 — structured doc backfill playbook |
references/case-studies.md |
Optional — real-world bug museum from e-commerce order domain |
The best contributions are workflows proven in production — not perfect methodology, just processes that solved real problems.
mkdir -p skills/<skill-name>/references
# Create skills/<skill-name>/SKILL.md with name + description frontmatter
# Test on real tasks, then open a PR describing the domain and the problem it solves📖 How I distilled 4 years of production incidents into a Claude Code Skill (Chinese) — the production outage that started this, the "context contamination" insight, and the full design rationale.


