🐙 Coding Tentacle v0.9.0

Safety-first guardian layer that controls LLM code-fixing agents.
OpenCode writes fixes. CT analyzes, reviews, blocks danger, requires human approval, and learns from every run.

Why Coding Tentacle?

OpenCode, Codex, and Claude Code are brilliant at generating code. But they have zero safety guarantees. They can output DROP TABLE, eval(user_input), or rm -rf / — and nothing stops them.

Coding Tentacle sits in front of any LLM fix engine and acts as a guardian.

🛡️ Safety VETO	Blocks dangerous patterns (SQL injection, eval, shell commands) — before execution. Base64 and HTML-encoded payloads are decoded and caught.
🔍 SkepticBrain	Adversarial review of every fix. "Why could this be wrong?" Risk score, objections, recommendation.
🧠 Self-Learning	BLM stores every bug experience. EngineLearning calibrates trust per engine + bug type. Later runs get better context.
🔗 Engine Router	Routes bugs to the best engine. OpenCode primary. Ollama fallback. Codex (API key needed). Bug-type-specific trust routing.
👤 Human Approval	Every fix requires human APPROVE/REJECT/REQUEST_CHANGES. Safety VETO can NEVER be overridden — even by humans.
📊 Impact Analysis	Predicts which files, tests, skills, and procedures are affected by a change. Risk score before approval.

Architecture

                    ┌─────────────────────────┐
                    │    Coding Tentacle       │
                    │    ┌───────────────────┐ │
  Bug Report ──────►│    │  Safety VETO 🛡️   │ │
                    │    │  SkepticBrain 🔍  │ │
                    │    │  Engine Router 🔗  │ │
                    │    │  Trust Calibration │ │
                    │    │  Learning Loop 🧠  │ │
                    │    └───────┬───────────┘ │
                    │            │             │
                    │    APPROVE / REJECT      │
                    │    / REQUEST_CHANGES     │
                    └────────────┬────────────┘
                                 │
                    ┌────────────▼────────────┐
                    │   Fix Engines            │
                    │   ┌──────┐ ┌──────────┐ │
                    │   │OpenCode│ │ Ollama  │ │
                    │   └──────┘ └──────────┘ │
                    └─────────────────────────┘

Quick Start

git clone https://github.com/nessos666/coding-tentacle.git
cd coding-tentacle

# Verify everything works
python3 scripts/full_regression.py
# → ✅ RC2 ALL TESTS PASSED

# Analyze a bug with full pipeline
python3 -c "
from coding_tentacle.orchestrator.shadow_mode import ShadowModeRunner, GitHubIssueRun
from coding_tentacle.orchestrator.metabrain import MetaBrain, SafetyBrain
from coding_tentacle.safety.inhibitory_control import InhibitoryControl
from coding_tentacle.knowledge.security_store import create_seed_security_store
from coding_tentacle.orchestrator.engine_router import EngineRouter
from coding_tentacle.orchestrator.skeptic_brain import SkepticBrain
from coding_tentacle.safety.approval_gate import ApprovalGate

sec = create_seed_security_store()
ic = InhibitoryControl(security_store=sec)
safety = SafetyBrain(ic=ic, security_store=sec)
mb = MetaBrain(safety=safety)
er = EngineRouter(); er.check_health()
sb = SkepticBrain(); ag = ApprovalGate()

runner = ShadowModeRunner(meta_brain=mb, engine_router=er,
                          approval_gate=ag, skeptic_brain=sb,
                          safety_brain=safety)

r = runner.analyze_issue(GitHubIssueRun(
    'https://github.com/user/repo', '#1',
    'NullPointer in views.py',
    'NoneType has no attribute at line 42'))

print(f'Bug Type: {r.detected_bug_type}')
print(f'Engine:   {r.engine_used}')
print(f'Safety:   {\"BLOCKED\" if r.safety_events else \"OK\"}')
print(f'Skeptic:  risk={r.skeptic_risk:.2f} {r.skeptic_recommendation}')
print(f'Approval: {r.approval_status}')
print(f'BLM:      {\"Learned\" if r.blm_written else \"Error: \" + r.blm_error}')
"

Kombinationen

CT mit	Ergebnis
CT + OpenCode	✅ Empfohlen. OpenCode (deepseek-v4-pro) erzeugt Fix. CT prüft + lernt.
CT + Claude Code	✅ Top-tier. Claude Code (2.1.86) — alternativ zu OpenCode.
CT + Ollama	🔵 Fallback. granite3.2-vision lokal. Langsamer, offline-fähig.
CT + Codex	⚠️ Braucht OpenAI API-Key.
CT alleine	❌ Klassifiziert Bugs, erzeugt Template-Fixes (keine echte Reparatur).

Was passiert im Hintergrund?

1. Bug → CT klassifiziert (18 Typen)
2. Safety check: DROP TABLE? eval()? → BLOCK
3. EngineRouter wählt OpenCode/Ollama
4. Engine erzeugt echten Code-Diff
5. CT scannt Diff auf Gefahren (Base64/HTML-decodiert)
6. SkepticBrain: "Warum könnte das falsch sein?"
7. Sandbox testet isoliert (Originale UNVERÄNDERT)
8. HumanApprovalGate: APPROVE/REJECT/REQUEST_CHANGES
9. BLM speichert, EngineLearning kalibriert Vertrauen
10. Nächster Bug bekommt ähnliche Erfahrungen im Prompt

Pipeline (Shadow Mode)

  GitHub Issue
      │
      ▼
  ┌─────────────┐
  │ Classifier   │  18 bug types, 100% accuracy
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ SafetyBrain  │  VETO: DROP TABLE, eval(), system() → BLOCKED
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ EngineRouter │  OpenCode primary, Ollama fallback
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ Fix Engine   │  Generates real code diff
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ Safety scan  │  Scans engine output for dangerous patterns
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ SkepticBrain │  "Why could this fix be WRONG?"
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ Sandbox      │  Isolated test. Original files NEVER touched.
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ ApprovalGate │  APPROVE / REJECT / REQUEST_CHANGES
  └──────┬──────┘
         ▼
  ┌─────────────┐
  │ BLM + Trust  │  Store experience + update engine trust
  └─────────────┘

CT vs The World

Feature	CT	Codex	Devin	Claude Code	OpenHands
Safety VETO	✅	❌	❌	❌	❌
SkepticBrain	✅	❌	❌	❌	❌
Bayesian Trust	✅	❌	❌	❌	❌
Human Approval	✅	⚠️	⚠️	⚠️	❌
Self-Learning	✅	❌	❌	❌	❌
Bug Classification	✅	❌	❌	❌	❌
Engine Router	✅	❌	❌	❌	⚠️
Impact Analysis	✅	❌	❌	❌	❌
Open Source	✅	❌	❌	❌	✅
Cost/Task	$0	$12	$500/mo	$20	$0
SWE-bench	N/A	88.7%	87%	95.5%	65%

CT is not a competitor. CT is the safety layer that controls them.

What CT Is NOT

❌ Not a replacement for Codex, Devin, or Claude Code
❌ Not an autonomous bug fixer (requires OpenCode/Ollama for code generation)
❌ Not production-ready (Research / Shadow Release)

What CT IS

✅ Safety-first guardian that controls LLM fix engines
✅ Self-learning bug analysis system
✅ The only agent with Safety VETO + SkepticBrain + Bayesian Trust
✅ 100% open source, zero API costs

Requirements

Python 3.10+
OpenCode CLI (opencode) — for actual code fixing
Ollama + granite3.2-vision — for local fallback
No API keys required

Community

Found a bug? Open an issue
Want to contribute? CONTRIBUTING.md
Security concern? SECURITY.md

License

MIT — free, open source, no restrictions.

_{Built by David + Hermes. June 2026. 🦑}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github		.github
scripts		scripts
src/coding_tentacle		src/coding_tentacle
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
banner.png		banner.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🐙 Coding Tentacle v0.9.0

Why Coding Tentacle?

Architecture

Quick Start

Kombinationen

Was passiert im Hintergrund?

Pipeline (Shadow Mode)

CT vs The World

What CT Is NOT

What CT IS

Requirements

Community

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🐙 Coding Tentacle v0.9.0

Why Coding Tentacle?

Architecture

Quick Start

Kombinationen

Was passiert im Hintergrund?

Pipeline (Shadow Mode)

CT vs The World

What CT Is NOT

What CT IS

Requirements

Community

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages