I build tools and products around the awkward parts of working with AI agents: vague briefs, fake "done" claims, stale memory, repeated behaviour failures, and the strange tendency of coding agents to produce shiny nonsense unless you box them in properly.
The flagship product direction is WrenLore: a knowledge and memory layer for teams that want AI to work against source-backed company context rather than a pile of chat history and hope.
Around that, I keep a set of small operating tools for AI agent workflows: clearer briefs, evidence-backed completion, evals for repeated failures, source-backed memory, and frontend guardrails.
| Repo | Use it when |
|---|---|
| Proof Loop | a coding task needs evidence before anyone calls it done |
| Sovereign Brain | long-running agents need source-backed memory and freshness review |
| WrenLore | teams need the product-grade version of source-backed knowledge and agent memory |
| Loopsmith | the same agent failure keeps recurring and should become an eval |
| Brief Master | the task is still fuzzy and needs to become a precise agent brief |
| no-slop-ui | frontend agents need guardrails against generic AI UI sludge |
The workflow is simple: write a better brief, freeze the acceptance criteria, verify the work with evidence, turn repeated failures into evals, and keep durable decisions attached to sources.
Most of this came from running agents on real work and getting tired of confident final messages that were only half true.


