Skip to content
View LeoStehlik's full-sized avatar

Block or report LeoStehlik

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
LeoStehlik/README.md

Leo Stehlik

I build tools and products around the awkward parts of working with AI agents: vague briefs, fake "done" claims, stale memory, repeated behaviour failures, and the strange tendency of coding agents to produce shiny nonsense unless you box them in properly.

The flagship product direction is WrenLore: a knowledge and memory layer for teams that want AI to work against source-backed company context rather than a pile of chat history and hope.

Around that, I keep a set of small operating tools for AI agent workflows: clearer briefs, evidence-backed completion, evals for repeated failures, source-backed memory, and frontend guardrails.

Agent Operating Tools

Repo Use it when
Proof Loop a coding task needs evidence before anyone calls it done
Sovereign Brain long-running agents need source-backed memory and freshness review
WrenLore teams need the product-grade version of source-backed knowledge and agent memory
Loopsmith the same agent failure keeps recurring and should become an eval
Brief Master the task is still fuzzy and needs to become a precise agent brief
no-slop-ui frontend agents need guardrails against generic AI UI sludge

The workflow is simple: write a better brief, freeze the acceptance criteria, verify the work with evidence, turn repeated failures into evals, and keep durable decisions attached to sources.

Most of this came from running agents on real work and getting tired of confident final messages that were only half true.

Pinned Loading

  1. proof-loop proof-loop Public

    Repo-local verification protocol for AI coding agents: acceptance criteria, separate verifier roles, proof artifacts, and evidence-backed done claims.

    Python 3

  2. decoupled-agent-memory decoupled-agent-memory Public

    Source-backed memory for long-running AI agents: maintained synthesis, freshness review, evidence links, and MCP/API access.

    Python 2

  3. loopsmith loopsmith Public

    Eval and promotion harness for AI agents: compare baseline vs candidate behaviour and promote only changes that survive evidence.

    Python

  4. brief-master brief-master Public

    Agent brief writer for AI coding workflows: turns fuzzy requests into precise briefs with acceptance criteria, constraints, and verification steps.

    Python 1

  5. no-slop-ui no-slop-ui Public

    Frontend design rules for AI coding agents: prevents generic AI UI slop, glassmorphism, gradient abuse, and unusable dashboards.

    Python

  6. wrenlore/wrenlore wrenlore/wrenlore Public

    Self-hosted knowledge infrastructure for humans and AI agents.

    TypeScript 1