memory-purifier

Post-consolidation, non-destructive canonicalization layer for OpenClaw memory substrates. Sits between the consolidators (reflections + native dreaming) and the downstream wiki reconciler. Reads consolidated lower-substrate memory, emits purified machine-facing artifacts and human-facing markdown views, leaves wiki reconciliation downstream.

Non-goals: it is not a consolidator, not the reconciler, not authority control, not a turn-time layer. It fires on cron or explicit operator command only.

What it does

Runs a cron-driven pipeline of deterministic scripts with two scoring passes (narrow schema-bound model calls; the orchestrating agent lives at the cron-supervisor level):

Pass 1 — Promotion scoring: for each candidate memory unit, decide survival (reject / defer / compress / merge / promote).
Pass 2 — Canonicalization: for each surviving cluster, assign canonical wording, one primary home, provenance, contradiction state, and freshness/confidence posture.

Scripts own orchestration, file I/O, validation, and retry. Prompts own semantic judgment only.

How scoring works

The purifier does not "summarize memory"; it runs deterministic orchestration around two narrow scoring passes. The model's job is bounded judgment within a schema; the scripts' job is everything else (sourcing, batching, validation, retry, merging, routing, manifest commit). Scores drive bounded decisions, not open-ended behavior.

Pass 1 — Promotion scoring

For every extracted candidate memory unit, Pass 1 emits a six-dimension score profile and a single verdict. Scripts then route the candidate: survivors enter Pass 2, non-survivors land in rejected-candidates.jsonl / deferred-candidates.jsonl.

Scoring dimensions (each ∈ [0.0, 1.0]):

| Dimension | What it measures |
| --- | --- |
| `durability` | How long this memory unit stays true: minutes vs. months vs. permanent |
| `future_judgment_value` | Whether knowing this later will change a decision |
| `action_value` | Whether it unlocks or constrains specific future actions |
| `identity_relationship_weight` | Whether it anchors who/what the operator is or relates to |
| `cross_time_persistence` | Expected recurrence across unrelated contexts |
| `noise_risk` | Penalty: how likely this is a one-off artifact or transient chatter |

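Strength is a deterministic re-derivation: the sum of the first five dimensions minus noise_risk, i.e. strength = durability + future_judgment_value + action_value + identity_relationship_weight + cross_time_persistence − noise_risk. The validator recomputes it and rejects Pass 1 output whose emitted strength drifts more than ±0.01 from the formula, so the model can't freelance on scoring math.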
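The strength check can be sketched as follows. This is a minimal illustration, not the package's validator: the function names `derive_strength` and `strength_is_consistent` are hypothetical, and only the six dimension names, the formula, and the ±0.01 tolerance come from the text above.

```python
# Hypothetical sketch of the Pass 1 strength re-derivation (not the package's code).
POSITIVE_DIMENSIONS = (
    "durability",
    "future_judgment_value",
    "action_value",
    "identity_relationship_weight",
    "cross_time_persistence",
)
TOLERANCE = 0.01  # maximum allowed drift between emitted and recomputed strength


def derive_strength(scores: dict) -> float:
    """Sum the five positive dimensions and subtract the noise_risk penalty."""
    for name in (*POSITIVE_DIMENSIONS, "noise_risk"):
        value = scores[name]
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name}={value} is outside [0.0, 1.0]")
    return sum(scores[name] for name in POSITIVE_DIMENSIONS) - scores["noise_risk"]


def strength_is_consistent(scores: dict, emitted_strength: float) -> bool:
    """Return True when the emitted strength is within tolerance of the formula."""
    # A tiny epsilon absorbs floating-point rounding at the tolerance boundary.
    return abs(emitted_strength - derive_strength(scores)) <= TOLERANCE + 1e-9
```

With every dimension in [0.0, 1.0], the derived strength always falls in [-1.0, 5.0].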

Verdicts (mutually exclusive):

| Verdict | Meaning | Next step |
| --- | --- | --- |
| `reject` | Noise, transient, zero downstream value | Persisted to `rejected-candidates.jsonl` for audit; never re-considered |
| `defer` | Not enough signal yet; revisit when a later candidate provides context | Persisted to `deferred-candidates.jsonl`; natural re-promotion on future runs |
| `compress` | Duplicate/redundant with another candidate in the same batch | Folded into the `compress_target` (required field); source candidate is subsumed |
| `merge` | Semantically linked with one or more siblings; becomes one cluster | Joined via `merge_candidate_ids[]`; survives together into Pass 2 |
| `promote` | Stands on its own; goes to canonicalization | Survives into Pass 2 as a single-candidate cluster |
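The verdict table implies a few field rules that scripts can check without the model. The sketch below is illustrative only: the record layout (a `verdict` key next to `compress_target` and `merge_candidate_ids`) and the function names are assumptions, and the route labels are placeholders for the steps in the table.

```python
# Hypothetical sketch of verdict-field checks and routing; the record layout is assumed.
VERDICTS = {"reject", "defer", "compress", "merge", "promote"}


def check_verdict(record: dict) -> list:
    """Return a list of problems; an empty list means the record passes these checks."""
    verdict = record.get("verdict")
    if verdict not in VERDICTS:
        return [f"unknown verdict: {verdict!r}"]
    problems = []
    if verdict == "compress" and not record.get("compress_target"):
        problems.append("compress requires compress_target")
    if verdict != "compress" and record.get("compress_target"):
        problems.append("compress_target is only valid for compress")
    if verdict != "merge" and record.get("merge_candidate_ids"):
        problems.append("merge_candidate_ids is only valid for merge")
    return problems


def route(record: dict) -> str:
    """Map a validated verdict to its next step (labels are illustrative)."""
    return {
        "reject": "rejected-candidates.jsonl",
        "defer": "deferred-candidates.jsonl",
        "compress": "folded into compress_target",
        "merge": "pass-2",
        "promote": "pass-2",
    }[record["verdict"]]
```

Keeping these checks in script code means a malformed verdict is caught by validation rather than silently routed.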

Pass 2 — Canonicalization and adjudication

For each surviving cluster, Pass 2 produces one canonical claim with an eight-dimension score profile plus explicit routing/reuse/supersession decisions:

| Dimension | What it measures |
| --- | --- |
| `semantic_cluster_confidence` | How confident we are that the clustered candidates genuinely belong together |
| `canonical_clarity` | How crisp and unambiguous the canonical wording is |
| `provenance_strength` | Quality + quantity of source pointers (direct > inferred > merged) |
| `contradiction_pressure` | How much the cluster disagrees with active prior claims |
| `freshness` | How recent the underlying material is |
| `confidence` | Model's posterior confidence in the canonical text |
| `route_fitness` | How well the chosen `primary_home` matches the claim type |
| `supersession_confidence` | If the claim supersedes a prior, how confident the link is |

Deterministic scripts do the load-bearing decisions

The scoring passes produce structured judgment; deterministic script code downstream uses bounded, rerunnable rules to turn that judgment into artifact state. Examples:

Reuse — (normalized_subject, normalized_predicate, normalized_home) triple match against active prior claims → stable id reuse; multi-match broken by most-recent updatedAt.
Probable-duplicate — graded similarity (composite 0.4 × subject + 0.2 × predicate + 0.2 × home + 0.2 × text_jaccard + 0.1 object bonus) above a 0.80 threshold → flag as probable_duplicate with back-pointer; wiki reconciler decides final collapse.
Route affinity — deterministic (type, home) table decides strong / acceptable / suspicious / impossible routing; impossible pairs hard-fail, suspicious ones warn.
Prior-claim retrieval for Pass 2 — per-cluster top-K hard floor + global cap + same-subject bonus window + bounded contradiction-pressure widening → rerun-deterministic.
Supersession-chain sanity — warn (but don't auto-chain) when a new claim supersedes an already-superseded prior; reconciliation-mode Pass 2 resolves in its normal course.
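Two of the rules above, reuse and probable-duplicate detection, are simple enough to sketch. This is a hedged illustration rather than the package's code. The weights, the 0.80 threshold, and the most-recent `updatedAt` tie-break come from the list; everything else is an assumption: the claim field names, exact-match scoring for subject/predicate/home, whitespace tokenization for the Jaccard term, a bonus applied only when both objects are present and equal, and `updatedAt` values that are ISO-8601 strings in one consistent format.

```python
# Hypothetical sketch of the reuse and probable-duplicate rules; field names are assumed.
from typing import Optional

DUPLICATE_THRESHOLD = 0.80


def find_reusable_id(new: dict, active_priors: list) -> Optional[str]:
    """Reuse a prior id on an exact (subject, predicate, home) triple match."""
    key = (new["subject"], new["predicate"], new["home"])
    matches = [p for p in active_priors
               if (p["subject"], p["predicate"], p["home"]) == key]
    if not matches:
        return None
    # Multi-match is broken by the most recent updatedAt (string comparison
    # is valid only under the ISO-8601 same-format assumption).
    return max(matches, key=lambda p: p["updatedAt"])["id"]


def jaccard(a: str, b: str) -> float:
    """Token-set Jaccard similarity over lowercased whitespace tokens."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a and not tokens_b:
        return 1.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def composite_similarity(new: dict, prior: dict) -> float:
    """Weighted composite: 0.4 subject + 0.2 predicate + 0.2 home + 0.2 text + 0.1 bonus."""
    score = (
        0.4 * float(new["subject"] == prior["subject"])
        + 0.2 * float(new["predicate"] == prior["predicate"])
        + 0.2 * float(new["home"] == prior["home"])
        + 0.2 * jaccard(new["text"], prior["text"])
    )
    if new.get("object") is not None and new.get("object") == prior.get("object"):
        score += 0.1  # object bonus
    return score


def is_probable_duplicate(new: dict, prior: dict) -> bool:
    """Flag only; the wiki reconciler decides the final collapse."""
    return composite_similarity(new, prior) > DUPLICATE_THRESHOLD
```

The source does not say whether the composite is capped at 1.0 once the object bonus is added, so this sketch leaves it uncapped.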

Scores inform decisions; scripts enforce the boundaries. Neither replaces the other.

Inputs

Read-only, at <workspace>/:

MEMORY.md (OpenClaw native dreaming)
RTMEMORY.md, PROCEDURES.md, episodes/*.md (reflections-hybrid)
CHRONICLES.md, DREAMS.md (personal profile only)

Never read: memory/*.md (raw logs), authority docs (CONSTITUTION.md etc.), already-purified outputs, other packages' runtime state. Full boundary: references/source-contract.md.

Outputs

Machine artifacts — authoritative, at <workspace>/runtime/:

purified-claims.jsonl, purified-contradictions.jsonl
purified-entities.json, purified-routes.json
purified-manifest.json, purifier-last-run-summary.json
deferred-candidates.jsonl, rejected-candidates.jsonl

Human markdown views — derived from artifact state, at <workspace>/:

LTMEMORY.md, PLAYBOOKS.md, EPISODES.md (always)
HISTORY.md, WISHES.md (personal profile only)

Views are regenerated every run. Do not edit them by hand.

Telemetry + reports:

Shared memory-log (primary) — ~/.openclaw/telemetry/memory-log-YYYY-MM-DD.jsonl, append-only. Every run appends one event with domain: "memory", component: "memory-purifier.purifier", and event ∈ {run_started, run_completed, run_skipped, run_failed}. The log is shared across memory plugins (reflections consolidator, purifier, etc.) so filters on component / domain cross-query the whole memory stack.
Latest-run report — ~/.openclaw/telemetry/memory-purifier/last-run.md, overwritten each run. Deterministic operator-facing markdown (run id, counts, warnings, downstream flag, token-usage line). Not a log — a convenience surface.
Package telemetry dir — holds last-run.md only; no per-package JSONL is written.
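As a rough sketch of the shared-log append described above: the `domain`, `component`, and `event` values and the file-name pattern come from the text, while the function name, the `ts` and `run_id` fields, and the use of the UTC date for the file name are assumptions made for illustration.

```python
# Hypothetical sketch of appending one run event to the shared memory-log.
import json
from datetime import datetime, timezone
from pathlib import Path

ALLOWED_EVENTS = {"run_started", "run_completed", "run_skipped", "run_failed"}


def append_memory_event(telemetry_root: Path, event: str, run_id: str) -> Path:
    """Append a single JSON line to memory-log-YYYY-MM-DD.jsonl and return its path."""
    if event not in ALLOWED_EVENTS:
        raise ValueError(f"unsupported event: {event!r}")
    now = datetime.now(timezone.utc)  # assumption: file date follows UTC
    log_path = telemetry_root / f"memory-log-{now:%Y-%m-%d}.jsonl"
    record = {
        "ts": now.isoformat(),        # assumed field
        "domain": "memory",
        "component": "memory-purifier.purifier",
        "event": event,
        "run_id": run_id,             # assumed field
    }
    telemetry_root.mkdir(parents=True, exist_ok=True)
    # Append mode keeps the log append-only; existing lines are never rewritten.
    with log_path.open("a", encoding="utf-8") as handle:
        handle.write(json.dumps(record) + "\n")
    return log_path


def events_for_component(log_path: Path, component: str) -> list:
    """Filter one day's shared log down to a single component."""
    with log_path.open(encoding="utf-8") as handle:
        records = [json.loads(line) for line in handle if line.strip()]
    return [r for r in records if r.get("component") == component]
```

Because every memory plugin writes to the same file, a filter on `component` isolates the purifier while a filter on `domain` spans the whole stack.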

Token usage is scoring-pass-only: only Pass 1 and Pass 2 model invocations contribute. Deterministic script work (discover, scope, extract, cluster, assemble, render, validate) is never counted. The runtime reports source: "exact" when the provider returns usage metadata (e.g. anthropic-sdk), "approximate" when computed from actual prompt/completion char counts, "unavailable" when no real model was invoked (fixture-backed runs).
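The three `source` values reduce to a small decision, sketched here under assumptions: the function name and its two parameters are hypothetical, and only the three labels and the conditions that select them are taken from the paragraph above.

```python
# Hypothetical sketch of choosing the token-usage source label.
from typing import Optional


def token_usage_source(model_invoked: bool, provider_usage: Optional[dict]) -> str:
    """Pick the label reported alongside token counts for a run."""
    if not model_invoked:
        return "unavailable"   # fixture-backed run: no real model call happened
    if provider_usage:
        return "exact"         # the provider returned usage metadata
    return "approximate"       # fall back to prompt/completion char counts
```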

Reporting modes (in <workspace>/runtime/memory-state.json under memoryPurifier.reporting):

enabled: bool — hard gate; false (default) means fully silent in chat regardless of mode. Seeded at install time from --cron-announce.
mode: "silent" | "summary" | "full" — when enabled, shapes chat output
- silent — nothing in chat
- summary (default) — one-line compact report
- full — bullet-list report with a Token Usage block (omitted when tokenUsage.source == "unavailable")

Telemetry and last-run.md are written regardless of reporting settings — only chat delivery is gated.

Cron delivery mode (--no-deliver vs announce) and reporting.enabled must agree for chat output to actually reach the operator. scripts/sync_cron_delivery.py is the single deterministic actor that reconciles drift; the cron supervisor prompts call it each fire so the next run is aligned. It reads only the one boolean — the full config is never loaded into the prompt's context.
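The gating and drift logic described above can be sketched as follows. This is not `scripts/sync_cron_delivery.py`; the function names and the `"announce"` / `"no-deliver"` labels are placeholders chosen for illustration, and the defaults mirror the ones documented in the list of reporting modes.

```python
# Hypothetical sketch of the chat gate and the cron-delivery drift check.
from typing import Optional

MODES = {"silent", "summary", "full"}


def chat_mode(reporting: dict) -> Optional[str]:
    """Return the report shape to send to chat, or None when chat stays silent."""
    if not reporting.get("enabled", False):   # hard gate; default is false
        return None
    mode = reporting.get("mode", "summary")   # summary is the documented default
    if mode not in MODES:
        raise ValueError(f"unknown reporting mode: {mode!r}")
    return None if mode == "silent" else mode


def expected_cron_delivery(reporting: dict) -> str:
    """Delivery setting the cron job needs in order to agree with reporting.enabled."""
    return "announce" if reporting.get("enabled", False) else "no-deliver"


def delivery_drifted(reporting: dict, current_delivery: str) -> bool:
    """True when the registered cron delivery disagrees with the reporting gate."""
    return current_delivery != expected_cron_delivery(reporting)
```

Note that only the `enabled` boolean feeds the drift check, which matches the statement that the sync script reads just that one value.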

Runtime split

| Layer | Created when | Committed to repo? |
| --- | --- | --- |
| Install-time seeded (control-plane JSON) | `install.sh` creates them empty-but-valid: `purifier-metadata.json`, `purified-manifest.json`, `purifier-last-run-summary.json`, `locks/` | No (seeded into the live workspace) |
| First-run populated (live artifacts) | The first successful `run_purifier.py` creates them: `purified-claims.jsonl`, `purified-contradictions.jsonl`, `purified-entities.json`, `purified-routes.json`, `deferred-candidates.jsonl`, `rejected-candidates.jsonl` | No (live runtime state) |
| Markdown views | Each successful run regenerates them atomically from artifact state | No (derived, not source) |

Cadence

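Default timezone: Asia/Manila (overridable via --cron-tz). Incremental expressions that share a time slot with reconciliation exclude Wed + Sun, so reconciliation owns that slot on those days without collision; the personal evening run fires daily at a separate time.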

| Profile | Incremental | Reconciliation |
| --- | --- | --- |
| `business` | `15 13 * * 1,2,4,5,6` (Mon/Tue/Thu/Fri/Sat 13:15) | `15 13 * * 3,0` (Wed/Sun 13:15) |
| `personal` | `15 5 * * 1,2,4,5,6` + `15 17 * * *` (morning excludes Wed/Sun; evening daily) | `15 5 * * 3,0` (Wed/Sun 05:15) |
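A quick way to confirm that the expressions in the table never collide is to compare their time slots and weekday sets. The helper below is a minimal sketch with hypothetical names; it only understands `*` and comma-separated lists in the day-of-week field and ignores the day-of-month and month fields, which is enough for the expressions shown here.

```python
# Hypothetical sketch: check that two cron expressions never fire at the same moment.
def weekdays(expr: str) -> set:
    """Parse the day-of-week field of a five-field cron expression (0 or 7 = Sunday)."""
    field = expr.split()[4]
    if field == "*":
        return set(range(7))
    return {int(part) % 7 for part in field.split(",")}


def slot(expr: str) -> tuple:
    """Return the (hour, minute) at which the expression fires."""
    minute, hour = expr.split()[:2]
    return (int(hour), int(minute))


def collides(a: str, b: str) -> bool:
    """Two expressions collide when they share a time slot on at least one weekday."""
    return slot(a) == slot(b) and bool(weekdays(a) & weekdays(b))


SCHEDULES = {
    "business": (["15 13 * * 1,2,4,5,6"], "15 13 * * 3,0"),
    "personal": (["15 5 * * 1,2,4,5,6", "15 17 * * *"], "15 5 * * 3,0"),
}

for profile, (incremental, reconciliation) in SCHEDULES.items():
    clash = any(collides(expr, reconciliation) for expr in incremental)
    print(profile, "collision" if clash else "no collision")
```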

Cron fires a short launcher message ("Run memory purifier. Read <prompt path> and follow every step strictly.") pointing at the step-by-step execution prompt (prompts/incremental-purifier-prompt.md or prompts/reconciliation-purifier-prompt.md). The prompt runs scripts/run_purifier.py as the orchestrator. See references/cadence-profiles.md.

Installation

# Optional — choose where the skill package lands.
# Defaults to $HOME/.openclaw/workspace/skills/ when unset.
export SKILLS_PATH="$HOME/.openclaw/workspace/skills"

# Uses default profile (personal), tz Asia/Manila, announce=false, timeout 1200s:
curl -fsSL https://raw.githubusercontent.com/catx0rr/memory-purifier/main/install.sh | bash

# Or specify explicitly:
curl -fsSL https://raw.githubusercontent.com/catx0rr/memory-purifier/main/install.sh | \
  bash -s -- --agent-profile business --cron-tz Asia/Manila --cron-announce false --timeout-seconds 1200

Other path overrides follow the same pattern — export CONFIG_ROOT=…, export WORKSPACE=…, export TELEMETRY_ROOT=… before the curl | bash line. Full list in INSTALL.md §2.

Installer flags:

| Flag | Default | Effect |
| --- | --- | --- |
| `--agent-profile business\|personal` | `personal` | Seeded profile + cadence. |
| `--local` | off | Install from the directory containing `install.sh` (offline; no git). |
| `--cron-tz <IANA>` | `Asia/Manila` | Timezone for cron registration. Minimum-shape validation. |
| `--cron-announce true\|false` | `false` | `true` registers cron with OpenClaw's explicit announce delivery (`--announce --channel <channel> --to <id>`) and seeds `reporting.enabled = true`; `false` registers with `--no-deliver` (silent) and seeds `reporting.enabled = false`. |
| `--timeout-seconds <int>` | `1200` | Positive-integer per-run timeout passed to `openclaw cron add`. |
| `--skip-cron` | off | Skip cron registration. |
| `--force-config` | off | Overwrite `memory-purifier.json`; also reseeds `reporting.enabled` from `--cron-announce`. |

install.sh installs the package skeleton, seeds control-plane JSONs, and registers cron. It does not create live artifacts — follow INSTALL.md to complete first-time initialization (verify → dry-run → first live run → confirm cron).

Package layout

memory-purifier/
├─ README.md, INSTALL.md, SKILL.md, install.sh
├─ references/           (schemas, contracts, routing + render rules)
├─ prompts/
│  ├─ incremental-purifier-prompt.md     (cron entrypoint — lean execution prompt)
│  ├─ reconciliation-purifier-prompt.md  (cron entrypoint — lean execution prompt)
│  ├─ promotion-pass.md                  (scoring sub-prompt — Pass 1, execution-oriented)
│  └─ purifier-pass.md                   (scoring sub-prompt — Pass 2, execution-oriented)
├─ scripts/              (orchestration scripts, entrypoint: run_purifier.py;
│                         also scripts/sync_cron_delivery.py for delivery drift)
└─ runtime/              (repo scaffold: .gitkeep only)

Prompt philosophy: cron entrypoints are strictly execution prompts (no architecture essays). Pass prompts are execution-oriented with minimum necessary enum/schema references; long explanations live in references/prompt-contracts.md. Further trimming may continue as the package matures.

Claim IDs — purifier-local bookkeeping only

Stable hash IDs in purified-claims.jsonl (cl-<16-hex>) are purifier-local artifact identifiers. They exist for idempotency, supersession linkage, and contradiction cluster bookkeeping within purified state. They are not the canonical truth identifiers used by the downstream reconciler or wiki — the reconciler mints its own identity scheme when it compiles the wiki vault. The purifier may suggest identity (by reusing a prior id on a semantic (subject, predicate, primary_home) match), but the wiki decides final cross-layer canonical identity.
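To make the `cl-<16-hex>` shape concrete, here is one way such an id could be derived. The source states only the format and that ids are stable hashes; the choice of SHA-256, the normalization (trim + lowercase), and hashing the (subject, predicate, primary_home) triple are all assumptions made for this sketch, and `claim_id` is a hypothetical name.

```python
# Hypothetical sketch of a stable purifier-local claim id; hash inputs are assumed.
import hashlib


def claim_id(subject: str, predicate: str, primary_home: str) -> str:
    """Derive a deterministic cl-<16-hex> id from a normalized triple."""
    parts = (subject, predicate, primary_home)
    # A unit-separator join avoids collisions between e.g. ("ab", "c") and ("a", "bc").
    normalized = "\x1f".join(part.strip().lower() for part in parts)
    digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
    return f"cl-{digest[:16]}"
```

Whatever the real inputs are, the property that matters is determinism: the same normalized claim yields the same id on every rerun, which is what makes idempotency and supersession links workable.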

Maintenance behaviors (v1.7.0)

Emergency output-hardening patch. Scope: scoring-pass prompts + parser recovery only. Strictly additive — zero artifact-shape change, zero manifest schema change, zero refuse-and-lock gate fire for the v1.6.0 → v1.7.0 upgrade.

Prompt output discipline tightened. Both scoring prompts (prompts/promotion-pass.md and prompts/purifier-pass.md) replaced the single loose "Return exactly one JSON object, no prose, no markdown fences" line with a CRITICAL three-step block: (1) begin with {, nothing before; (2) end with }, nothing after; (3) output ONLY the JSON — no fences, no code blocks, no commentary, no preambles, no trailing sentences. Placed immediately before the schema so the model reads the constraint right before emitting. Schema + all hard constraints (candidate_id-once, strength ±0.01, merge_candidate_ids only for merge, compress_target only for compress, probable-duplicate discipline) preserved verbatim.
Hardened shared parser with a formal recovery ladder. New scripts/_lib/parse.py exposes parse_pass_output(raw, *, required_top_level=()) and PassOutputParseError. The recovery ladder tries direct JSON parse → fence-strip (for fenced blocks) → brace-scan (for prose-around-JSON) before giving up. Both score_promotion.py and score_purifier.py have migrated to the shared parser; their local extract_json is now a thin wrapper. Recoverable drift (fenced JSON, chatty preamble, trailing sentence) no longer surfaces as a hard partialFailures[] entry.
Fast-fail invariants. Parser hard-rejects inputs that can't safely become Pass output: empty / whitespace-only, non-object top-level (array / string / number), JSON missing any key in required_top_level (run_id + verdicts for Pass 1, run_id + canonical_claims for Pass 2). Fast-fail messages name the specific missing key.
Raw preservation on every failure. PassOutputParseError.raw carries the byte-for-byte original model output alongside reason and a recovery_attempts log of every ladder step tried. Failure records in purifier-failed-*.json now carry exactly what the model returned plus why each recovery step failed — no more "the parser gave up somewhere, good luck."
Version bump policy — logic stays at 1.6.0. PURIFIER_PACKAGE_VERSION bumped to "1.7.0" (release identifier; manifest packageVersion, install seeds). PURIFIER_LOGIC_VERSION deliberately stays at "1.6.0": v1.7.0 is strictly additive hardening (stricter prompt rules + more-forgiving parser), no artifact shape change, no reprocessing needed. Firing the Contract 3 refuse-and-lock gate for v1.6.0 → v1.7.0 would be user-hostile churn for zero payoff. Fresh cron fires on existing v1.6.0 installs proceed normally after pulling v1.7.0 code.
Test coverage: 18 new regression tests in tests/test_parser_hardening.py (default suite 157 → 175) covering: direct parse, prose-before JSON, prose-after JSON, fenced json block, bare fence, prose-around-fenced combined, empty output, whitespace-only, gibberish with recovery-attempts log, top-level array rejected, top-level string rejected, Pass 1 missing run_id, Pass 1 missing verdicts, Pass 2 missing canonical_claims, raw-preserved across scenarios, integration smoke for both scoring scripts.
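The recovery ladder and fast-fail invariants described above can be illustrated with a compact sketch. It reuses the documented names `parse_pass_output`, `PassOutputParseError`, `raw`, `reason`, and `recovery_attempts`, but the body is an independent approximation, not the contents of scripts/_lib/parse.py; the helper names, the fence regex, and the message wording are assumptions.

```python
# Hypothetical sketch of a direct -> fence-strip -> brace-scan recovery ladder.
import json
import re

FENCE = "`" * 3  # built at runtime so this example never contains a literal fence
_FENCED = re.compile(FENCE + r"[A-Za-z0-9_-]*\s*(.*?)" + FENCE, re.DOTALL)


class PassOutputParseError(ValueError):
    """Carries the untouched model output plus a log of every ladder step tried."""

    def __init__(self, reason, raw, recovery_attempts):
        super().__init__(reason)
        self.reason = reason
        self.raw = raw
        self.recovery_attempts = recovery_attempts


def _fence_strip(raw):
    match = _FENCED.search(raw)
    return match.group(1) if match else None


def _brace_scan(raw):
    start, end = raw.find("{"), raw.rfind("}")
    return raw[start:end + 1] if 0 <= start < end else None


def parse_pass_output(raw, *, required_top_level=()):
    attempts = []
    if not raw or not raw.strip():
        raise PassOutputParseError("empty or whitespace-only output", raw, attempts)
    steps = (("direct", lambda s: s), ("fence-strip", _fence_strip), ("brace-scan", _brace_scan))
    parsed, found = None, False
    for name, extract in steps:
        candidate = extract(raw)
        if candidate is None:
            attempts.append(f"{name}: not applicable")
            continue
        try:
            parsed = json.loads(candidate)
        except json.JSONDecodeError as exc:
            attempts.append(f"{name}: {exc.msg}")
            continue
        attempts.append(f"{name}: ok")
        found = True
        break
    if not found:
        raise PassOutputParseError("no JSON could be recovered", raw, attempts)
    if not isinstance(parsed, dict):
        kind = type(parsed).__name__
        raise PassOutputParseError(f"top-level JSON must be an object, got {kind}", raw, attempts)
    for key in required_top_level:
        if key not in parsed:
            raise PassOutputParseError(f"missing required top-level key: {key}", raw, attempts)
    return parsed
```

A Pass 1 caller would pass `required_top_level=("run_id", "verdicts")`, a Pass 2 caller `("run_id", "canonical_claims")`; on failure, `exc.raw` and `exc.recovery_attempts` are what a failure record would persist.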

For older maintenance behavior history (v1.6.0 and earlier), see CHANGELOG.md.

License

MIT — see LICENSE.

Operator procedural contract: SKILL.md. Schema/contract reference: references/.
