A minimal self-improving agent that scores its own behavior and adapts over time.
Most “AI agents” just generate outputs.
This one:
- takes actions
- evaluates itself
- adapts over time
A live feedback loop you can actually watch.
Action → Outcome → Self-Critique → Score → Adapt → Repeat
[ITER 7] action=consolidate
[AGI] success: stabilized memory
[CRITIC] score=2 total=-4
📈 SCORE TREND:
1: -1
2: -2
...
7: -4
👉 The agent discovers beneficial actions.
python3 -m mempalace.learning_loop
This shows a real feedback signal, not just output generation.
- Oscillates, not fully learning yet
- adaptive policy
- memory-driven decisions
- multi-agent dynamics
A tiny system that acts, judges itself, and evolves.