Skip to content

0.8128 BPB: Classical Compression Eval + N-gram Backoff on PR #549 Base#786

Open
shinegami-2002 wants to merge 2 commits intoopenai:mainfrom
shinegami-2002:submission/classical-compression-eval
Open

0.8128 BPB: Classical Compression Eval + N-gram Backoff on PR #549 Base#786
shinegami-2002 wants to merge 2 commits intoopenai:mainfrom
shinegami-2002:submission/classical-compression-eval

Conversation

@shinegami-2002
Copy link
Copy Markdown

@shinegami-2002 shinegami-2002 commented Mar 26, 2026

Summary

Approach

Eval-time augmentation inspired by classical data compression (cmix/PAQ). Multi-order n-gram backoff with entropy-adaptive alpha, vectorized numpy implementation. All backward-looking, zero artifact cost.

Key Numbers

Eval Method val_bpb
Standard sliding window (stride=64) 1.1218
+ N-gram backoff + entropy-adaptive alpha 0.8128

Test plan

  • Full training run on 8xH100 (600s, 7135 steps)
  • Compressed eval completes within eval budget (383s)
  • Artifact under 16 MB (15.88 MB)
  • 2 more seeds for statistical significance (pending compute grant)
  • Ablation: per-order contribution

Credits

Built on PR #549 base. N-gram technique inspired by PR #727. Classical compression research from cmix/PAQ.

🤖 Generated with Claude Code

shinegami-2002 and others added 2 commits March 25, 2026 20:14
Novel approach bringing cmix/PAQ techniques (n-gram backoff, match model,
APM error correction, logistic mixing) as eval-time augmentation on top
of the PR openai#549 neural model stack. Initial proof of concept on 1xH100
shows compression pipeline working. Pending full 8xH100 run.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Full training run on 8xH100 SXM (7135 steps, 600s wallclock).
Base model: 1.1218 BPB (sliding window). With n-gram backoff
(orders 2-7) + entropy-adaptive alpha: 0.8128 BPB.
Artifact: 15.88 MB. Eval time: 383s.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@shinegami-2002 shinegami-2002 changed the title Classical Compression Eval-Time Augmentation (WIP) 0.8128 BPB: Classical Compression Eval + N-gram Backoff on PR #549 Base Mar 26, 2026
@shinegami-2002 shinegami-2002 marked this pull request as ready for review March 27, 2026 01:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant