SKaiNET Ground Truth already defines PyTorch as the reference and stores generated validation artifacts in GGUF. IREE provides both runtime tooling and test patterns for end-to-end execution. This milestone closes the loop: export from SKaiNET, compile with IREE, run on CPU, compare results against PyTorch-derived references, and block merges when numeric tolerances fail. That gives NLnet a milestone with an objective pass/fail gate instead of a documentation claim. ([GitHub][6])
Background links:
- SKaiNET Ground Truth README. ([GitHub][6])
- SKaiNET core and transformer scope for example models. ([GitHub][1])
- IREE
iree-run-module, Python packages, console scripts, and e2e testing. ([IREE][7])
Acceptance criteria:
SKaiNET Ground Truth already defines PyTorch as the reference and stores generated validation artifacts in GGUF. IREE provides both runtime tooling and test patterns for end-to-end execution. This milestone closes the loop: export from SKaiNET, compile with IREE, run on CPU, compare results against PyTorch-derived references, and block merges when numeric tolerances fail. That gives NLnet a milestone with an objective pass/fail gate instead of a documentation claim. ([GitHub][6])
Background links:
iree-run-module, Python packages, console scripts, and e2e testing. ([IREE][7])Acceptance criteria: