Add OpenDataLoader parity coverage and Java-core auto rescue#13
Add OpenDataLoader parity coverage and Java-core auto rescue#13JamesbbBriz wants to merge 283 commits into
Conversation
|
CI note: the current failing |
|
Updated after closing the concrete low-score artifact gap. Latest head: Fresh release full200, Java-core Fresh verification run locally: cargo fmt --manifest-path runtime/doctruth-runtime/Cargo.toml -- --check
cargo test --manifest-path runtime/doctruth-runtime/Cargo.toml --test benchmark_corpus_contract opendataloader_ -- --nocapture
cargo test --manifest-path runtime/doctruth-runtime/Cargo.toml
sh scripts/smoke-doctruth-runtime.sh
sh scripts/smoke-doctruth-runtime-model-worker.sh
sh scripts/smoke-doctruth-runtime-benchmark-corpus.sh
git diff --check
DOCTRUTH_RUNTIME_BUILD_PROFILE=release DOCTRUTH_OPENDATALOADER_PRESET=auto sh scripts/run-opendataloader-java-core-parity.sh --full200CI note remains: |
Summary
preset=autoprediction so readable Java/PDFBox output stays canonical, while sparse output can route through Rust/MNN OCR rescue.low-score-buckets.jsonartifacts from evaluator output so full200 gaps are machine-readable by metric bucket and matrix-aligned behavior family.Full200 Verification
Latest release full200, Java-core
auto+ Rust/MNN rescue:Earlier release full200, deterministic Java-core
litebaseline:Test Plan
CI Note
The GitHub CI
build (25)job is still red onmvn -B -ntp spotless:check checkstyle:check. I reproduced the same 117-file Spotless failure onmainin a temporary worktree, so it is a baseline formatting issue rather than a regression from this PR. I am intentionally not mixing a repo-wide Java formatting sweep into this parser parity PR.Notes
TrustDocumentremains canonical.low-score-buckets.jsonnow separates raw metric buckets from matrix-aligned behavior buckets. Behavior buckets are metric-proxy classifications until the evaluator consumes richer layout tags.