feat: Jina neural reranker + prompt tuning — 57.5% on LoCoMo (dev → main) by CodeNinjaSarthak · Pull Request #63 · CodeNinjaSarthak/eidetic-memory

CodeNinjaSarthak · 2026-04-09T15:00:28Z

Summary

This PR brings the full retrieval improvement pipeline from dev into main,
achieving 57.5% overall accuracy on LoCoMo (n=1540) — up from 46.6% in v2.

What Changed

Jina Neural Reranker

Over-fetch top_k×3 candidates from Qdrant vector search
Rerank via Jina Reranker v2 API before answer generation
Exponential backoff on 429 rate limits, graceful fallback on 403
Adds jina_api_key to Settings, httpx to retrieval dependencies

Prompt Tuning

Category 4 (open-domain) uses unconstrained answer prompt
Recovers open-domain regression: 47.3% → 62.0%

Reproducibility

Set temperature=0 in AzureService.complete() and complete_with_tool()
Improved judge date proximity rule (+0.2pp overall)
Added eval/rejudge_temporal.py for surgical re-judging

Docs

Updated README with SOTA comparison table
Added Mermaid pipeline and retrieval architecture diagrams
Streamlined README for research audience
Added citation block and Apache 2.0 LICENSE

Benchmark Results

Category	RAG Baseline	Pipeline v2	This PR
Single-hop	31.9%	38.7%	38.7%
Temporal	24.9%	57.3%	68.2%
Multi-hop	22.9%	28.1%	38.5%
Open-domain	58.4%	47.3%	62.0%
Overall	44.4%	46.6%	57.5%

Tests

152 passing — make check clean

…rule - Set temperature=0 in AzureService.complete() and complete_with_tool() - Add 7-day proximity rule to judge for relative date gold answers - Re-judge temporal: 57.3% → 57.5% overall, 67.3% → 68.2% temporal - Add eval/rejudge_temporal.py for surgical re-judging

CodeNinjaSarthak added 2 commits April 9, 2026 19:49

docs: fix 57.3→57.5 in blockquote, restore citation section

98fbb6b

CodeNinjaSarthak merged commit 4f8a8e6 into main Apr 9, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Jina neural reranker + prompt tuning — 57.5% on LoCoMo (dev → main)#63

feat: Jina neural reranker + prompt tuning — 57.5% on LoCoMo (dev → main)#63
CodeNinjaSarthak merged 2 commits into
mainfrom
dev

CodeNinjaSarthak commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

CodeNinjaSarthak commented Apr 9, 2026

Summary

What Changed

Jina Neural Reranker

Prompt Tuning

Reproducibility

Docs

Benchmark Results

Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant