Pull requests: LumiOpen/evals

Pull requests list

use per-job tmp dirs to avoid cross-job interference
#38 opened Mar 12, 2026 by dzautner (Contributor)
Add RULER Long Context Evaluation Support
#33 opened Nov 24, 2025 by luomajouni
add multiblimp eng and fin evals for testing
#23 opened Sep 17, 2025 by jonabur (Contributor)