Pull requests: EfficientMoE/MoE-Infinity
feat: expert I/O microbenchmark suite with NVTX instrumentation
#89 opened Apr 4, 2026 by drunkcoding
feat(benchmarks): add Dockerized comparison benchmark suite (vLLM, llama.cpp, gpt-oss-20b)
#88 opened Apr 3, 2026 by drunkcoding
feat(quant): add GPTQ/AWQ quantized checkpoint support (fixes #70)
#82 opened Apr 1, 2026 by drunkcoding