
Pull requests: SemiAnalysisAI/InferenceX


Pull requests list

Enable shuffled KV cache layout for MiniMax vLLM
#1199 opened Apr 27, 2026 by jiacao-amd
[AMD/ROCM] Update gptoss-fp4-mi355x-atom config AMD
#1195 opened Apr 27, 2026 by seungrokj Collaborator
1 task
[AMD/ROCM] Update minimaxm2.5-fp8-mi355x-atom config AMD
#1194 opened Apr 27, 2026 by seungrokj Collaborator
1 task
Add vLLM DSv4 FP8 MI355X benchmark (vllm#40889) full-sweep-enabled
#1188 opened Apr 26, 2026 by Oseltamivir Collaborator
3 of 4 tasks
dsv4-b300-sglang: conc=2048 mega_moe deepep recipe
#1179 opened Apr 26, 2026 by yhyang201 Collaborator
3 tasks
gb300 1k1k sglang sweep-enabled
#1169 opened Apr 26, 2026 by Oseltamivir Collaborator
4 of 5 tasks
[DON'T MERGE] [NV] dsv4-fp4-gb200-dynamo-vllm sweep-enabled
#1163 opened Apr 26, 2026 by Ankur-singh Collaborator
Day 0 DeepSeek V4 Pro FP4 GB200 disaggregated SGLang benchmarks sweep-enabled
#1157 opened Apr 25, 2026 by Oseltamivir Collaborator
4 of 5 tasks
Day 0 GB300 DeepSeek-V4-Pro FP4 vLLM disagg sweep-enabled
#1150 opened Apr 25, 2026 by Oseltamivir Collaborator
3 tasks
[AMD/ROCM] Qwen3.5-397B-A17B BF16 MI355X Atom benchmarks
#1149 opened Apr 25, 2026 by seungrokj Collaborator
[NVIDIA] chore: B200 single node DeepSeek v4 SGLang MTP NVIDIA sweep-enabled
#1145 opened Apr 24, 2026 by cquil11 Collaborator
1 task
[draft] minimax & kimi Amd/vllm disagg mvp dev
#1141 opened Apr 24, 2026 by ichbinblau Collaborator
[AMD/ROCM] atom qwen3.5 fp4 on mi355x AMD
#1133 opened Apr 24, 2026 by seungrokj Collaborator
1 task
[AMD/ROCM] atom glm5 fp8 on mi355x AMD
#1126 opened Apr 24, 2026 by seungrokj Collaborator
2 tasks
[AMD/ROCM] GLM5.1 FP8 MTP Support on MI355X AMD
#1122 opened Apr 23, 2026 by ajith-sirra-amd Contributor
[WIP] Allow overriding srt-slurm repo/ref at the launcher level
#1118 opened Apr 22, 2026 by Oseltamivir Collaborator
Add new feature for kimi k2.5 mtp support
#1115 opened Apr 22, 2026 by haic0 Collaborator Draft
Add haic0 patch for AMD kimi k2.5 MTP support
#1108 opened Apr 21, 2026 by haic0 Collaborator Draft