-
Notifications
You must be signed in to change notification settings - Fork 149
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[AMD/ROCM] Update gptoss-fp4-mi355x-atom config
AMD
#1195
opened Apr 27, 2026 by
seungrokj
Collaborator
Loading…
1 task
[AMD/ROCM] Update minimaxm2.5-fp8-mi355x-atom config
AMD
#1194
opened Apr 27, 2026 by
seungrokj
Collaborator
Loading…
1 task
Add vLLM DSv4 FP8 MI355X benchmark (vllm#40889)
full-sweep-enabled
#1188
opened Apr 26, 2026 by
Oseltamivir
Collaborator
Loading…
3 of 4 tasks
dsv4-b300-sglang: conc=2048 mega_moe deepep recipe
#1179
opened Apr 26, 2026 by
yhyang201
Collaborator
Loading…
3 tasks
gb300 1k1k sglang
sweep-enabled
#1169
opened Apr 26, 2026 by
Oseltamivir
Collaborator
Loading…
4 of 5 tasks
[DON'T MERGE] [NV] dsv4-fp4-gb200-dynamo-vllm
sweep-enabled
#1163
opened Apr 26, 2026 by
Ankur-singh
Collaborator
Loading…
Day 0 DeepSeek V4 Pro FP4 GB200 disaggregated SGLang benchmarks
sweep-enabled
#1157
opened Apr 25, 2026 by
Oseltamivir
Collaborator
Loading…
4 of 5 tasks
Day 0 GB300 DeepSeek-V4-Pro FP4 vLLM disagg
sweep-enabled
#1150
opened Apr 25, 2026 by
Oseltamivir
Collaborator
Loading…
3 tasks
[AMD/ROCM] Qwen3.5-397B-A17B BF16 MI355X Atom benchmarks
#1149
opened Apr 25, 2026 by
seungrokj
Collaborator
Loading…
[NVIDIA] chore: B200 single node DeepSeek v4 SGLang MTP
NVIDIA
sweep-enabled
#1145
opened Apr 24, 2026 by
cquil11
Collaborator
Loading…
1 task
Add H100 config: dsv4-fp8-dynamo-vllm (DeepSeek-V4-Pro multinode disagg)
sweep-enabled
#1142
opened Apr 24, 2026 by
Oseltamivir
Collaborator
Loading…
[draft] minimax & kimi Amd/vllm disagg mvp dev
#1141
opened Apr 24, 2026 by
ichbinblau
Collaborator
Loading…
Add DeepSeek-V4-Pro SGLang aggregated GB200 benchmarks (NVIDIA srt-slurm PR #69)
sweep-enabled
#1137
opened Apr 24, 2026 by
Oseltamivir
Collaborator
Loading…
3 of 5 tasks
[AMD/ROCM] atom qwen3.5 fp4 on mi355x
AMD
#1133
opened Apr 24, 2026 by
seungrokj
Collaborator
Loading…
1 task
[AMD/ROCM] atom glm5 fp8 on mi355x
AMD
#1126
opened Apr 24, 2026 by
seungrokj
Collaborator
Loading…
2 tasks
[AMD/ROCM] GLM5.1 FP8 MTP Support on MI355X
AMD
#1122
opened Apr 23, 2026 by
ajith-sirra-amd
Contributor
Loading…
[WIP] Allow overriding srt-slurm repo/ref at the launcher level
#1118
opened Apr 22, 2026 by
Oseltamivir
Collaborator
Loading…
[AMD/ROCm] Add Kimi-K2.5 FP4 vLLM Eagle3 speculative decoding config for MI355X
#1116
opened Apr 22, 2026 by
chunfangamd
Collaborator
•
Draft
[AMD/Hyperloom] Tune dsr1-fp8-mi355x-sglang: --num-continuous-decode-steps 4 → 8
#1109
opened Apr 21, 2026 by
lishuoshuo-amd
Loading…
4 tasks done
[sglang broken] Add MI355X config: qwen3.5-fp4-sglang-mtp
vllm/sglang release broken -need to wait
#1078
opened Apr 18, 2026 by
functionstackx
Contributor
Loading…
3 of 4 tasks
[vllm broken - waiting for 0.20] Add B300 config: kimi-k2.5-int4-vllm
vllm/sglang release broken -need to wait
#1071
opened Apr 17, 2026 by
cquil11
Collaborator
Loading…
2 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.