-
Notifications
You must be signed in to change notification settings - Fork 14.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
sampling: reuse token data buffer in llama_sampler_sample
#18365
opened Dec 25, 2025 by
JayZenith
Loading…
cuda: optimize cumsum cub path
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#18362
opened Dec 25, 2025 by
am17an
Loading…
ggml-cuda: fix blackwell native builds
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18361
opened Dec 25, 2025 by
am17an
Loading…
NLLB-600 language translation implementation
model
Model specific
python
python script changes
#18359
opened Dec 25, 2025 by
Acceldium
Loading…
11 of 16 tasks
feat: Add memory factory hook for custom KV cache implementations
#18357
opened Dec 24, 2025 by
rmarnold
Loading…
[WIP] tool-call: experimental migration of all parsers to peg-parser infra (w/ better test coverage)
documentation
Improvements or additions to documentation
examples
python
python script changes
script
Script related
server
testing
Everything test related
vulkan: preprocess mul_mat_id experts and discard workgroups more quickly
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18352
opened Dec 24, 2025 by
jeffbolznv
Loading…
vulkan: optimize decodeFuncB in coopmat2 mul_mat_id shader
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18349
opened Dec 24, 2025 by
jeffbolznv
Loading…
ggml-cpu : add riscv vec dot kernel dispatch based on vlen
ggml
changes relating to the ggml tensor library for machine learning
Work around broken IntelSYCLConfig.cmake in Intel oneAPI 2025.x
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#18345
opened Dec 24, 2025 by
rrsathe
Loading…
common/grammar : replace problematic backtracking regex Everything test related
[\s\S]*
testing
#18342
opened Dec 24, 2025 by
aldehir
Loading…
ggml-cuda : fix INT_MAX overflow in cpy kernels (#18140)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18340
opened Dec 24, 2025 by
Muhammad-Kamran-Khan
Loading…
android: routine maintenance - Dec 2025
android
Issues specific to Android
examples
#18338
opened Dec 24, 2025 by
naco-siren
Loading…
[WIP]ggml-hexagon: improve leftover element calc at changes relating to the ggml tensor library for machine learning
vec_dot_f16_f32
ggml
vulkan: Use BK=32 for coopmat2 mul_mat_id
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18332
opened Dec 23, 2025 by
jeffbolznv
Loading…
full modern bert support
python
python script changes
#18330
opened Dec 23, 2025 by
ryan-mangeno
Loading…
vulkan: Support UPSCALE w/antialias
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18327
opened Dec 23, 2025 by
jeffbolznv
Loading…
Add metal count equal op
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#18314
opened Dec 23, 2025 by
gatbontonpc
Loading…
utils: beging using log.h in tokenize.cpp
examples
#18307
opened Dec 22, 2025 by
syedshazli
Loading…
vulkan: handle rope with large number of rows
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18306
opened Dec 22, 2025 by
jeffbolznv
Loading…
Webui/prompt processing progress
examples
server
#18300
opened Dec 22, 2025 by
ServeurpersoCom
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-11-25.