Pull requests: ggml-org/llama.cpp
GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#16018
opened Sep 15, 2025 by
reeselevine
Deterministic inference mode (CUDA): RMSNorm, MatMul, Attention, KV-cache
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
script
Script related
testing
Everything test related
Add Olmo3 implementation
python
python script changes
#16015
opened Sep 15, 2025 by
2015aroras
Guard ThreadPowerThrottling for non-MSVC builds
ggml
changes relating to the ggml tensor library for machine learning
#16014
opened Sep 15, 2025 by
B1rds3y
Add ROUND operator support for CPU and SYCL backends
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#16011
opened Sep 15, 2025 by
safranowith
ci : upload xcframework artifact from ios-xcode-build job
devops
improvements to build systems and github actions
#16010
opened Sep 15, 2025 by
danbev
SYCL: Add GGML_OP_MEAN operator support
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16009
opened Sep 15, 2025 by
yael-works
ci : create git tags for released docker images
devops
improvements to build systems and github actions
#16008
opened Sep 15, 2025 by
rgerganov
SYCL/SET: Implement and document full support for SET operator
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16006
opened Sep 15, 2025 by
GittyBurstein
Add CEIL operator support for CPU and SYCL backends
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#16005
opened Sep 15, 2025 by
safranowith
Add LLaDA-7b-MoE diffusion model
examples
python
python script changes
#16003
opened Sep 15, 2025 by
am17an
examples : support encoder-decoder models in the simple example
examples
#16002
opened Sep 15, 2025 by
DamonFool
--numa mirror: mirror model weights to every Numa node in the system
Apple Metal
metal : refactor + optimize v2
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
vulkan : shader development improvements
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15993
opened Sep 14, 2025 by
Acly
ggml: add FLOOR unary op (CPU + SYCL)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#15989
opened Sep 14, 2025 by
safranowith
metal : use virtual GPU address for private buffers
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
CUDA: fix FA occupancy, optimize tile kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15982
opened Sep 14, 2025 by
JohannesGaessler
SYCL: Add ARANGE operator with GPU kernel, tests, and documentation
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#15978
opened Sep 14, 2025 by
GittyBurstein
vulkan: automatically remove unsupported devices
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15976
opened Sep 14, 2025 by
netrunnereve
Add resumable downloads for llama-server model loading
#15963
opened Sep 13, 2025 by
ericcurtin