Releases: CodeLinaro/llama.cpp
b5460
b5255
ci: fix cross-compile sync issues (#12804)
b5098
convert : ability to lazy-load safetensors remotely without downloading
b5022
opencl : fix memory allocation size (#12649)
issue: https://github.com/CodeLinaro/llama.cpp/pull/17#issuecomment-2760611283
This patch ensures that the memory allocation size does not exceed the maximum allocation size of the OpenCL device.
b4967
SYCL: implement memset ggml backend buffer interface (#12580)
* SYCL: implement memset ggml backend buffer interface
* use GGML_ABORT macro
* Do not wait for all queues to finish for memset operation
b4951
opencl: simplify kernel embedding logic in cmakefile (#12503)
Co-authored-by: Max Krasnyansky <[email protected]>
b4903
vulkan: Add N/2 and N/4 optimized paths in coopmat2 shader (#12312)
b4719
llguidance build fixes for Windows (#11664)
* setup windows linking for llguidance; thanks @phil-scott-78
* add build instructions for windows and update script link
* change VS Community link from DE to EN
* whitespace fix
b4717
cuda : add ampere to the list of default architectures (#11870)
b4667
Make logging more verbose (#11714)
Debugged an issue with a user who was on a read-only filesystem.
Signed-off-by: Eric Curtin <[email protected]>