Releases: CodeLinaro/llama.cpp
b3841
common : ensure llama_batch size does not exceed max size (#9668) A crash was observed when the number of tokens added to a batch exceeded the llama_batch size. An assertion was added in llama_batch_add to guard against llama_batch size overflow.
b3828
[SYCL] add missing DLL file in package (#9577) * update oneapi to 2024.2 * use 2024.1 --------- Co-authored-by: arthw <[email protected]>
b3826
ci : fix docker build number and tag name (#9638) * ci : fix docker build number and tag name * fine-grained permissions
b3821
ggml : add AVX512DQ requirement for AVX512 builds (#9622)
b3814
threads: fix msvc build without openmp (#9615) atomic_thread_fence() was missing in MSVC builds when OpenMP is disabled.
b3810
readme : add programmable prompt engine language CLI (#9599)
b3805
Revert "[SYCL] fallback mmvq (#9088)" (#9579) This reverts commit 50addec9a532a6518146ab837a85504850627316.
b3799
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG …
b3798
Update CUDA graph on scale change plus clear nodes/params (#9550) * Avoid using saved CUDA graph if scale changes and reset nodes/params on update Fixes https://github.com/ggerganov/llama.cpp/issues/9451 * clear before resize
b3796
quantize : improve type name parsing (#9570) * do not ignore invalid types in arg parsing * ignore case of type and ftype arguments