Skip to content

Releases: CodeLinaro/llama.cpp

b4150

21 Nov 18:56
a5e4759
Compare
Choose a tag to compare
cuda : optimize argmax (#10441)

* cuda : optimize argmax

* remove unused parameter

ggml-ci

* fixup : use full warps

ggml-ci

* Apply suggestions from code review

Co-authored-by: Johannes Gäßler <[email protected]>

* fix ub

* ggml : check ne00 <= INT32_MAX in argmax and argsort

---------

Co-authored-by: Johannes Gäßler <[email protected]>

b4144

20 Nov 19:35
f95caa7
Compare
Choose a tag to compare
cmake: add link dependencies to cmake find pkg (#10433)

* cmake pkg: find accelerate, openmp, memkind libs

* cmake pkg: find BLAS libs

* try BLAS_LIBRARIES instead

* Add BLAS link opts

* Add more link deps. and set GGML_ vars

b4126

19 Nov 00:02
d3481e6
Compare
Choose a tag to compare
cuda : only use native when supported by cmake (#10389)

b4100

16 Nov 18:29
bcdb7a2
Compare
Choose a tag to compare
server: (web UI) Add samplers sequence customization (#10255)

* Samplers sequence: simplified and input field.

* Removed unused function

* Modify and use `settings-modal-short-input`

* rename "name" --> "label"

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

b4092

15 Nov 21:13
Compare
Choose a tag to compare
ggml : fix some build issues

b4042

07 Nov 17:20
5107e8c
Compare
Choose a tag to compare
DRY: Fixes clone functionality (#10192)

b3978

25 Oct 19:41
ff252ea
Compare
Choose a tag to compare
llama : add DRY sampler (#9702)

* sampling : add DRY sampler (post-refactor)

* DRY: Trying to fix coauthors, removed unneeded line

* DRY: Fixed redundant code

* DRY: Fixed crash issue due to DRY being in chain but uninitialized

---------

Co-authored-by: l3utterfly <[email protected]>
Co-authored-by: pi6am <[email protected]>

b3933

17 Oct 03:02
f010b77
Compare
Choose a tag to compare
vulkan : add backend registry / device interfaces (#9721)

* vulkan : add backend registry / device interfaces

* llama : print devices used on model load

b3899

08 Oct 21:32
dca1d4b
Compare
Choose a tag to compare
ggml : fix BLAS with unsupported types (#9775)

* ggml : do not use BLAS with types without to_float

* ggml : return pointer from ggml_internal_get_type_traits to avoid unnecessary copies

* ggml : rename ggml_internal_get_type_traits -> ggml_get_type_traits

it's not really internal if everybody uses it

b3862

02 Oct 05:40
3f1ae2e
Compare
Choose a tag to compare
Update README.md (#9591)

Add Bielik model.