Releases · CodeLinaro/llama.cpp

25 Nov 21:25

47f931c

b4170

server : enable cache_prompt by default (#10501)

ggml-ci

Assets 21

21 Nov 18:56

github-actions

b4150

a5e4759

b4150

cuda : optimize argmax (#10441)

* cuda : optimize argmax

* remove unused parameter

ggml-ci

* fixup : use full warps

ggml-ci

* Apply suggestions from code review

Co-authored-by: Johannes Gäßler <[email protected]>

* fix ub

* ggml : check ne00 <= INT32_MAX in argmax and argsort

---------

Co-authored-by: Johannes Gäßler <[email protected]>

Assets 21

20 Nov 19:35

github-actions

b4144

f95caa7

b4144

cmake: add link dependencies to cmake find pkg (#10433)

* cmake pkg: find accelerate, openmp, memkind libs

* cmake pkg: find BLAS libs

* try BLAS_LIBRARIES instead

* Add BLAS link opts

* Add more link deps. and set GGML_ vars

Assets 21

19 Nov 00:02

github-actions

b4126

d3481e6

b4126

cuda : only use native when supported by cmake (#10389)

Assets 21

16 Nov 18:29

github-actions

b4100

bcdb7a2

b4100

server: (web UI) Add samplers sequence customization (#10255)

* Samplers sequence: simplified and input field.

* Removed unused function

* Modify and use `settings-modal-short-input`

* rename "name" --> "label"

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

Assets 21

15 Nov 21:13

github-actions

b4092

883d206

b4092

ggml : fix some build issues

Assets 21

07 Nov 17:20

github-actions

b4042

5107e8c

b4042

DRY: Fixes clone functionality (#10192)

Assets 22

25 Oct 19:41

github-actions

b3978

ff252ea

b3978

llama : add DRY sampler (#9702)

* sampling : add DRY sampler (post-refactor)

* DRY: Trying to fix coauthors, removed unneeded line

* DRY: Fixed redundant code

* DRY: Fixed crash issue due to DRY being in chain but uninitialized

---------

Co-authored-by: l3utterfly <[email protected]>
Co-authored-by: pi6am <[email protected]>

Assets 22

17 Oct 03:02

github-actions

b3933

f010b77

b3933

vulkan : add backend registry / device interfaces (#9721)

* vulkan : add backend registry / device interfaces

* llama : print devices used on model load

Assets 22

08 Oct 21:32

github-actions

b3899

dca1d4b

b3899

ggml : fix BLAS with unsupported types (#9775)

* ggml : do not use BLAS with types without to_float

* ggml : return pointer from ggml_internal_get_type_traits to avoid unnecessary copies

* ggml : rename ggml_internal_get_type_traits -> ggml_get_type_traits

it's not really internal if everybody uses it

Assets 22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: CodeLinaro/llama.cpp

b4170

Uh oh!

b4150

Uh oh!

b4144

Uh oh!

b4126

Uh oh!

b4100

Uh oh!

b4092

Uh oh!

b4042

Uh oh!

b3978

Uh oh!

b3933

Uh oh!

b3899

Uh oh!