Releases: CodeLinaro/llama.cpp
b3796
quantize : improve type name parsing (#9570)
- quantize : do not ignore invalid types in arg parsing
- quantize : ignore case of type and ftype arguments
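The entry above describes two argument-parsing behaviors: type names are matched ignoring case, and an invalid name is rejected rather than silently ignored. A minimal sketch of that pattern is below; the type-name table, function names, and return convention are illustrative assumptions, not llama.cpp's actual `quantize` implementation.

```c
#include <ctype.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

// Hypothetical table of quantization type names (not llama.cpp's real list).
static const char * const k_type_names[] = { "Q4_0", "Q4_1", "Q8_0", "F16", "F32" };

// Portable case-insensitive string equality (stand-in for strcasecmp).
static bool eq_ignore_case(const char * a, const char * b) {
    for (; *a && *b; ++a, ++b) {
        if (tolower((unsigned char) *a) != tolower((unsigned char) *b)) {
            return false;
        }
    }
    return *a == *b; // equal only if both strings ended together
}

// Return the index of the matching type name, or -1 for an invalid name.
// An invalid argument is reported instead of being silently skipped.
static int parse_type_name(const char * arg) {
    for (size_t i = 0; i < sizeof(k_type_names)/sizeof(k_type_names[0]); ++i) {
        if (eq_ignore_case(arg, k_type_names[i])) {
            return (int) i;
        }
    }
    fprintf(stderr, "invalid type name: %s\n", arg);
    return -1;
}
```

With this shape, `q4_0`, `Q4_0`, and `q4_0`-style mixed-case spellings all resolve to the same type, while a typo produces an error instead of being dropped.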
b3795
ggml : fix builds (#0) ggml-ci
b3790
CUDA: fix sum.cu compilation for CUDA < 11.7 (#9562)
b3787
server : clean-up completed tasks from waiting list (#9531) ggml-ci
b3785
ggml : fix n_threads_cur initialization with one thread (#9538)
- ggml : fix n_threads_cur initialization with one thread
- Update ggml/src/ggml.c

Co-authored-by: Max Krasnyansky <[email protected]>
b3772
ggml : move common CPU backend impl to new header (#9509)
b3749
feat: remove a sampler from a chain (#9445)
- feat: remove a sampler from a chain
- fix: return removed sampler
- fix: safer casting
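The entry above adds the ability to remove a sampler from a chain, with the removed sampler returned to the caller. A minimal sketch of that ownership-transferring removal is below; the `sampler` and `sampler_chain` types and the `chain_remove` name are illustrative assumptions, not llama.cpp's public API, which defines its own sampler-chain functions in its headers.

```c
#include <stddef.h>

// Hypothetical sampler and chain types for illustration only.
typedef struct sampler {
    const char * name;
} sampler;

typedef struct sampler_chain {
    sampler ** items; // borrowed array of sampler pointers
    int        n;     // number of samplers currently in the chain
} sampler_chain;

// Remove the sampler at index i and hand it back to the caller instead of
// freeing it, so ownership transfers out of the chain. Returns NULL for an
// out-of-range index, leaving the chain untouched.
static sampler * chain_remove(sampler_chain * chain, int i) {
    if (chain == NULL || i < 0 || i >= chain->n) {
        return NULL;
    }
    sampler * removed = chain->items[i];
    for (int j = i; j + 1 < chain->n; ++j) {
        chain->items[j] = chain->items[j + 1]; // close the gap
    }
    chain->n--;
    return removed;
}
```

Returning the removed element (rather than destroying it) lets callers re-insert the sampler elsewhere or free it themselves, which matches the "return removed sampler" fix noted in the release.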
b3733
llama : skip token bounds check when evaluating embeddings (#9437)
b3713
llama : minor sampling refactor (2) (#9386)
b3646
Correct typo run_llama2.sh > run-llama2.sh (#9149)