Releases · CodeLinaro/llama.cpp

15 Jul 15:09

8fac431

b3398

ggml : suppress unknown pragma 'GCC' on windows (#8460)

This commit adds a macro guard to pragma GCC to avoid the following
warning on windows:

```console
C:\llama.cpp\ggml\src\ggml-aarch64.c(17,9): warning C4068:
unknown pragma 'GCC' [C:\lama.cpp\build\ggml\src\ggml.vcxproj]
```

Assets 20

11 Jul 15:57

github-actions

b3373

808aba3

b3373

CUDA: optimize and refactor MMQ (#8416)

* CUDA: optimize and refactor MMQ

* explicit q8_1 memory layouts, add documentation

Assets 20

10 Jul 18:44

github-actions

b3368

dd07a12

b3368

Name Migration: Build the deprecation-warning 'main' binary every tim…

Assets 20

28 Jun 14:46

github-actions

b3263

26a39bb

b3263

Add MiniCPM, Deepseek V2 chat template + clean up `llama_chat_apply_t…

Assets 20

25 Jun 16:02

github-actions

b3224

c8ad359

b3224

Gguf dump start data offset via --data-offset and some extra refactor…

Assets 20

13 Jun 22:06

github-actions

b3145

172c825

b3145

rpc : fix ggml_backend_rpc_supports_buft() (#7918)

Assets 20

04 Jun 20:36

github-actions

b3087

1442677

b3087

common : refactor cli arg parsing (#7675)

* common : gpt_params_parse do not print usage

* common : rework usage print (wip)

* common : valign

* common : rework print_usage

* infill : remove cfg support

* common : reorder args

* server : deduplicate parameters

ggml-ci

* common : add missing header

ggml-ci

* common : remote --random-prompt usages

ggml-ci

* examples : migrate to gpt_params

ggml-ci

* batched-bench : migrate to gpt_params

* retrieval : migrate to gpt_params

* common : change defaults for escape and n_ctx

* common : remove chatml and instruct params

ggml-ci

* common : passkey use gpt_params

Assets 20

31 May 17:25

github-actions

b3060

0515ad9

b3060

convert-hf : Handle NotImplementedError in convert-hf-to-gguf (#7660)

Assets 21

28 May 23:49

github-actions

b3029

b864b50

b3029

[SYCL] Align GEMM dispatch (#7566)

* align GEMM dispatch

Assets 21

28 May 00:47

github-actions

b3014

852aafb

b3014

update HIP_UMA #7399 (#7414)

* update HIP_UMA #7399

add use of hipMemAdviseSetCoarseGrain when LLAMA_HIP_UMA is enable.
- get x2 on prompte eval and x1.5 on token gen with rocm6.0 on ryzen 7940HX iGPU (780M/gfx1103)

* simplify code, more consistent style

---------

Co-authored-by: slaren <[email protected]>

Assets 21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: CodeLinaro/llama.cpp

b3398

Uh oh!

b3373

Uh oh!

b3368

Uh oh!

b3263

Uh oh!

b3224

Uh oh!

b3145

Uh oh!

b3087

Uh oh!

b3060

Uh oh!

b3029

Uh oh!

b3014

Uh oh!