Attempt to add the mllama support #11639
Draft · +1,630 −15
Motivation

This PR attempts to bring the mllama support from the Ollama GitHub repository into the examples of this repository. The code changes come mainly from the llama patch, the operator patch, and the mllama implementation in the ollama repo.
Goals

- mllama support in `clip` (in `llava`)
- `unpad` operation support

Current Status
There are still some issues with this implementation:

1. Model converter: the example model and projection are not on Hugging Face. Currently I use the `ollama` application to fetch the converted model for testing.
2. The `n_vocab` (`n_tokens` loaded from the model) does not match the tensor dimension: `n_tokens` is 128257, while the dimension of `LLM_TENSOR_OUTPUT`, for example, is 128256. It seems like something is wrong in the converted model.
3. As mentioned in 2., some assertions fail when executing the mllama models: `ggml_backend_tensor_get_async` and `ggml_backend_tensor_get` fail the tensor-read out-of-bounds check.