Skip to content

Latest commit

 

History

History
56 lines (47 loc) · 2.2 KB

README.md

File metadata and controls

56 lines (47 loc) · 2.2 KB

@huggingface/ollama-utils

Various utilities for maintaining Ollama compatibility with GGUF models on the Hugging Face Hub.

For now, we are exposing chat template conversion to the Go format used by Ollama.

Chat template converter

import { convertJinjaToGoTemplate } from "@huggingface/ollama-utils";

const MODEL_INFO_URL = "https://huggingface.co/api/models/bartowski/Llama-3.2-3B-Instruct-GGUF?expand[]=gguf";
const modelInfo = await (await fetch(MODEL_INFO_URL)).json();
console.log(modelInfo);
/**
 * {
 *   gguf: {
 *     chat_template: "here is the Jinja chat template",
 *     bos_token: "...",
 *     eos_token: "...",
 *     [...]
 *   }
 * }
 */
const convertedTemplate = convertJinjaToGoTemplate(modelInfo.gguf);
if (convertedTemplate) {
  console.log(convertedTemplate.ollama);
  /**
   * {
   *   template: "this is the converted template, compatible with Ollama",
   *   tokens: [... list of special tokens],
   *   params: {
   *     stop: [... list of stop tokens or stop words]
   *   }
   * }
   */
} else {
  console.error("Conversion is not successful");
}

How can I add a custom template?

Most templates will be converted automatically. You can debug the output template using:

Please only add a new template when the conversion process above is not successful. Cases that are acceptable include:

  • The converted template is wrong
  • The Jinja template is not compatible with @huggingface/jinja
  • The Jinja template is not "linear," meaning it can modify the content of other messages or append dynamic postfixes. For instance, the DeepSeek template removes <think>...</think> from previous messages in a conversation, making it non-linear. Another example is a template that adds the EOS token </s> when add_generation_prompt=False.

To add a new custom handler:

  1. Edit the list of CUSTOM_TEMPLATE_MAPPING inside chat-template.ts
  2. Add a new test case in chat-template.spec.ts
  3. Push your change to a new PR.