Hi, I have been testing different T5 text encoder variants, and I would very much like to convert and quantize them to llama.cpp GGUF files. After some trial and error, it seems that you added the T5 text encoder architecture to llama.cpp just as you did for the image models. Since the new T5 text encoder still needs to be loaded using your nodes, I think it will be easiest to just ask for the related scripts to convert and quantize the T5 text encoder, if you are willing to share them. Thank you in advance; I look forward to hearing from you.
Hi, T5 models are supported natively by llama.cpp, so the only support in this repo is the loader logic. The conversion was done with vanilla llama.cpp.
I believe I ran T5EncoderModel.from_pretrained on the original weights, then save_pretrained to write them to a folder as safetensors. From there, you can use llama.cpp's default convert_hf_to_gguf.py script, which should give you a valid file that the default llama-quantize binary can handle.
Ah, I see. In my case, I ported the Pile T5 layers to the T5 encoder and merged the two using SVD. It was an experiment, and I didn't expect it to work so well without fine-tuning, since Pile T5 is trained on different datasets and uses a different tokenizer. However, it turned out to unlock Flux a bit, as the filtering shield of T5 was removed.
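The exact SVD merge recipe isn't spelled out above, but one common approach to merging two weight matrices this way is to take the difference between a donor and a base weight, keep only its top-k singular directions, and add that low-rank update back to the base. A minimal sketch under that assumption (function name and parameters are hypothetical):

```python
# Hypothetical low-rank SVD merge: base + alpha * rank-k approx of (donor - base).
import numpy as np

def svd_merge(base, donor, rank=8, alpha=1.0):
    delta = donor - base
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    # Keep the top-`rank` singular directions of the weight difference.
    low_rank = (u[:, :rank] * s[:rank]) @ vt[:rank]
    return base + alpha * low_rank

rng = np.random.default_rng(0)
a = rng.standard_normal((64, 64))  # stand-in for a base T5 weight
b = rng.standard_normal((64, 64))  # stand-in for a donor (Pile T5) weight
merged = svd_merge(a, b, rank=64)  # full rank reproduces the donor exactly
assert np.allclose(merged, b)
```

Lower `rank` values keep only the strongest directions of the donor's deviation, which is one way to fold in another model's behavior without overwriting the base wholesale.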
I leveraged your convert.py to convert clip_g by adding the architecture (https://huggingface.co/Old-Fisherman/SDXL_Finetune_GGUF_Files/resolve/main/convert_g.py?download=true). But I have no idea how to patch it to make it work with llama-quantize. If I use the same method to convert T5 to an F16 GGUF, could you give me some guidance on how to run llama-quantize on it?