Safetensors conversion #2290


Merged Jul 16, 2025 (24 commits)

Conversation

@Bond099 (Contributor) commented Jun 6, 2025

Description of the change

Reference

Colab Notebook

https://colab.research.google.com/drive/1naqf0sO2J40skndWbVMeQismjL7MuEjd?usp=sharing

Checklist

  • I have added all the necessary unit tests for my change.
  • I have verified that my change does not break existing code and works with all backends (TensorFlow, JAX, and PyTorch).
  • My PR is based on the latest changes of the main branch (if unsure, rebase the code).
  • I have followed the Keras Hub Model contribution guidelines in making these changes.
  • I have followed the Keras Hub API design guidelines in making these changes.
  • I have signed the Contributor License Agreement.

@abheesht17 (Collaborator):

Thanks for the PR, will take a look in a bit :)

@mattdangerw (Member) left a comment:

Thanks! Just left some initial comments.

@mattdangerw (Member) left a comment:

Let's add a unit test that calls this util, tries loading the result with transformers, and checks that it works. OK to add transformers to our CI environment here: https://github.com/keras-team/keras-hub/blob/master/requirements-common.txt

@mattdangerw mattdangerw added the kokoro:force-run Runs Tests on GPU label Jun 19, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jun 19, 2025
@mattdangerw (Member) left a comment:

Nice! Please address the changes from the earlier PR as well.

@abheesht17 (Collaborator) left a comment:

Thanks, nice work!

return hf_config


def export_to_hf(keras_model, path):

@abheesht17 (Collaborator) commented Jun 19, 2025:

Also, do you think we should refactor some of the common code across models into a separate file? We can then expose that as the API.

So, this is how the directory keras_hub/src/utils/transformers/convert_to_safetensor/ will look:

  • export.py: this will have the common code, which we will expose as the API. It will also check whether we support safetensors conversion for a given model yet.
  • gemma.py: this will just have a way to create the weight dictionary for Gemma. Inside export.py, we will call the weight conversion function specific to the specified model.

Pinging @mattdangerw to confirm if we should do this now or at a later point.

Member:

I think we could land this and do the API bit at a later point, though I agree it's an important concern. I'm not sure if we want a method like model.save_to_preset() or a function like some_export(model). Any thoughts?

@Bond099 (Author):

I think structuring the export logic with a utility function (export_to_hf) and model-specific mappings (gemma.py) will enhance scalability and maintainability. New models can be added by creating a new file, while existing tests only need an import update.
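For illustration, a minimal sketch of what such an export_to_hf utility could look like. The config keys, file layout, and the vocabulary_size attribute below are assumptions for the sake of a self-contained example, not the PR's actual code; a real exporter would also write model.safetensors and tokenizer assets.

```python
import json
import os


def export_to_hf(keras_model, path):
    # Illustrative sketch only: write a minimal HF-style config.json.
    # Weight and tokenizer export are omitted here.
    os.makedirs(path, exist_ok=True)
    hf_config = {
        "model_type": "gemma",
        "vocab_size": keras_model.vocabulary_size,
    }
    with open(os.path.join(path, "config.json"), "w") as f:
        json.dump(hf_config, f, indent=2)
    return hf_config
```

With this shape, per-model modules only need to supply the config and weight dictionaries, while the directory handling stays in one place.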

Collaborator:

+1 to Abheesht's comment. We need an API instead of a script; for Gemma, we already have that:
https://github.com/keras-team/keras-hub/blob/master/tools/gemma/export_gemma_to_hf.py

@abheesht17 abheesht17 added the kokoro:force-run Runs Tests on GPU label Jun 24, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jun 24, 2025
@mattdangerw mattdangerw mentioned this pull request Jun 26, 2025
@abheesht17 abheesht17 added the kokoro:force-run Runs Tests on GPU label Jul 2, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jul 2, 2025
@abheesht17 (Collaborator) left a comment:

Leaving comments since I don't see the changes we discussed last week.

@abheesht17 (Collaborator) left a comment:

Okay, reviewed. Let's fix the tests!

GemmaCausalLMPreprocessor,
)
from keras_hub.src.models.gemma.gemma_tokenizer import (
GemmaTokenizer as KerasGemmaTokenizer,
@abheesht17 (Collaborator):

Why not just GemmaTokenizer?

@Bond099 (Author):

HF also imports the tokenizer as GemmaTokenizer, so I just used KerasGemmaTokenizer to avoid confusion.

@abheesht17 (Collaborator) commented Jul 12, 2025:

Oh, okay. Then it's better to call the HF import as HFGemmaTokenizer and the Keras one as GemmaTokenizer, I suppose

@divyashreepathihalli (Collaborator):

/gemini review

@gemini-code-assist (bot) left a comment:

Code Review

The code changes introduce a utility function to export Keras Gemma models to Hugging Face format, saving the configuration, weights, and tokenizer assets. The review focuses on improving the robustness of weight mapping, adding checks for empty weight dictionaries, and enhancing the warning message for missing vocabulary files.

@@ -0,0 +1,161 @@
import json
Collaborator:

Let's make this a model-agnostic export utility. Rename the file to safetensor_exporter.py and add a dict to maintain the mapping:

MODEL_EXPORTERS = {
    "GemmaBackbone": gemma_exporter.get_gemma_weights_map,
    "LlamaBackbone": llama_exporter.get_llama_weights_map,  # Future
}

and a user-facing API function for the export:

def export_to_safetensors(keras_model):
    ...
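A minimal runnable sketch of that dispatch pattern. The stub backbone class and the weight names below are made up for illustration; only the MODEL_EXPORTERS dict and export_to_safetensors names come from the suggestion above.

```python
class GemmaBackbone:
    """Stand-in for keras_hub's GemmaBackbone, for illustration only."""

    def __init__(self):
        self.weights = {"token_embedding": [[0.0, 0.1]]}


def get_gemma_weights_map(backbone):
    # The per-model module's job: map Keras weight names to HF
    # safetensors keys.
    return {"model.embed_tokens.weight": backbone.weights["token_embedding"]}


# The shared exporter's job: dispatch on the backbone class name and
# fail loudly for unsupported models.
MODEL_EXPORTERS = {
    "GemmaBackbone": get_gemma_weights_map,
}


def export_to_safetensors(keras_model):
    name = type(keras_model).__name__
    if name not in MODEL_EXPORTERS:
        raise ValueError(f"Safetensors export is not supported for {name} yet.")
    return MODEL_EXPORTERS[name](keras_model)
```

Adding a new model then only means writing one weights-map function and registering it in the dict.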

Collaborator:

Implement an exporter mapping for each model. For this PR's scope, just the Gemma model, which can serve as a prototype for other models.

Collaborator:

Yeah, let's land this PR first and do this in a separate PR: #2290 (comment)

@abheesht17 abheesht17 added the kokoro:force-run Runs Tests on GPU label Jul 12, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jul 12, 2025
@abheesht17 (Collaborator) left a comment:

LGTM, awesome work! Made a few cosmetic changes

@abheesht17 abheesht17 added the kokoro:force-run Runs Tests on GPU label Jul 12, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jul 12, 2025
@divyashreepathihalli (Collaborator) left a comment:

Let's land the generic API in a different PR! Thanks for the great work!

@mattdangerw (Member) left a comment:

Thanks! Just a few changes.

"This is a test.",
]
proto_prefix = os.path.join(self.get_temp_dir(), "dummy_vocab")
SentencePieceTrainer.train(
Member:

As a follow-up (not on this PR), let's consider using keras_hub/src/tests/test_data/gemma_test_vocab.spm instead of retraining a new vocab here. It will be faster. Maybe leave a TODO?

Collaborator:

I actually tried using it, and hit some issues. I didn't really dive deeper though, because I was lazy

@mattdangerw (Member) left a comment:

Actually, let me switch to approval so I won't block merging this. But let's address these comments before merge!

@mattdangerw (Member):

ty!

@mattdangerw mattdangerw merged commit 9989fda into keras-team:master Jul 16, 2025
7 checks passed
5 participants