Enable Checkpoint Conversion from Huggingface to Maxtext #1839

YixuanWang-99 · 2025-06-16T18:40:38Z

Description

Enable checkpoint conversion from Huggingface to Maxtext.

Add to_maxtext.py to perform the checkpoint conversion from HF to MaxText.
Add convert_gemma2_to_mt.sh to automate the conversion and verification.
Add mt_hf_mutual_conversion_check.py to compare the Huggingface and MaxText checkpoints.
Official Gemma2 models are supported.

Tests

The converted checkpoint is tested with mt_hf_mutual_conversion_check.py. It compares:

For given prompts, the top-k predicted tokens and scores for the next token;
The KL divergence of the full logit distributions.

Tested on Gemma2-2b Model. A successful conversion example.
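
For readers unfamiliar with these two checks, here is a minimal NumPy sketch of what comparing top-k next-token predictions and the KL divergence of full logit distributions can look like. It is not the code in mt_hf_mutual_conversion_check.py; the function names (`log_softmax`, `top_k`, `kl_divergence`) and the toy logits are assumptions made for illustration only.

```python
import numpy as np


def log_softmax(logits: np.ndarray) -> np.ndarray:
    """Numerically stable log-softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))


def top_k(logits: np.ndarray, k: int) -> tuple[np.ndarray, np.ndarray]:
    """Return (token_ids, scores) of the k largest logits, highest first."""
    ids = np.argsort(logits)[::-1][:k]
    return ids, logits[ids]


def kl_divergence(ref_logits: np.ndarray, test_logits: np.ndarray) -> float:
    """KL(P_ref || P_test) between the softmax distributions of two logit vectors."""
    log_p = log_softmax(ref_logits)
    log_q = log_softmax(test_logits)
    return float(np.sum(np.exp(log_p) * (log_p - log_q)))


# Toy next-token logits standing in for the two models' outputs (hypothetical data).
rng = np.random.default_rng(0)
hf_logits = rng.normal(size=32)
mt_logits = hf_logits + rng.normal(scale=1e-4, size=32)  # small numerical drift

hf_ids, hf_scores = top_k(hf_logits, k=5)
mt_ids, mt_scores = top_k(mt_logits, k=5)
kl = kl_divergence(hf_logits, mt_logits)
print("top-5 ids match:", bool(np.array_equal(hf_ids, mt_ids)), "KL:", kl)
```

A faithful conversion should give matching top-k token ids and a KL divergence close to zero; the acceptable tolerance depends on dtype and hardware and is not specified here.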

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed.

hengtaoguo

Excellent work!

MaxText/tests/mt_hf_mutual_conversion_check.py

hengtaoguo · 2025-06-20T17:59:55Z

MaxText/utils/ckpt_conversion/to_maxtext.py

This is nice, thanks for adding such a feature!

MaxText/utils/ckpt_conversion/to_maxtext.py

hengtaoguo · 2025-06-20T18:35:53Z

Hi @gagika! I've heard this might be interesting to you for loading/saving HF checkpoints. Would you like to take a look when you get a chance? Thanks a lot for your time!

shralex

Thanks Yixuan! Added a few comments

MaxText/utils/ckpt_conversion/to_maxtext.py

MaxText/tests/mt_hf_mutual_conversion_check.py

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh

MaxText/utils/ckpt_conversion/to_maxtext.py

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh

shralex

Thanks for addressing the comments! I have one small comment and also a question: did you test both directions, to and from HF? If so, can you add both to the testing section of the PR description? It currently includes one example. Thanks!

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh

shralex · 2025-06-24T15:50:04Z

MaxText/tests/mt_hf_mutual_conversion_check.py

+limitations under the License.
+"""
+
+"""


Could you please clarify the difference between this file and forward_pass_logits_checker? forward_pass_logits_checker is used in all our end-to-end tests to verify logits. If something is missing from it, I wonder whether we could change it rather than create a new file.

YixuanWang-99 · 2025-06-24T17:49:31Z

Thanks for addressing the comments! I have one small comment and also a question: did you test both directions, to and from HF? If so, can you add both to the testing section of the PR description? It currently includes one example. Thanks!

The from-HF conversion, with examples, was pushed in previous PRs: #1785 and #1821. I have also revised the run name.

Enable conversion from huggingface to maxtext

74b2608

YixuanWang-99 changed the title ~~Enable conversion from Huggingface to Maxtext~~ Enable Checkpoint Conversion from Huggingface to Maxtext Jun 16, 2025

YixuanWang-99 added 3 commits June 16, 2025 19:56

convert shell script added

5271ef7

add mt_hf_conversion check and example shell

adec18d

refine arguments of mt_hf_check

3c431ca

hengtaoguo approved these changes Jun 20, 2025

hengtaoguo marked this pull request as ready for review June 20, 2025 18:43

hengtaoguo requested review from gobbleturk, khatwanimohit, bvandermoon, vipannalla, RissyRan, richjames0, gagika, shralex, yangyuwei, SurbhiJainUSC, A9isha and aireenmei as code owners June 20, 2025 18:43

Minor fix of model_id check and remove unused comments

289f62e

shralex reviewed Jun 21, 2025

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh (outdated, resolved)

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh (outdated, resolved)

shralex reviewed Jun 22, 2025

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh (resolved)

MaxText/utils/ckpt_conversion/to_maxtext.py (resolved)

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh (outdated, resolved)

shralex reviewed Jun 23, 2025

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh (outdated, resolved)

add decriptions of scripts and minor fix of naming

4f086f7

shralex reviewed Jun 24, 2025

MaxText/utils/ckpt_conversion/examples/convert_gemma2_to_mt.sh (outdated, resolved)

shralex reviewed Jun 24, 2025

revised run name

8996b41

fix comment

6099bda
