Support loading from `model.safetensors.index.json` #6

findmyway · 2025-03-19T10:08:59Z

No description provided.

pxl-th

LGTM! It would be good to add tests if possible.

ToucheSir · 2025-03-19T19:09:36Z

I noticed that this index file is not covered anywhere in https://huggingface.co/docs/safetensors/index or the safetensors repo. Is it a huggingface-specific thing? Is there a documentation link we can point users to for it?

findmyway · 2025-03-20T03:03:11Z

> I noticed that this index file is not covered anywhere in https://huggingface.co/docs/safetensors/index or the safetensors repo. Is it a huggingface-specific thing? Is there a documentation link we can point users to for it?

See https://huggingface.co/docs/huggingface_hub/package_reference/hf_api#huggingface_hub.utils.SafetensorsRepoMetadata

findmyway · 2025-03-20T05:29:47Z

> LGTM! It would be good to add tests if possible.

Added

chengchingwen

The index.json is defined by Hugging Face in their Python bindings for sharding model weights and is not part of the safetensors format/spec. This should not be in this package (or at least it should be a separate function named something like `load_shard_safetensors` instead of a modification to `load_safetensors`).
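For readers unfamiliar with the index file discussed above, here is a minimal Python sketch of its general shape and of how tensor names can be grouped by shard. The file names and tensor names are made up for illustration, and the layout (a `metadata` object plus a `weight_map` from tensor name to shard file name) reflects what Hugging Face tooling commonly writes rather than any formal spec. It is not the code from this PR.

```python
import json
from collections import defaultdict

# Hypothetical index contents; names and sizes are illustrative only.
INDEX_JSON = """
{
  "metadata": {"total_size": 24},
  "weight_map": {
    "embed.weight": "model-00001-of-00002.safetensors",
    "layer.0.weight": "model-00001-of-00002.safetensors",
    "lm_head.weight": "model-00002-of-00002.safetensors"
  }
}
"""

def group_by_shard(index_text):
    """Return {shard file name: [tensor names]} for an index document."""
    index = json.loads(index_text)
    shards = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        shards[shard_file].append(tensor_name)
    return dict(shards)

shards = group_by_shard(INDEX_JSON)
for shard_file, names in sorted(shards.items()):
    print(shard_file, names)
```

Grouping first means each shard file only needs to be opened once, however many tensors point at it.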

findmyway · 2025-03-21T07:14:53Z

Hi @chengchingwen,

I agree this feature is not part of the spec. The reason is that in real-world cases, different packages may take different approaches to handling the shards (such as loading them in a distributed fashion or running GC during loading). However, I think the modifications I made here provide a nice-to-have fallback (loading them all into memory).

Regarding the naming issue, reusing `load_safetensors` won't introduce a breaking change, but I'd be happy to rename it if you insist.
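The "load everything into memory" fallback described above might look roughly like the following Python sketch. The `load_shard` callable is a hypothetical stand-in for whatever reads a single safetensors file, and plain lists stand in for tensors. This is an illustration of the idea, not the implementation in this PR.

```python
import json

def load_all_shards(index_text, load_shard):
    """Eagerly load every shard named in the index and merge the tensors.

    `load_shard` is a hypothetical callable: shard file name -> {name: tensor}.
    """
    weight_map = json.loads(index_text)["weight_map"]
    merged = {}
    # Visit each shard file once, even though many tensors point at it.
    for shard_file in sorted(set(weight_map.values())):
        tensors = load_shard(shard_file)
        for name, tensor in tensors.items():
            # Keep only tensors the index assigns to this shard.
            if weight_map.get(name) == shard_file:
                merged[name] = tensor
    missing = set(weight_map) - set(merged)
    if missing:
        raise KeyError(f"tensors listed in index but not found: {sorted(missing)}")
    return merged

# Stand-ins for real shard files (illustrative data only).
FAKE_SHARDS = {
    "a.safetensors": {"w1": [1.0, 2.0], "w2": [3.0]},
    "b.safetensors": {"w3": [4.0]},
}
INDEX = json.dumps({"weight_map": {
    "w1": "a.safetensors", "w2": "a.safetensors", "w3": "b.safetensors",
}})

loaded_files = []

def fake_loader(shard_file):
    loaded_files.append(shard_file)
    return FAKE_SHARDS[shard_file]

weights = load_all_shards(INDEX, fake_loader)
print(sorted(weights))
```

Passing the loader in as an argument leaves room for the other strategies mentioned above, such as distributed or lazy loading, without changing the merge logic.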

chengchingwen · 2025-03-21T08:38:32Z

> The reason is that in real-world cases, different packages may take different approaches to handling the shards

That is also part of the reason I think it should not be in this package, but I agree it would be convenient to have and is a reasonable default for loading sharded weights. Personally, I find the index.json somewhat unsafe with respect to the original intention of safetensors. It also has no stable format or spec, so things can break easily and inconspicuously if their Python code changes. Adding a new function also won't be a breaking change, and it would provide a clear separation between spec and custom behavior, so I would insist on the name change if we are going to merge this.
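One way to make such breakage conspicuous is to validate the index before trusting it. The Python sketch below assumes the commonly seen `weight_map` layout. The rule that shard entries must be bare file names is an illustrative policy chosen here, not something taken from this PR or from any spec.

```python
import json

def validate_index(index_text):
    """Return the weight map, or raise ValueError if the layout is unexpected."""
    try:
        index = json.loads(index_text)
    except json.JSONDecodeError as err:
        raise ValueError(f"index is not valid JSON: {err}") from err
    if not isinstance(index, dict):
        raise ValueError("index must be a JSON object")
    weight_map = index.get("weight_map")
    if not isinstance(weight_map, dict) or not weight_map:
        raise ValueError("index has no non-empty 'weight_map' object")
    for name, shard_file in weight_map.items():
        if not isinstance(shard_file, str):
            raise ValueError(f"shard entry for {name!r} is not a string")
        # Assumed policy: shards are bare file names next to the index file.
        if "/" in shard_file or "\\" in shard_file or shard_file in ("", ".", ".."):
            raise ValueError(f"shard entry for {name!r} is not a bare file name")
    return weight_map

def is_rejected(index_text):
    """Helper for the examples below: True if validation fails."""
    try:
        validate_index(index_text)
    except ValueError:
        return True
    return False

good = json.dumps({"weight_map": {"w": "model-00001-of-00002.safetensors"}})
print(validate_index(good))
print(is_rejected(json.dumps({"weights": {}})))                    # key renamed upstream
print(is_rejected(json.dumps({"weight_map": {"w": "../x.bin"}})))  # path outside the directory
```

Failing with an explicit error when the expected key is missing or malformed turns a silent format change into an immediate, visible one.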

chengchingwen · 2025-03-21T08:44:08Z

Also, some unneeded files should be removed, e.g. `test/README.md`.

Support model.safetensors.index.json

94850a2

pxl-th approved these changes Mar 19, 2025

findmyway marked this pull request as draft March 20, 2025 03:44

findmyway added 3 commits March 20, 2025 05:03

add tests for sharded checkpoints

e6744b2

update compat of ProgressMeter

719bd76

fix typo

dbf0b07

findmyway marked this pull request as ready for review March 20, 2025 05:29

bump version

8851d94

chengchingwen requested changes Mar 21, 2025

separate load_sharded_safetensors from load_safetensors

885636c

findmyway requested a review from chengchingwen March 21, 2025 09:22

chengchingwen reviewed Mar 21, 2025

src/SafeTensors.jl (outdated, resolved)

add docs

7cea088

findmyway requested a review from chengchingwen March 21, 2025 13:39

chengchingwen approved these changes Mar 21, 2025

chengchingwen merged commit a720c55 into FluxML:main Mar 21, 2025
9 checks passed
