Support FP8 accelerate config #39370
base: main
Conversation
```diff
@@ -1256,6 +1256,14 @@ class AcceleratorConfig:
             Whether or not to use a pre-configured `AcceleratorState` or `PartialState` defined
             before calling `TrainingArguments`. If `True`, an `Accelerator` or `PartialState`
             must be initialized. May lead to issues using sweeps or hyperparameter tuning.
+        mixed_precision (`str`, *optional*, defaults to `"no"`):
```
Can you clarify how it is different from the args bf16 and fp16?
@qgallouedec is this because, in the training args, bf16 and fp16 implicitly mean mixed precision, vs the float16 and bfloat16 args?
It looks like bf16 and fp16 are ultimately used to set ACCELERATE_MIXED_PRECISION, which could more cleanly be passed as mixed_precision, since the AcceleratorConfig exposes this. Digging into the training arguments code (transformers/src/transformers/training_args.py, line 1861 at 3d8be20), we have:
```python
# if training args is specified, it will override the one specified in the accelerate config
if self.half_precision_backend != "apex":
    mixed_precision_dtype = os.environ.get("ACCELERATE_MIXED_PRECISION", "no")
    if self.fp16:
        mixed_precision_dtype = "fp16"
    elif self.bf16:
        mixed_precision_dtype = "bf16"
    os.environ["ACCELERATE_MIXED_PRECISION"] = mixed_precision_dtype
```
We set fp16/bf16 from the training args, or from ACCELERATE_MIXED_PRECISION if it is already set, in that priority order. But we could instead just pass the mixed_precision config as exposed by accelerate. fp8 is not set by the existing logic at all.
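To make that concrete, here is a small hypothetical helper (the function name and signature are invented for illustration; this is not code from transformers or this PR) showing how the same priority order could also honor an accelerator-config `mixed_precision` value such as `"fp8"`:

```python
import os


def resolve_mixed_precision(fp16: bool, bf16: bool, accelerator_mixed_precision: str = "no") -> str:
    """Sketch only: pick the mode in the priority order described above, but also
    fall back to an explicit accelerator-config value such as "fp8"."""
    mixed_precision_dtype = os.environ.get("ACCELERATE_MIXED_PRECISION", "no")
    if fp16:
        mixed_precision_dtype = "fp16"
    elif bf16:
        mixed_precision_dtype = "bf16"
    elif accelerator_mixed_precision != "no":
        # covers "fp8", which the fp16/bf16 training args cannot express
        mixed_precision_dtype = accelerator_mixed_precision
    os.environ["ACCELERATE_MIXED_PRECISION"] = mixed_precision_dtype
    return mixed_precision_dtype
```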
We have an old PR that cleans up a bunch of things; let me know if it would also clean things up for you. If so, I can try to take over this PR myself and merge that one first.
if "fp8_config" in accelerator_config and accelerator_config["fp8_config"] is not None: | ||
if "backend" in accelerator_config["fp8_config"]: | ||
recipe_kwargs = MP_BACKEND_TO_KWARGS[accelerator_config["fp8_config"]["backend"]] | ||
fp8_config = accelerator_config["fp8_config"].copy() | ||
|
||
if fp8_config["backend"] == "AO": | ||
from torchao.float8 import Float8LinearConfig | ||
|
||
if "recipe_name" in fp8_config: | ||
recipe_name = fp8_config["recipe_name"] | ||
fp8_config["config"] = ( | ||
Float8LinearConfig.from_recipe_name(recipe_name=recipe_name) | ||
) | ||
fp8_config.pop("recipe_name") | ||
elif "config" in accelerator_config["fp8_config"]: | ||
config = fp8_config["config"] | ||
kwargs = {k: v for k, v in config.items() if v is not None} | ||
fp8_config["config"] = Float8LinearConfig( | ||
**kwargs | ||
) | ||
|
||
fp8_config.pop("backend") | ||
kwargs_handlers = [recipe_kwargs(**fp8_config)] | ||
args["kwargs_handlers"] = kwargs_handlers |
Maybe it would be easier to allow users to pass kwargs_handler directly in accelerate_config?
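For reference, passing the handler directly to Accelerator looks roughly like the sketch below (assuming a recent accelerate release that ships `AORecipeKwargs` and a torchao build that provides `Float8LinearConfig.from_recipe_name`); a kwargs_handlers pass-through in accelerator_config would effectively let the Trainer do the same thing:

```python
# Sketch, not part of this PR: building the fp8 kwargs handler manually and
# handing it to Accelerator directly.
from accelerate import Accelerator
from accelerate.utils import AORecipeKwargs  # assumes a recent accelerate release
from torchao.float8 import Float8LinearConfig  # assumes torchao is installed

fp8_handler = AORecipeKwargs(config=Float8LinearConfig.from_recipe_name("rowwise"))
accelerator = Accelerator(mixed_precision="fp8", kwargs_handlers=[fp8_handler])
```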
That would be great! For the moment we are patching things downstream in axolotl just to get things working. Having this eventually in an upstream release would be nice.
I'll spend some time very soon on that PR to clean up the trainer a bit, then. I'll ping you when it's done so it can be reviewed!
Sorry, I forgot to add the link to the PR I was talking about. Here you go: #37259
What does this PR do?
Adds config options for mixed precision and fp8 config, which are supported by accelerate's Accelerator object. It also parses the `config` field for the torchao ("AO") fp8 backend from dictionary values into the required `Float8LinearConfig` object. This also requires a simple gating change to accelerate, which is already covered by an existing PR: https://github.com/huggingface/accelerate/pull/3677/files#diff-2d7515874eaecac2687c7fc1a9c720be53f802bf14b4c3dcebe14ad443d075dcR501-R505
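For illustration, the intended usage from the Trainer side might look something like this (the field names mirror the parsing code quoted above, but the exact accepted keys could still change before merge):

```python
from transformers import TrainingArguments

# Illustrative only: mixed_precision and fp8_config are the fields this PR adds
# to accelerator_config; the nested keys mirror the diff shown earlier.
training_args = TrainingArguments(
    output_dir="out",
    accelerator_config={
        "mixed_precision": "fp8",
        "fp8_config": {
            "backend": "AO",
            "recipe_name": "rowwise",  # converted into a torchao Float8LinearConfig
        },
    },
)
```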
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
TODO
Note
I've tested this downstream in axolotl and found that it works. Performance benefits can be had for certain models when setting `torch_compile: true`; it's not yet clear to me which models + hyperparameter settings benefit. I'll add some quick numbers here to demonstrate this.