
XPU backend support 8bit optimizer #1565


Conversation

@Liangliang-Ma commented Mar 14, 2025:

This PR adds support for the 8-bit optimizers on the XPU backend.
The backend kernels are now integrated in Intel Extension for PyTorch (IPEX).
We have verified full-path accuracy with blockwise 8-bit Adam.

It also adds a device synchronize function to every backend class to avoid hardcoding CUDA.
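As a rough illustration of the device-synchronize idea (class and method names here are hypothetical, not the exact bitsandbytes backend API):

import torch

# Hypothetical sketch: a per-backend synchronize hook so callers never have to
# hardcode torch.cuda.synchronize(); names are illustrative only.
class Backend:
    def device_synchronize(self) -> None:
        raise NotImplementedError

class CUDABackend(Backend):
    def device_synchronize(self) -> None:
        torch.cuda.synchronize()

class XPUBackend(Backend):
    def device_synchronize(self) -> None:
        # torch.xpu becomes available after importing intel_extension_for_pytorch
        # (or natively in recent PyTorch builds with XPU support).
        torch.xpu.synchronize()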

@jiqing-feng @matthewdouglas @Titus-von-Koeller

@matthewdouglas added the Intel and Optimizers (issues or feature requests relating to optimizers) labels on Mar 18, 2025.
@jiqing-feng (Contributor) commented Mar 26, 2025:

Once I have verified it on IPEX 2.7, we can add XPU tests to test_optim.

@matthewdouglas (Member) commented:

Thanks!

Optimizer support isn't addressed yet in the new custom ops interface that we've mainlined, but we can keep developing it in this branch until that's ready.

Is there a plan to support any other optimizers? Completely understandable if not; just curious!


The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Comment on lines +212 to +219
if out.dtype == torch.float16:
    ipex.xpu.bitsandbytes.cdequantize_blockwise_fp16(code, A, absmax, out, blocksize, A.numel())
elif out.dtype == torch.bfloat16:
    ipex.xpu.bitsandbytes.cdequantize_blockwise_bf16(code, A, absmax, out, blocksize, A.numel())
elif out.dtype == torch.float32:
    ipex.xpu.bitsandbytes.cdequantize_blockwise_fp32(code, A, absmax, out, blocksize, A.numel())
else:
    raise ValueError(f"Blockwise quantization only supports 16/32-bit floats, but got {out.dtype}")
@matthewdouglas (Member) commented Mar 27, 2025:


This will be useful when porting over to the new custom ops as an implementation for bitsandbytes::dequantize_blockwise.out(Tensor A, Tensor absmax, Tensor code, int blocksize, ScalarType dtype, Tensor! out) -> ()

@jiqing-feng (Contributor) replied:

Hard to understand. Could you please supply more details or instructions? Thanks!

@matthewdouglas (Member) replied:

@jiqing-feng
What I meant is that in the new interface we define a custom op for the 8-bit dynamic quantization used by the optimizers and the nested absmax. Since an optimized implementation of this exact op now exists in ipex.xpu, we can simply wrap it during our port.
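As a rough sketch of what such a wrap could look like for the bitsandbytes::dequantize_blockwise.out op quoted above (assuming the op has already been defined via torch.library.define; the exact names and registration mechanics in the mainlined interface may differ):

import torch
import intel_extension_for_pytorch as ipex

# Sketch only: registers the IPEX kernels as the XPU implementation of the
# already-defined custom op; arguments follow the op signature shown earlier.
@torch.library.impl("bitsandbytes::dequantize_blockwise.out", "XPU")
def _dequantize_blockwise_xpu(A, absmax, code, blocksize, dtype, out):
    if dtype == torch.float16:
        ipex.xpu.bitsandbytes.cdequantize_blockwise_fp16(code, A, absmax, out, blocksize, A.numel())
    elif dtype == torch.bfloat16:
        ipex.xpu.bitsandbytes.cdequantize_blockwise_bf16(code, A, absmax, out, blocksize, A.numel())
    elif dtype == torch.float32:
        ipex.xpu.bitsandbytes.cdequantize_blockwise_fp32(code, A, absmax, out, blocksize, A.numel())
    else:
        raise ValueError(f"Blockwise quantization only supports 16/32-bit floats, but got {dtype}")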

@jiqing-feng (Contributor) replied:

> Is there a plan to support any other optimizers? Completely understandable if not; just curious!

Currently there is no plan to enable other optimizers.

@matthewdouglas merged commit 5c48b33 into bitsandbytes-foundation:multi-backend-refactor on Apr 15, 2025 (1 of 2 checks passed).
@Titus-von-Koeller (Collaborator) commented Apr 15, 2025:

The code looks good, thanks for your work on this!

@jiqing-feng @Liangliang-Ma

Please see this short update about the multi-backend refactor #1596.

Regarding the Intel backend, as discussed in parallel with Ke Ding, PRs migrating existing work from multi-backend-refactor should target the new bitsandbytes-intel repo rather than main.

However, some of the pure-torch ops and generic CPU functionality still make more sense in the main branch of bitsandbytes if they don't depend on Intel IPEX. Please align with @matthewdouglas and me on those; it's probably best to discuss that in our shared Slack channel.

@Titus-von-Koeller (Collaborator) commented:

@Liangliang-Ma I invited you to our bitsandbytes-intel Slack channel. Could you join there to discuss whether you're planning to support BNB's PagedOptimizers?

The paged memory feature is what we have in functional.py:get_paged(), which uses cudaMallocManaged under the hood.
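For context, a minimal ctypes sketch of the underlying CUDA unified-memory call (illustrative only; the actual get_paged() path goes through the bitsandbytes native library and may differ):

import ctypes

# Illustrative only: shows the cudaMallocManaged call that backs "paged" buffers.
cudart = ctypes.CDLL("libcudart.so")

def malloc_managed(num_bytes: int) -> ctypes.c_void_p:
    ptr = ctypes.c_void_p()
    flags = ctypes.c_uint(1)  # cudaMemAttachGlobal: accessible from host and any device
    err = cudart.cudaMallocManaged(ctypes.byref(ptr), ctypes.c_size_t(num_bytes), flags)
    if err != 0:
        raise RuntimeError(f"cudaMallocManaged failed with CUDA error {err}")
    return ptr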

@Liangliang-Ma (Author) commented Apr 24, 2025:

@Titus-von-Koeller Due to a change in my work assignments, I will not be doing related work in the near future; another colleague of mine will take over. Thanks for the invitation though :)
