[torchlib] Implement quantize_per_channel and dequantize_per_channel #2390
Conversation
Co-authored-by: justinchuby <[email protected]>
    quant_min: int,
    quant_max: int,
Are these unused? Why is that?
These parameters are unused because the ONNX QuantizeLinear/DequantizeLinear operators don't take explicit quant_min/quant_max inputs; they derive the quantization range from the target data type. The parameters are kept in the function signature for API compatibility with PyTorch's reference implementation, following the same pattern as the existing per-tensor functions above.
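For context, a minimal sketch of what such a delegation could look like. The imports and `@torch_op` registration follow torchlib's usual pattern, but this is an assumption for illustration, not the PR's exact code:

```python
# Sketch: why quant_min/quant_max can be accepted but ignored. ONNX
# QuantizeLinear saturates to the range implied by the zero-point tensor's
# dtype, so the explicit bounds from the PyTorch signature are redundant.
from onnxscript.function_libs.torch_lib.registration import torch_op
from onnxscript.onnx_opset import opset18 as op
from onnxscript.onnx_types import TensorType


@torch_op("quantized_decomposed::quantize_per_channel", trace_only=True)
def quantized_decomposed_quantize_per_channel(
    input: TensorType,
    scales: TensorType,
    zero_points: TensorType,
    axis: int,
    quant_min: int,  # unused: the range follows from `dtype` in ONNX
    quant_max: int,  # unused: the range follows from `dtype` in ONNX
    dtype: int,
) -> TensorType:
    # With 1-D scales/zero_points and the `axis` attribute, QuantizeLinear
    # performs per-axis quantization natively. Real code would also cast
    # zero_points to the target `dtype` so the output type is correct.
    return op.QuantizeLinear(input, scales, zero_points, axis=axis)
```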
❌ 25 Tests Failed (1 flaky).
from onnxscript.onnx_types import TensorType
from typing import Optional
Check notice
Code scanning / lintrunner: PYLINT/C0411 (wrong-import-order). To disable, use # pylint: disable=wrong-import-order
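The fix for C0411 is ordering alone: standard-library imports precede first-party ones. A sketch of the corrected order for the flagged lines:

```python
from typing import Optional  # standard library first

from onnxscript.onnx_types import TensorType  # first-party afterwards
```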
) -> TensorType:
    """Affine per channel quantization for the Tensor using the same quantization
    parameters for each channel/axis to map from floating point to quantized values.

Check warning
Code scanning / lintrunner: RUFF/W293 (blank line contains whitespace). See https://docs.astral.sh/ruff/rules/blank-line-with-whitespace
Check warning
Code scanning / lintrunner: EDITORCONFIG-CHECKER/editorconfig (same blank line)
) -> TensorType:
    """Affine per channel dequantization for the Tensor using the same quantization
    parameters for each channel/axis to map from quantized values to floating point values.

Check warning
Code scanning / lintrunner: RUFF/W293 (blank line contains whitespace). See https://docs.astral.sh/ruff/rules/blank-line-with-whitespace
Check warning
Code scanning / lintrunner: EDITORCONFIG-CHECKER/editorconfig (same blank line)
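All four warnings point at the blank line inside the docstrings; the fix is simply to strip its whitespace. An illustrative stub (not the PR's code):

```python
# W293 and the editorconfig check are satisfied once the blank line inside
# the docstring contains no spaces or tabs.
def _dequantize_per_channel_stub() -> None:
    """Affine per channel dequantization for the Tensor using the same quantization
    parameters for each channel/axis to map from quantized values to floating point values.

    Args and body omitted; only the whitespace-clean blank line above matters here.
    """
```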
This PR implements the missing quantize_per_channel and dequantize_per_channel operations in the torchlib quantized_decomposed module.

Changes
Added two new functions to onnxscript/function_libs/torch_lib/ops/quantized_decomposed.py:

- quantized_decomposed_quantize_per_channel: quantizes a float tensor using per-channel scales and zero_points (one value per channel), with an axis parameter to specify the quantization dimension
- quantized_decomposed_dequantize_per_channel: dequantizes using per-channel scales and optional zero_points; the zero_points parameter is Optional[TensorType], matching the PyTorch reference, and an optional output_dtype parameter is supported

Implementation Details
Both functions:

- Use the @torch_op decorator with trace_only=True
- Follow the PyTorch reference semantics in torch.ao.quantization.fx._decomposed
- Support axis and output_dtype parameters for per-axis quantization

The implementation leverages ONNX's native per-axis quantization support rather than reimplementing the tensor-manipulation logic from the PyTorch reference, making it more efficient and better aligned with ONNX best practices; a sketch of this delegation follows.
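A hedged sketch of that delegation for the dequantize side, under assumed torchlib imports and simplified dtype handling; the merged code may differ:

```python
from typing import Optional

from onnxscript.function_libs.torch_lib.registration import torch_op
from onnxscript.onnx_opset import opset18 as op
from onnxscript.onnx_types import TensorType


@torch_op("quantized_decomposed::dequantize_per_channel", trace_only=True)
def quantized_decomposed_dequantize_per_channel(
    input: TensorType,
    scales: TensorType,
    zero_points: Optional[TensorType],
    axis: int,
    quant_min: int,  # unused, see the review thread above
    quant_max: int,  # unused, see the review thread above
    dtype: int,  # unused here: the input tensor already carries its dtype
    output_dtype: int = -1,  # illustrative sentinel meaning "default"
) -> TensorType:
    # DequantizeLinear computes (input - zero_point) * scale along `axis`;
    # when zero_points is absent, ONNX defaults the zero point to 0.
    if zero_points is None:
        output = op.DequantizeLinear(input, scales, axis=axis)
    else:
        output = op.DequantizeLinear(input, scales, zero_points, axis=axis)
    if output_dtype != -1:
        output = op.Cast(output, to=output_dtype)
    return output
```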
Testing
Validated that:
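As one illustration of the kind of numeric check involved (a hypothetical sketch, not the PR's test code), the reference per-channel math can be reproduced in plain PyTorch and round-tripped:

```python
import torch

x = torch.randn(2, 3, 4)           # float input, channels on axis 1
scales = torch.tensor([0.1, 0.2, 0.3])
s = scales.reshape(1, -1, 1)       # broadcast per-channel scale along axis 1

# Reference-style quantize (zero_points = 0): scale, round, clamp, cast.
q = torch.clamp(torch.round(x / s), -128, 127).to(torch.int8)

# Dequantize and check the round-trip error stays within half a step.
roundtrip = q.to(torch.float32) * s
assert (x - roundtrip).abs().max() <= scales.max() / 2 + 1e-6
```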
Fixes #2389.