@vanbasten23 (Collaborator) commented Jun 2, 2025

This PR adds:

  • a Pallas kernel for w8a8 (8-bit weights, 8-bit activations) quantized matmul
  • a tuned table for TPU v6e
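
For readers unfamiliar with w8a8 matmul, the arithmetic can be sketched in plain Python. This is not the Pallas kernel from this PR. It is a minimal reference under stated assumptions: symmetric int8 quantization, one scale per activation row (per token), one scale per weight row (per output channel), and weights laid out as [out_features, in_features]. The names `quantize_rows` and `w8a8_matmul_reference` are hypothetical and do not come from the PR.

```python
def quantize_rows(matrix, qmax=127):
    """Symmetric 8-bit quantization with one scale per row (assumed scheme)."""
    quantized, scales = [], []
    for row in matrix:
        max_abs = max(abs(v) for v in row)
        # Avoid division by zero for an all-zero row.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        quantized.append([max(-qmax, min(qmax, round(v / scale))) for v in row])
        scales.append(scale)
    return quantized, scales


def w8a8_matmul_reference(x, w):
    """x: [tokens, in_features], w: [out_features, in_features].

    Returns a [tokens, out_features] float result computed from
    8-bit operands with integer accumulation.
    """
    x_q, x_scales = quantize_rows(x)  # per-token activation scales (assumption)
    w_q, w_scales = quantize_rows(w)  # per-output-channel weight scales (assumption)
    out = []
    for x_row, x_scale in zip(x_q, x_scales):
        out_row = []
        for w_row, w_scale in zip(w_q, w_scales):
            acc = sum(a * b for a, b in zip(x_row, w_row))  # integer accumulation
            out_row.append(acc * x_scale * w_scale)  # dequantize with both scales
        out.append(out_row)
    return out
```

A reference of this kind is mainly useful for checking a fused kernel's output within a tolerance, since quantization makes the result approximate rather than bit-exact against a float matmul.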

Test plan

  • pip install pytest
  • pytest pytorch/xla/test/test_quantized_matmul_pallas_kernel.py -s

cc: @yixinshi

@vanbasten23 requested review from yaochengji and lsy323 on June 2, 2025 18:47
@yaochengji (Collaborator) left a comment

Thanks Xiongfei for your contribution! Left a few comments.

@vanbasten23 requested review from yaochengji and lsy323 on June 4, 2025 16:30
@yaochengji (Collaborator) left a comment

LGTM, thanks!

@lsy323 (Collaborator) commented Jun 4, 2025

Quantization logic looks good to me!

@vanbasten23 (Collaborator, Author) commented

Thanks for the review!

@vanbasten23 merged commit 2504888 into master on Jun 4, 2025
22 checks passed