[release/2.7][ROCm][TunableOp] TunableOp TF32 support #2049
base: release/2.7
TF32 is supported by hipblaslt. Support added by pytorch#143549. This PR expands integration to the TunableOp feature.

Pull Request resolved: pytorch#145946
Approved by: https://github.com/pruthvistony, https://github.com/echen4096, https://github.com/yoyoyocmu
Co-authored-by: Nichols A. Romero <[email protected]>
(cherry picked from commit 0c8ec26)
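For readers who want to try the combination described above, here is a minimal sketch of turning on TunableOp online tuning together with TF32 on a ROCm build. The `PYTORCH_TUNABLEOP_*` variables and `torch.backends.cuda.matmul.allow_tf32` are existing PyTorch settings. The `HIPBLASLT_ALLOW_TF32` opt-in is an assumption about the ROCm build in use, and the helper names `tunableop_tf32_env` and `run_tf32_matmul` are hypothetical, introduced only for illustration.

```python
def tunableop_tf32_env(results_file="tunableop_results.csv"):
    """Return environment variables for online tuning of TF32 GEMMs.

    They should be placed in os.environ before torch is imported.
    """
    return {
        "PYTORCH_TUNABLEOP_ENABLED": "1",            # turn TunableOp on
        "PYTORCH_TUNABLEOP_TUNING": "1",             # tune GEMMs not yet in the results file
        "PYTORCH_TUNABLEOP_FILENAME": results_file,  # where tuning results are written
        "HIPBLASLT_ALLOW_TF32": "1",                 # assumption: ROCm-side opt-in for TF32
    }


def run_tf32_matmul(m=1024, n=1024, k=1024):
    """Run one float32 GEMM with TF32 allowed (needs a ROCm build and a GPU)."""
    import torch  # imported lazily so the environment is configured first

    torch.backends.cuda.matmul.allow_tf32 = True
    a = torch.randn(m, k, device="cuda", dtype=torch.float32)
    b = torch.randn(k, n, device="cuda", dtype=torch.float32)
    return a @ b
```

A typical use would be `os.environ.update(tunableop_tf32_env())` at the very top of a script, followed by a call to `run_tf32_matmul()`; whether the GEMM is actually tuned as TF32 depends on the hardware and on the hipBLASLt version installed.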
This PR includes additional enhancements to TF32 support in TunableOp.
- OpSignature now differentiates between float32 and tf32 data types.
- Offline tuning now supports TF32.
- Unit tests for online and offline tuning of TF32.

Pull Request resolved: pytorch#149088
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <[email protected]>
(cherry picked from commit 11d4438)
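To illustrate why a distinct signature for TF32 matters, the toy model below keys tuning results by operator signature and problem shape. It is a sketch only, not TunableOp's implementation: the `op_signature` helper, the `TuningCache` class, the signature string format, and the solution name are all hypothetical.

```python
def op_signature(dtype_name, allow_tf32, layout="NN"):
    """Hypothetical signature builder: float GEMMs run as TF32 get their own type tag."""
    type_tag = "tf32" if (dtype_name == "float" and allow_tf32) else dtype_name
    return f"GemmTunableOp_{type_tag}_{layout}"


class TuningCache:
    """Toy stand-in for a tuning results table keyed by (signature, shape)."""

    def __init__(self):
        self._best = {}

    def record(self, signature, shape, solution):
        self._best[(signature, shape)] = solution

    def lookup(self, signature, shape):
        return self._best.get((signature, shape))


cache = TuningCache()
shape = (1024, 1024, 1024)

# A result tuned for plain float32 is stored under the float32 signature.
cache.record(op_signature("float", allow_tf32=False), shape, "solution_A")

# The same shape run as TF32 misses the cache instead of reusing the float32 result.
tf32_hit = cache.lookup(op_signature("float", allow_tf32=True), shape)
fp32_hit = cache.lookup(op_signature("float", allow_tf32=False), shape)
```

Without separate signatures, the fastest float32 kernel would be silently reused for TF32 runs (and vice versa), even though the two modes can have different best solutions.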
Jenkins build for 0514332de8a5c32775566c9ef5c7b2063280deff commit finished as FAILURE
This was a feature that just missed the PyTorch 2.7 cutoff date. Jeff said that it would be worth backporting since some customers are asking for it.
Jenkins build for 0514332de8a5c32775566c9ef5c7b2063280deff commit finished as FAILURE. Detected error during Pytorch building.
FWIW, I ran all TunableOp tests locally on MI300 and they all pass.
94b4d76 to 0be82c5
0be82c5 to fdf85e1
fdf85e1 to 6fd4078
TunableOp TF32 support, enabled via two cherry-picks from upstream:
pytorch#145946
pytorch#149088