Skip to content

Conversation

@csteegz
Copy link

@csteegz csteegz commented Oct 16, 2025

Summary:
Traces for ads models show an absurd amount of show an absurd amount of aten::select calls inside fbgemm::permute_multi_embedding. Use dataptr with the stride to avoid these select calls (indexing into a tensor turns into an aten::select call).

Testing with the model shows decrease in trace time from 6MS to 70 uS

Differential Revision: D83609157

Summary:
Traces for ads models show an absurd amount of show an absurd amount of aten::select calls inside fbgemm::permute_multi_embedding. Use dataptr with the stride to avoid these select calls (indexing into a tensor turns into an aten::select call).

Testing with the model shows decrease in trace time from 6MS to 70 uS

Differential Revision: D83609157
@netlify
Copy link

netlify bot commented Oct 16, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit afcfa1e
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68f1543f6669b800085332cf
😎 Deploy Preview https://deploy-preview-5017--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@meta-codesync
Copy link
Contributor

meta-codesync bot commented Oct 16, 2025

@csteegz has exported this pull request. If you are a Meta employee, you can view the originating Diff in D83609157.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant