
Conversation

@GregoryComer (Member) commented Oct 31, 2025

Our current SpecPropPass doesn't properly capture the effect of guards in the shape environment due to double-tracing certain ops. The problem looks like this:

  • Every time we trace through the graph, we generate new symints.
  • That's fine, since shape_env will pick up guards during the retrace.
  • The problem is that SpecPropPass does this twice: once to generate the spec, and once more by calling super().call_operator(...) (https://github.com/.../exir/passes/spec_prop_pass.py...).
  • The tensor spec gets the symint from the first trace, but the graph and guards use the second.
  • Hence the tensor spec doesn't pick up on the guards.
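The mismatch described above can be sketched without torch at all. The sketch below uses hypothetical stand-ins (`ShapeEnv`, `trace`) for the real tracing machinery; the only point it demonstrates is that two traces allocate two different symints, so a spec captured from the first trace cannot see guards keyed to the second:

```python
import itertools

# Hypothetical stand-in for a tracer that allocates a fresh symbolic int
# (symint) on every trace, the way fake-tensor tracing does.
_counter = itertools.count()

class ShapeEnv:
    def __init__(self):
        # symint -> guard recorded while tracing
        self.guards = {}

def trace(shape_env):
    """Each trace creates a brand-new symint and records its guard."""
    symint = next(_counter)
    shape_env.guards[symint] = f"u{symint} >= 0"
    return symint

shape_env = ShapeEnv()

# Buggy behavior: trace twice.
spec_symint = trace(shape_env)   # first trace: symint stored in the TensorSpec
graph_symint = trace(shape_env)  # second trace: symint used by the graph/guards

# The spec's symint is not the one the graph's guards refer to.
assert spec_symint != graph_symint
```

This is only a model of the control flow, not the real ShapeEnv; the actual guard bookkeeping in torch is far richer.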

To resolve this, I've updated the SpecPropPass to re-trace the graph and then generate specs based on the meta values, not the traced ProxyValues (thanks @angelayi for the suggestion). This resolves the issue.
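A torch-free sketch of the fixed shape of the pass (all names here — `Node`, `TensorSpec`, `run_pass` — are hypothetical stand-ins, not the real EXIR classes): trace once, then derive every spec from the node's meta value so specs and guards come from the same trace:

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """Stand-in for a graph node carrying a meta dict, like fx.Node."""
    name: str
    meta: dict = field(default_factory=dict)

@dataclass
class TensorSpec:
    shape: tuple

def run_pass(nodes):
    """Build specs from meta values produced by a single retrace."""
    specs = {}
    for node in nodes:
        # Single source of truth: the meta value from the one retrace,
        # rather than a second trace that would mint fresh symints.
        specs[node.name] = TensorSpec(shape=node.meta["val"])
    return specs

nodes = [
    Node("x", meta={"val": (2, 3)}),
    Node("y", meta={"val": ("u0", 3)}),  # "u0" models an unbacked symint
]
specs = run_pass(nodes)
assert specs["y"].shape == ("u0", 3)  # same symint the guards refer to
```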

I originally saw this issue with the NMS torchvision op, but to avoid adding a new dependency to the core EXIR tests, I've written a test with a custom op that uses an unbacked symint in the meta kernel's output shape to replicate the bug in the same way.

Differential Revision: D85913581

@pytorch-bot (bot) commented Oct 31, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15485

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 1996324 with merge base 9981e41:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 31, 2025
@meta-codesync (bot) commented Oct 31, 2025

@GregoryComer has exported this pull request. If you are a Meta employee, you can view the originating Diff in D85913581.

@GregoryComer GregoryComer marked this pull request as draft October 31, 2025 00:24
@GregoryComer GregoryComer added ciflow/trunk release notes: exir Changes to any dialects and passes on these dialects, such as memory planning labels Oct 31, 2025
pytorch-bot bot pushed a commit that referenced this pull request Nov 6, 2025
Summary: Pull Request resolved: #15485

Differential Revision: D85913581
GregoryComer added a commit to GregoryComer/executorch that referenced this pull request Nov 6, 2025
Summary: Pull Request resolved: pytorch#15485

Differential Revision: D85913581
GregoryComer added a commit to GregoryComer/executorch that referenced this pull request Nov 6, 2025
@GregoryComer GregoryComer requested a review from angelayi November 7, 2025 00:09
@GregoryComer (Member, Author) commented

One additional note: the aliasing analysis feels pretty fragile as-is. I fixed several subtle issues (luckily caught by CI) where my changes accidentally planned two separate tensors when they should alias / share one TensorSpec.

I'm wondering if we should rewrite this pass again to either rely on ProxyValue reference equality or otherwise introduce proper aliasing analysis, as opposed to hard-coding that, for example, getitem and output always alias their argument.

This seems like it could get messy with non-functional custom ops or defunctionalization, in general. @JacobSzwejbka @angelayi what are your thoughts on this?
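The reference-equality idea floated above can be sketched as follows (all names here — `Value`, `SpecTable` — are hypothetical, not the real ExecuTorch classes): key specs on object identity, so any node that forwards the same value object shares one spec, without hard-coding per-op aliasing rules:

```python
class Value:
    """Stand-in for a ProxyValue produced during tracing."""
    pass

class SpecTable:
    def __init__(self):
        # id(value) -> spec; keyed on identity, not structural equality
        self._specs = {}

    def spec_for(self, value):
        # Nodes carrying the *same* value object share one spec.
        key = id(value)
        if key not in self._specs:
            self._specs[key] = {"shape": None}
        return self._specs[key]

table = SpecTable()
v = Value()
s1 = table.spec_for(v)  # e.g. the op that produced v
s2 = table.spec_for(v)  # e.g. a getitem that forwards v
assert s1 is s2         # aliases share one spec

w = Value()
assert table.spec_for(w) is not s1  # distinct values get distinct specs
```

One caveat with this sketch: keying on `id()` only works while the value objects stay alive for the duration of the pass, and it says nothing about ops that alias part of an input (views), which would still need real aliasing analysis.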

@GregoryComer GregoryComer changed the title Fix shape_env handling in SpecPropPass Fix double-tracing in SpecPropPass Nov 7, 2025
GregoryComer added a commit to GregoryComer/executorch that referenced this pull request Nov 8, 2025
Summary: Pull Request resolved: pytorch#15485

Differential Revision: D85913581
GregoryComer added a commit to GregoryComer/executorch that referenced this pull request Nov 10, 2025
GregoryComer added a commit to GregoryComer/executorch that referenced this pull request Nov 11, 2025
@meta-codesync (bot) commented Nov 11, 2025

@GregoryComer has imported this pull request. If you are a Meta employee, you can view this in D85913581.

@GregoryComer GregoryComer marked this pull request as ready for review November 11, 2025 05:10
GregoryComer added a commit to GregoryComer/executorch that referenced this pull request Nov 12, 2025
Summary: Pull Request resolved: pytorch#15485

Differential Revision: D85913581
@GregoryComer (Member, Author) commented

Note that the moshi and zephyr size test failures are pre-existing.

