Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GH-128682: Make PyStackRef_CLOSE escaping. #129404

Merged
merged 9 commits into from
Feb 3, 2025

Conversation

markshannon
Copy link
Member

@markshannon markshannon commented Jan 28, 2025

Makes PyStackRef_CLOSE escaping.

PyStackRef_CLOSE kills the variable, and then escapes.
This PR adds an EscapingCall dataclass to the analysis, with a new kill attribute. PyStackRef_CLOSE and PyStackRef_XCLOSE set the kill attribute, so the code generator knows to kill the argument before spilling.

Doing so, means that pop_N_error labels no longer work as we need to adjust the stack pointer before all PyStackRef_CLOSE rather than once at the jump to error.

Performance numbers to come...

Copy link
Member

@Fidget-Spinner Fidget-Spinner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you bench this on Win64 as well? I suspect the _PyFrame_SetStackPointer calls will not get inlined, and will produce a tremendous slowdown on Win64.

You should define them as a macro on that platform to be safe.

@@ -330,6 +337,16 @@ def convert_stack_item(
cond = replace_op_arg_1
return StackItem(item.name, item.type, cond, item.size)

def check_unused(stack: list[StackItem], input_names: dict[str, lexer.Token]):
"Unused items cannot be on the stack above used, non-peek items"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What's the rationale for this?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should probably remove any special treatment of variables named "unused" as they get skipped by the analysis, leading to some incorrect stacks. This is just avoiding one of those cases.

@markshannon markshannon marked this pull request as draft January 28, 2025 15:51
@markshannon
Copy link
Member Author

Benchmarking shows a 0.2% speedup on Linux (noise) and 1.2% slowdown on Windows.

The slowdown on Windows includes an 8% slowdown on spectral_norm which heavily depends on a few instructions. The generated code for those instructions is unchanged, so this is likely just an artifact of code layout.

However, even with spectral_norm discounted there does seem to be a general slowdown.

Looking at the generated code and the latest execution counts, the instructions changed account for 2-3% of the total instructions executed which I would expect to cause a slowdown of no more than 0.2%.
While a 0.2% slowdown is not ideal, it is acceptable given the advantages of precise spilling.

@Fidget-Spinner Fidget-Spinner added the 🔨 test-with-refleak-buildbots Test PR w/ refleak buildbots; report in status section label Jan 30, 2025
@bedevere-bot
Copy link

🤖 New build scheduled with the buildbot fleet by @Fidget-Spinner for commit 5dc7977 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F30617%2Fmerge

If you want to schedule another build, you need to add the 🔨 test-with-refleak-buildbots label again.

@bedevere-bot bedevere-bot removed the 🔨 test-with-refleak-buildbots Test PR w/ refleak buildbots; report in status section label Jan 30, 2025
@Fidget-Spinner
Copy link
Member

Looks good, but worried about the close and stackref changes. Running with refleak buildbots.

@markshannon markshannon marked this pull request as ready for review January 30, 2025 16:15
@markshannon
Copy link
Member Author

The buildbots are failing test_opcache, which fails on main.

This fails on main: ./python -m test -v -R3:3 test_opcache.
Oddly, this doesn't fail: ./python -m test -v -F test_opcache

@markshannon markshannon merged commit 808071b into python:main Feb 3, 2025
69 checks passed
@Fidget-Spinner
Copy link
Member

Wait did we address the slowdowns on Windows?

srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this pull request Feb 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants