Commit a6080f7

[Build] Add linker script optimization (pytorch#121975)

This PR adds a linker script optimization based on prioritized symbols that can be extracted from the profiles of popular workloads. The present linker script was generated to target ARM+CUDA and can be extended later if necessary. The reason we target ARM is shown below:

> PyTorch and other applications that access more than 24x 2MB code regions in quick succession can result in performance bottlenecks in the CPU front-end. The link-time optimization improves executable code locality and improves performance. We recommend always turning on the optimization for PyTorch and other applications that behave similarly.

Pull Request resolved: pytorch#121975
Approved by: https://github.com/ptrblck, https://github.com/atalman
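As an illustration of the idea described above, here is a minimal sketch of how a list of prioritized sections could be placed first inside the `.text` output section of a GNU ld linker script. It is not the code from `tools/setup_helpers/generate_linker_script.py`: the function name `insert_prioritized_sections`, the toy base script, and the section names in `HOT` are all hypothetical, and the sketch assumes each prioritized entry is already an input-section name such as `.text.<symbol>` (which in turn assumes the objects were compiled with `-ffunction-sections`).

```python
import re


def insert_prioritized_sections(base_script: str, sections: list[str]) -> str:
    """Return base_script with one input-section pattern per entry
    placed first inside the .text output section (hypothetical helper)."""
    out = []
    in_text = False   # True once the ".text :" header has been seen
    inserted = False  # True once the prioritized patterns are written
    for line in base_script.splitlines():
        out.append(line)
        if inserted:
            continue
        # Match the ".text :" output-section header, not ".text.foo :".
        if re.match(r"\s*\.text\s*:", line):
            in_text = True
        # The opening brace may be on the header line or on the next one.
        if in_text and "{" in line:
            out.extend(f"    *({name})" for name in sections)
            inserted = True
    if not inserted:
        raise ValueError("no .text output section found")
    return "\n".join(out) + "\n"


# Toy stand-in for a default linker script (assumption, heavily simplified).
BASE = """SECTIONS
{
  .text :
  {
    *(.text.unlikely .text.*_unlikely)
    *(.text .text.*)
  }
}
"""

# Hypothetical hot sections, in priority order.
HOT = [".text.hot_kernel_launch", ".text.hot_dispatch"]

script = insert_prioritized_sections(BASE, HOT)
print(script)
```

Because GNU ld places input sections in the order the patterns appear, listing the hot sections first keeps them adjacent in the final binary, which is the code-locality effect the commit message refers to. A script produced this way would typically be handed to the linker with `-T`.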

1 parent 178ce14 commit a6080f7

3 files changed: +1270 -0 lines changed

- cmake/prioritized_text.txt
- setup.py
- tools/setup_helpers/generate_linker_script.py
