Skip to content

Commit a6080f7

Browse files
Aidyn-Apytorchmergebot
authored andcommitted
[Build] Add linker script optimization (pytorch#121975)
This PR adds a linker script optimization based on prioritized symbols that can be extracted from the profiles of popular workloads. The present linker script was generated to target ARM+CUDA and later can be extended if necessary. The reason we target ARM is shown below: > PyTorch and other applications that access more than 24x 2MB code regions in quick succession can result in performance bottlenecks in the CPU front-end. The link-time optimization improves executable code locality and improve performance. We recommend turning on the optimization always for PyTorch and other application that behaves similarly. Pull Request resolved: pytorch#121975 Approved by: https://github.com/ptrblck, https://github.com/atalman
1 parent 178ce14 commit a6080f7

File tree

3 files changed

+1270
-0
lines changed

3 files changed

+1270
-0
lines changed

0 commit comments

Comments
 (0)