In JuliaGPU/CUDA.jl#1895, I made the size tuple of `CuDeviceArray` 32 bits so that we can emit better code (lowering register pressure, making it possible to execute compute and indexing instructions in parallel, etc.). However, the NVPTX back-end defaults to 64-bit pointer indexing, so 64-bit GEPs get introduced both by the front-end and during optimization. I tried to change that by specifying a 32-bit pointer index size in the data layout, #444, but that breaks 64-bit indices, which can still get reintroduced by optimization (see e.g. #461).
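For context, the pointer index size is the optional fourth field of a `p` specification in LLVM's data layout string. A rough sketch of the difference (the exact layout strings below are illustrative, not necessarily what GPUCompiler emits):

```llvm
; default nvptx64-style data layout: 64-bit pointers, 64-bit index type
target datalayout = "e-i64:64-i128:128-v16:16-v32:32-n16:32:64"

; with an explicit pointer spec "p:64:64:64:32": pointers stay 64 bits wide,
; but address computations use a 32-bit index type (the approach from #444)
target datalayout = "e-p:64:64:64:32-i64:64-i128:128-v16:16-v32:32-n16:32:64"
```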
Either we try this again on LLVM 17 (where a bug that was introducing 64-bit GEP offsets has been fixed), or we instead create an optimization pass that demotes GEP indices to 32 bits where possible (e.g., when they are constants, or come from the size field of a device array); see the sketch below.
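A minimal before/after sketch in LLVM IR of what such a demotion could look like, assuming the index is known to fit in 32 bits (e.g. because it came from the 32-bit size tuple):

```llvm
; before: the 32-bit index gets widened, producing a 64-bit GEP offset
%idx64 = sext i32 %idx to i64
%ptr   = getelementptr inbounds float, ptr addrspace(1) %base, i64 %idx64

; after demotion: feed the 32-bit index to the GEP directly, keeping the
; offset computation narrow in the IR
%ptr2  = getelementptr inbounds float, ptr addrspace(1) %base, i32 %idx
```

Whether that then translates into 32-bit address arithmetic in the generated PTX also depends on the pointer index size in the data layout and on the back-end, so this only shows the shape of the transformation, not a guaranteed codegen improvement.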