🐛 Describe the bug
The v0.6.0 release notes mention that the build option ENABLE_XNNPACK_WEIGHTS_CACHE was introduced for XNNPACK to enable weight sharing across multiple methods within the same model. However, the Vulkan tutorial and documentation don’t appear to describe an equivalent option for weight sharing.
Does the Vulkan backend in v0.6.0 support weight sharing (either through a CMake build flag or a runtime option)?
If not, are there plans to introduce weight sharing or a similar weight-caching feature for the Vulkan backend in a future release?
In the meantime, what optimizations or best practices (e.g., memory layout tweaks, shader tuning) would you recommend to improve Vulkan performance?
Versions
ExecuTorch v0.6.0