[Feature]: Offload compression #2031
Labels
change request
A code change requested by external or internal stakeholders
triaged
The issue has been reviewed by a team member and prioritized
Suggestion Description
TensileCreateLibrary should support offload compression.
rocBLAS uses TensileCreateLibrary and can reach 3 GB of hsaco, co, and dat/yaml files under lib/rocblas/library/. Compressing each file individually with zstd reduces this to ~150 MB. Compressing everything into a single archive with 7zip reduces this to 70 MB.
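To reproduce this kind of per-file versus single-archive comparison on a local install, a minimal sketch follows. It is not how the numbers above were measured: it uses Python's standard-library lzma module (the algorithm family 7zip uses by default) as a stand-in for both zstd and 7zip, so absolute sizes will differ. The function names and the example path are hypothetical.

```python
import io
import lzma
import tarfile
from pathlib import Path


def raw_size(root: Path) -> int:
    """Total uncompressed size in bytes of all files under root."""
    return sum(p.stat().st_size for p in root.rglob("*") if p.is_file())


def per_file_compressed_size(root: Path) -> int:
    """Total size when every file under root is compressed on its own.

    Assumption: lzma stands in for zstd here, so this only approximates
    the per-file zstd figure quoted in the issue.
    """
    total = 0
    for path in sorted(root.rglob("*")):
        if path.is_file():
            total += len(lzma.compress(path.read_bytes()))
    return total


def single_archive_compressed_size(root: Path) -> int:
    """Size of one solid .tar.xz archive holding every file under root.

    A solid archive lets the compressor reuse redundancy across files,
    which per-file compression cannot do.
    """
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:xz") as archive:
        archive.add(root, arcname=root.name)
    return len(buffer.getvalue())


if __name__ == "__main__":
    # Hypothetical install location; adjust to the local ROCm layout.
    library_dir = Path("/opt/rocm/lib/rocblas/library")
    if library_dir.is_dir():
        print("raw bytes:           ", raw_size(library_dir))
        print("per-file bytes:      ", per_file_compressed_size(library_dir))
        print("single-archive bytes:", single_archive_compressed_size(library_dir))
```

The single-archive variant reads the whole directory into memory, which is fine for a quick measurement but would need streaming to disk for a multi-gigabyte library directory.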
clang-offload-bundler and clang --offload-arch=... already support compression with the --compress and --offload-compress flags. rocBLAS tries to compress its kernels with these options (ROCm/rocBLAS@bb81a83), but this has almost no effect because most of the space is used by Tensile, which unbundles the libraries.
Operating System
N/A
GPU
N/A
ROCm Component
Tensile