library for accelerating mixed precision matrix multiply-accumulate operations
https://github.com/ROCm/rocWMMA