Support for global load/store padding #44

Hprairie · 2024-07-07T17:43:12Z

Hi, I was wondering whether there are any plans to add padding support to global loads and stores. This would bring the functionality of ThunderKittens a lot closer to Triton while keeping the flexibility of CUDA. Without it, ThunderKittens is rather hard to justify using, since it doesn't make sense to pad the sequence in HBM before passing it to a ThunderKittens kernel.
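For illustration, here is a minimal sketch of the kind of bounds-checked ("padded") tile load and store being asked about, similar in spirit to a Triton `tl.load` with `mask` and `other`. This is not the ThunderKittens API: `TILE`, `load_tile_padded`, and `store_tile_masked` are hypothetical names. It is written as plain C++ loops for brevity; in a real kernel these would presumably be `__device__` functions with the loops spread across threads.

```cpp
#include <cstddef>

// Hypothetical tile edge length, chosen only for this illustration.
constexpr int TILE = 4;

// Copy a TILE x TILE tile with top-left corner (row0, col0) out of a
// row-major rows x cols matrix. Positions that fall outside the matrix
// are filled with pad_value instead of being read from memory.
// Assumes row0 >= 0 and col0 >= 0.
inline void load_tile_padded(const float* src, int rows, int cols,
                             int row0, int col0,
                             float* tile, float pad_value = 0.0f) {
    for (int r = 0; r < TILE; ++r) {
        for (int c = 0; c < TILE; ++c) {
            const int gr = row0 + r;  // global row index
            const int gc = col0 + c;  // global column index
            const bool in_bounds = gr < rows && gc < cols;
            tile[r * TILE + c] =
                in_bounds ? src[static_cast<std::size_t>(gr) * cols + gc]
                          : pad_value;
        }
    }
}

// Write a tile back to the matrix, skipping positions that fall outside
// it, so the padded elements never reach global memory.
// Assumes row0 >= 0 and col0 >= 0.
inline void store_tile_masked(float* dst, int rows, int cols,
                              int row0, int col0, const float* tile) {
    for (int r = 0; r < TILE; ++r) {
        for (int c = 0; c < TILE; ++c) {
            const int gr = row0 + r;
            const int gc = col0 + c;
            if (gr < rows && gc < cols) {
                dst[static_cast<std::size_t>(gr) * cols + gc] =
                    tile[r * TILE + c];
            }
        }
    }
}
```

With something like this, a sequence whose length is not a multiple of the tile size could be handed to the kernel as-is, rather than being padded to a tile multiple in HBM first.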
