Skip to content

Version 1.1.0

Compare
Choose a tag to compare
@alazzaro alazzaro released this 09 Apr 03:44

This DBCSR version introduces predictive modeling for the CUDA generation of the kernels. A decision tree model is built from the data obtained by autotuning certain (m, n, k)-triplets and is used to predict optimal parameters for unseen (m, n, k)-triplets.

Developed as a part of the PASC proposal "Sparse Tensor Linear Algebra Library"

Thanks to @shoshijak for the hard work to implement it!