Version 1.1.0
This DBCSR version introduces predictive modeling for generating the CUDA kernels. A decision tree model is trained on data obtained by autotuning certain (m, n, k)-triplets and is then used to predict optimal kernel parameters for (m, n, k)-triplets that were not autotuned.
Developed as part of the PASC proposal "Sparse Tensor Linear Algebra Library".
Thanks to @shoshijak for the hard work of implementing it!
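
To illustrate the idea of predicting kernel parameters from autotuned (m, n, k)-triplets, here is a minimal sketch of a decision tree classifier in plain Python. It is not DBCSR's implementation. The `TUNED` data is invented for illustration, and the single predicted parameter (`tile`) and the helper names `gini`, `best_split`, `build` and `predict` are hypothetical stand-ins for whatever the real model uses.

```python
from collections import Counter

# Invented toy "autotuning results": (m, n, k) -> best tile size.
# Both the values and the "tile" parameter are hypothetical.
TUNED = [
    ((4, 4, 4), 1),
    ((5, 5, 5), 1),
    ((8, 8, 8), 2),
    ((13, 13, 13), 2),
    ((23, 23, 23), 4),
    ((32, 32, 32), 4),
]


def gini(labels):
    """Gini impurity of a non-empty list of labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split(rows):
    """Return (score, feature, threshold) with the lowest weighted impurity."""
    best = None
    for f in range(3):  # feature 0 = m, 1 = n, 2 = k
        values = sorted({x[f] for x, _ in rows})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2  # midpoint between neighbouring values
            left = [y for x, y in rows if x[f] <= t]
            right = [y for x, y in rows if x[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best


def build(rows):
    """Grow the tree until every leaf is pure or cannot be split further."""
    labels = [y for _, y in rows]
    split = None if len(set(labels)) == 1 else best_split(rows)
    if split is None:
        return Counter(labels).most_common(1)[0][0]  # leaf: majority label
    _, f, t = split
    left = [r for r in rows if r[0][f] <= t]
    right = [r for r in rows if r[0][f] > t]
    return (f, t, build(left), build(right))  # inner node


def predict(tree, mnk):
    """Walk from the root to a leaf for an (m, n, k)-triplet."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if mnk[f] <= t else right
    return tree


tree = build(TUNED)
```

With this toy data, `predict(tree, (16, 16, 16))` returns a tile size for a triplet that never appeared in `TUNED`, which mirrors how a trained model can supply parameters for triplets that were not autotuned.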