StarGemm

实践与比较Cuda平台的高性能矩阵通用乘。

本仓库主要参考reed-lau老师的实现 https://github.com/reed-lau/cute-gemm

过了一遍实现的同时，在原实现的基础上添加了诸多注释，便于初学者理解

使用方法

git clone https://github.com/StarrickLiu/StarGemm.git
git submodule update
make
cd build
./gemm-starrick

基于Cutlass实现并比较stream-k等方法在不同规模Gemm上的性能

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
3rd		3rd
build		build
examples		examples
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
Makefile		Makefile
README.md		README.md