Overview

The vectorization idea of gemm by SIMD instructions comes from the zhihu (https://zhuanlan.zhihu.com/p/383115932) and the (https://github.com/pigirons/sgemm_hsw).

zhihu gives a detailed description of the methods with perspicuous pictures.

Build

make -j8

Init the input

./init.sh

This is used for initialization of the input elements (Integer and Float values). Input matrices of A[m][n] and B[n][k] are read from the *.random files. Make sure the m*n and n*k are less than the element number of .random files.

Run the gemm

./exe_gemm_float m n k res It means that C[m][k]=A[m][n]xB[n][k]. As is shown in zhihu, the n should be a multiple of 24 to fully use the 16 ymm logical registers.

For example:
./exe_gemm_float 2400 2400 2400 res

./exe_gemm_float_multiple 24 24 64 res

./run.sh

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
L1_cache_gemm		L1_cache_gemm
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
check.cpp		check.cpp
cublas_utils.h		cublas_utils.h
gemm_cqx.cpp		gemm_cqx.cpp
gemm_cublas.cu		gemm_cublas.cu
gemm_cuda.cu		gemm_cuda.cu
gemm_float.cpp		gemm_float.cpp
gemm_float_multiple.cpp		gemm_float_multiple.cpp
gemm_int.cpp		gemm_int.cpp
gemm_tile_fusion.cu		gemm_tile_fusion.cu
generate_random.cpp		generate_random.cpp
init.sh		init.sh
makefile_for_gemm_cqx		makefile_for_gemm_cqx
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Build

Init the input

Run the gemm

About

Releases

Packages

Languages

XiaomingXu1995/gemm

Folders and files

Latest commit

History

Repository files navigation

Overview

Build

Init the input

Run the gemm

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages