Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tried to build this code on Visual Studio, lots of compile problems, any suggestions? #1

Open
JacquesRoth opened this issue Dec 25, 2024 · 3 comments

Comments

@JacquesRoth
Copy link

a_frag, b_frag, and c_frag declarations give undefined errors, adding "using namespace nvcuda;" did not help. Adding "include <mma.h> did not help. Is there something that must be configured in Visual Studio to enable tensor core operations?
Invocation of naive_tensor_mat_mul_kernel<<<dim_grid... gives expected an expression error.
"int warpM = blockIdx.x;" gives blockIdx undefined error!

Quite a few other errors. Has anyone tried this code with Visual Studio for Cuda?

@tgautam03
Copy link
Owner

tgautam03 commented Dec 27, 2024

It could be a driver issue (assuming your GPU has tensor cores). Did you compile it using the make file I provided?

Try compiling using the following commands:

make 00_benchmark_cuBLAS.out

make 01_benchmark_naive.out

@JacquesRoth
Copy link
Author

JacquesRoth commented Dec 27, 2024 via email

@tgautam03
Copy link
Owner

tgautam03 commented Dec 28, 2024

I left two commands in the previous comment. Please try them out in the terminal (ensuring that the terminal is opened in tgemm folder) and see if that works.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants