[Issue]: Build failure for gfx908 when building without optimization flags #1759
Comments
It looks like device_gemm_dpp only supports gfx10/gfx11 and should not be built for gfx9: composable_kernel/include/ck/tensor_operation/gpu/device/impl/device_gemm_dpp.hpp Lines 169 to 176 in 689a5ae
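To illustrate the kind of architecture check being described, here is a minimal sketch. It is not the code from device_gemm_dpp.hpp: the helper names `has_prefix` and `supports_gemm_dpp` are hypothetical, and the only rule it encodes is the one stated in this thread (gfx10/gfx11 supported, gfx9 not).

```cpp
#include <string_view>

// Hypothetical helper, not part of composable_kernel: true when the
// architecture name starts with the given prefix.
constexpr bool has_prefix(std::string_view arch, std::string_view prefix) {
    // substr clamps the length, so short names are handled safely.
    return arch.substr(0, prefix.size()) == prefix;
}

// Assumption taken from the comment in this thread: the DPP GEMM is only
// usable on gfx10 and gfx11 targets.
constexpr bool supports_gemm_dpp(std::string_view arch) {
    return has_prefix(arch, "gfx10") || has_prefix(arch, "gfx11");
}
```

Under this sketch, both targets from the reported configuration (gfx908 and gfx90a) are rejected, which matches the suggestion that the instance should be excluded from gfx9 builds rather than compiled and left to fail.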
device_gemm_dpp only supports gfx10/gfx11 and will fail to build for gfx9. https://github.com/ROCm/composable_kernel/blob/689a5ae45be802f51fc947a9f92208dcfb143f77/include/ck/tensor_operation/gpu/device/impl/device_gemm_dpp.hpp#L169-L176 Fixes ROCm#1759
Docker reproduction steps on 6ef8d3c:
I accidentally removed the default release flags from the release build by trying to use
It's odd that the build requires these flags to succeed. Does this indicate an issue? Feel free to close if it's expected. I'm quite curious why missing optimization flags causes
I can reproduce this issue.
Hi @LunNova, thanks for reporting the issue. I was able to reproduce the error using the steps you provided. However, following the build instructions from the documentation, I was able to build CK without any errors. Could you please follow the build instructions available here and let me know if you run into any issues? Thanks!
I am able to build normally. However, it seems very odd for a build to depend on optimization flags, and it is worth investigating or documenting why it fails without them.
Related to #1371
Problem Description
The develop branch 6ef8d3c fails to build with
-DGPU_TARGETS="gfx908;gfx90a" -DCMAKE_CXX_FLAGS_RELEASE=' '
The failure occurs in device_gemm_dpp_f16_f16_f16_km_nk_mn_instance.
This is on top of a ROCm 6.3 stack built from the rocm-6.3.0 tags.
Operating System
NixOS
CPU
EPYC 7773
GPU
AMD Instinct MI100