Remove most special handling of MKL in CMake configuration #1149

msimberg · 2024-05-28T08:53:19Z

From version 0.5 onwards the spack package uses spack's blas/lapack libraries
The DLAF_WITH_MKL_LEGACY option is completely removed
MKL is auto-detected

This is currently untested, but early feedback more than welcome and the general structure is in place!

The first version of the MKL auto-detection simply uses try_compile with LAPACK_LIBRARY and a call to mkl_set_num_threads_local. It uses the result of this try_compile as the default value of DLAF_WITH_MKL. It's important to note that currently this will only be tested on the first configuration, so if the first configuration is wrong, and one then changes LAPACK_LIBRARY to have the correct paths, DLAF_WITH_MKL will still be off.

On the other hand, the above method means that one can simply set DLAF_WITH_MKL manually to on if one expects MKL to be there, and compilation will then fail if it's not there (instead of quietly disabling it).

The logic could be changed/improved to have different behaviour in the above case.

An alternative method is to simply use __has_include in single_threaded_blas.cpp. This doesn't allow overriding and is silent in either case (MKL found or not found).

msimberg · 2024-05-28T08:55:30Z

cscs-ci run

CMakeLists.txt

ci/.gitlab-ci.yml

ci/docker/build.Dockerfile

spack/packages/dla-future/package.py

msimberg · 2024-05-28T09:18:16Z

cscs-ci run

rasolca

The include directory is needed, otherwise non-spack builds are not possible.

albestro

LGTM

msimberg · 2024-05-29T09:12:53Z

cscs-ci run

CMakeLists.txt

msimberg · 2024-05-29T12:36:14Z

I've changed back the CUDA CI to use -Werror=all-warnings since we now don't get linker warnings with the newest version of Umpire. We will still get the warnings on builds with older versions of Umpire, but outside of CI those warnings should not be fatal. I opened #1154 as a reminder to potentially look at the other warnings.

rasolca

if and how to cache the try_compile result for detecting MKL

Not sure... maybe @albestro has an idea?

CMakeLists.txt

albestro · 2024-05-29T13:59:25Z

if and how to cache the try_compile result for detecting MKL

Not sure... maybe @albestro has an idea?

I would say that it might be nicer to not cache, so that any change to LAPACK_LIBRARY/LAPACK_INCLUDE_DIR will be reflected also in the MKL_TRY_COMPILE result.

Otherwise, let's suppose we do this sequence

cmake -DLAPACK_LIBRARY=<working-config> ...
cmake -DLAPACK_LIBRARY=<non-working-config> ...

with the cached result for MKL we might end up getting BLAS/LAPACK error, because of the check_function_exists in the FindLAPACK that explicitly clear the cache, but at the same time we get that MKL is found correctly (because it is checked just at the first time).

So, I would suggest to do the same we do for LAPACK in FindLAPACK, i.e. unset(DLAF_WITH_MKL_TRY_COMPILE CACHE). And, trivially, in the future start using NO_CACHE where possible (apparently up to 3.29 it is possible for try_compile but not for check_function_exists).

msimberg · 2024-05-29T15:44:10Z

if and how to cache the try_compile result for detecting MKL

Not sure... maybe @albestro has an idea?

I would say that it might be nicer to not cache, so that any change to LAPACK_LIBRARY/LAPACK_INCLUDE_DIR will be reflected also in the MKL_TRY_COMPILE result.

Otherwise, let's suppose we do this sequence
* `cmake -DLAPACK_LIBRARY=<working-config> ...`

* `cmake -DLAPACK_LIBRARY=<non-working-config> ...`
with the cached result for MKL we might end up getting BLAS/LAPACK error, because of the check_function_exists in the FindLAPACK that explicitly clear the cache, but at the same time we get that MKL is found correctly (because it is checked just at the first time).

So, I would suggest to do the same we do for LAPACK in FindLAPACK, i.e. unset(DLAF_WITH_MKL_TRY_COMPILE CACHE). And, trivially, in the future start using NO_CACHE where possible (apparently up to 3.29 it is possible for try_compile but not for check_function_exists).

Currently on this branch, the try_compile result is used as the default value for DLAF_WITH_MKL. This is done with the idea that DLAF_WITH_MKL can be set explicitly by a user to signal that they expect MKL to be used or not used (leading to compilation failures if MKL isn't actually set up correctly). In this case it doesn't make much difference if DLAF_WITH_MKL_TRY_COMPILE is a cache variable or not, and if it's reset or not because it'll only be used once to set the default.

I think if we reset and redetect MKL every time then DLAF_WITH_MKL_TRY_COMPILE should just be DLAF_WITH_MKL. I haven't tried it yet, but in this case I think we can simply let try_compile create the variable. If the user has already set it the try_compile should be skipped (but as I said, not verified). Then a user can -UDLAF_WITH_MKL to redetect it if they don't want to set it explicitly.

As a last option, we try_compile into DLAF_WITH_MKL and don't cache the variable. This, however, means that the user can't override the option (as far as I can think of...).

rasolca · 2024-05-29T16:25:04Z

Not sure what to say... just realized that auto-detection might have false positive if the mkl include directory is present (e.g. if using mkl fftw).
So the possibility for the user should be available.
I think it should work like this:

DLAF_WITH_MKL: is a cached value set by user

DLAF_WITH_MKL	try_compile	-DDLAF_WITH_MKL
ON	not run	added
OFF	not run	skipped
Not defined	OK	added
Not defined	FAIL	skipped

If a variable is also needed in the rest of the cmake scripts I would use a different name.

msimberg · 2024-05-30T07:52:57Z

cscs-ci run

msimberg · 2024-05-30T08:02:12Z

I think it should work like this:

DLAF_WITH_MKL: is a cached value set by user
DLAF_WITH_MKL try_compile -DDLAF_WITH_MKL
ON not run added
OFF not run skipped
Not defined OK added
Not defined FAIL skipped

If a variable is also needed in the rest of the cmake scripts I would use a different name.

This sounds pretty reasonable. I made an attempt at implementing this logic in d5a7ae4.

jst realized that auto-detection might have false positive if the mkl include directory is present (e.g. if using mkl fftw).

Yeah, I was worried about something like this as well. Without intending to I think this now covers that quite reasonably:

DLA-Future/CMakeLists.txt

Lines 81 to 84 in d5a7ae4

    
           if(DLAF_WITH_MKL) 
        
             # When using MKL there is no need to set the number of threads with 
        
             # omp_set_num_threads; it's sufficient to use MKL's own mechanisms. 
        
             set(DLAF_WITH_OPENMP OFF CACHE BOOL "${DLAF_WITH_OPENMP_DESCRIPTION}" FORCE)

. When DLAF_WITH_MKL is explicitly enabled we can safely disable OpenMP. If DLAF_WITH_MKL is undefined that first branch is always false so we'll keep OpenMP enabled (if it wasn't changed by the user). In the worst case we end up calling both omp_set_num_threads and mkl_set_num_threads, but we won't end up in the situation where we think we're using MKL, but we're not setting omp_set_num_threads.

In the spack package we set DLAF_WITH_MKL explicitly so there we only end up with one or the other, never both, enabled.

At least this is how I think it's working now and I think it seems reasonable, but not 100% sure.

I think warning/status messages can still be tweaked, but I'll leave that up to you to comment about.

msimberg · 2024-05-30T08:03:19Z

cscs-ci run

CMakeLists.txt

RMeli

LGTM, thanks!

ci/common-ci.yml

cmake/FindLAPACK.cmake

CMakeLists.txt

cmake/template/DLAFConfig.cmake.in

It is only to be set by users, but should be undefined otherwise.

Co-authored-by: Rocco Meli <[email protected]>

msimberg · 2024-05-31T10:55:04Z

cscs-ci run

msimberg added 6 commits May 28, 2024 09:57

Use spack's blas/lapack libraries for MKL from version 0.5.X onwards

5d0f37d

Remove DLAF_WITH_MKL_LEGACY CMake option

018ebcd

Add auto-detection for MKL

3f9ffc1

TEMP: Disable most CI configurations for testing

dc0ffe3

Update SPACK_SHA in CI

9967509

Add libssl-dev to CI image for external openssl

f71272b

msimberg added this to the v0.5.0 milestone May 28, 2024

msimberg self-assigned this May 28, 2024

msimberg requested review from albestro, RMeli and rasolca May 28, 2024 08:55

rasolca reviewed May 28, 2024

View reviewed changes

CMakeLists.txt Outdated Show resolved Hide resolved

msimberg commented May 28, 2024

View reviewed changes

ci/.gitlab-ci.yml Outdated Show resolved Hide resolved

msimberg commented May 28, 2024

View reviewed changes

ci/docker/build.Dockerfile Show resolved Hide resolved

msimberg commented May 28, 2024

View reviewed changes

spack/packages/dla-future/package.py Outdated Show resolved Hide resolved

msimberg added 2 commits May 28, 2024 11:16

Don't explicitly set DLAF_WITH_MKL in spack package

910724d

Print MKL try_compile result

e681b72

rasolca reviewed May 28, 2024

View reviewed changes

spack/packages/dla-future/package.py Outdated Show resolved Hide resolved

rasolca reviewed May 28, 2024

View reviewed changes

albestro reviewed May 28, 2024

View reviewed changes

msimberg added 6 commits May 29, 2024 10:25

Don't find_package(MKL)

bedcec4

Don't use -Werror=all-warnings for CUDA in CI

3ec281b

Add TODO about enabling MKL and OpenMP

a5cdae8

Remove -Werror=missing-launch-bounds from CUDA CI configuration

c54431b

Use older try_compile signature

ecb341d

Add LAPACK_INCLUDE_DIR and SCALAPACK_INCLUDE_DIR variables

ebe5106

msimberg commented May 29, 2024

View reviewed changes

CMakeLists.txt Outdated Show resolved Hide resolved

msimberg requested a review from albestro May 29, 2024 12:32

rasolca approved these changes May 29, 2024

View reviewed changes

albestro reviewed May 29, 2024

View reviewed changes

CMakeLists.txt Outdated Show resolved Hide resolved

msimberg added 2 commits May 29, 2024 17:33

Rerun MKL detection every time CMake is configured

b25b3e1

Fix MKL try_compile build directory

672a700

Refactor MKL auto-detection

22b3f18

msimberg force-pushed the mkl-spack-cmake branch from d5a7ae4 to 22b3f18 Compare May 30, 2024 08:03

msimberg mentioned this pull request May 30, 2024

Use pika 0.25.0 in CI #1130

Closed

rasolca approved these changes May 31, 2024

View reviewed changes

CMakeLists.txt Show resolved Hide resolved

rasolca mentioned this pull request May 31, 2024

Preparation for v0.5.0 release #1137

Merged

RMeli approved these changes May 31, 2024

View reviewed changes

ci/common-ci.yml Outdated Show resolved Hide resolved

albestro requested changes May 31, 2024

View reviewed changes

cmake/FindLAPACK.cmake Outdated Show resolved Hide resolved

CMakeLists.txt Outdated Show resolved Hide resolved

cmake/template/DLAFConfig.cmake.in Show resolved Hide resolved

msimberg and others added 4 commits May 31, 2024 11:26

Remove DLAF_WITH_MKL as an option

9dc2f78

It is only to be set by users, but should be undefined otherwise.

Simplify checks for empty LAPACK_LIBRARY and LAPACK_INCLUDE_DIR

1758a66

Remove DLAF_WITH_MKL from DLAFConfig.cmake.in

5f5bb92

Update SPACK_SHA

f0fd524

Co-authored-by: Rocco Meli <[email protected]>

rasolca requested a review from albestro May 31, 2024 12:08

albestro approved these changes May 31, 2024

View reviewed changes

rasolca merged commit 62718b9 into eth-cscs:master May 31, 2024
5 checks passed

github-actions bot pushed a commit that referenced this pull request May 31, 2024

Doc: Remove most special handling of MKL in CMake configuration (#1149)

ac0b8a8

msimberg deleted the mkl-spack-cmake branch May 31, 2024 12:19

RMeli mentioned this pull request Jun 2, 2024

Update Spack in CI and use intel-oneapi-mkl +gfortran variant eth-cscs/DLA-Future-Fortran#13

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove most special handling of MKL in CMake configuration #1149

Remove most special handling of MKL in CMake configuration #1149

msimberg commented May 28, 2024 •

edited

Loading

msimberg commented May 28, 2024

msimberg commented May 28, 2024

rasolca left a comment

albestro left a comment

msimberg commented May 29, 2024

msimberg commented May 29, 2024

rasolca left a comment

albestro commented May 29, 2024

msimberg commented May 29, 2024

rasolca commented May 29, 2024

msimberg commented May 30, 2024

msimberg commented May 30, 2024

msimberg commented May 30, 2024

RMeli left a comment

msimberg commented May 31, 2024

Remove most special handling of MKL in CMake configuration #1149

Remove most special handling of MKL in CMake configuration #1149

Conversation

msimberg commented May 28, 2024 • edited Loading

msimberg commented May 28, 2024

msimberg commented May 28, 2024

rasolca left a comment

Choose a reason for hiding this comment

albestro left a comment

Choose a reason for hiding this comment

msimberg commented May 29, 2024

msimberg commented May 29, 2024

rasolca left a comment

Choose a reason for hiding this comment

albestro commented May 29, 2024

msimberg commented May 29, 2024

rasolca commented May 29, 2024

msimberg commented May 30, 2024

msimberg commented May 30, 2024

msimberg commented May 30, 2024

RMeli left a comment

Choose a reason for hiding this comment

msimberg commented May 31, 2024

msimberg commented May 28, 2024 •

edited

Loading