-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for SimSYCL as a SYCL implementation #871
base: main
Are you sure you want to change the base?
Conversation
cmake/AddSYCLExecutable.cmake
Outdated
if (NOT ${SYCL_IMPLEMENTATION} IN_LIST KNOWN_SYCL_IMPLEMENTATIONS) | ||
message(FATAL_ERROR | ||
"The SYCL CTS requires specifying a SYCL implementation with " | ||
"-DSYCL_IMPLEMENTATION=[Intel_SYCL,DPCPP;hipSYCL]") | ||
"-DSYCL_IMPLEMENTATION=[Intel_SYCL,DPCPP;hipSYCL;SimSYCL]") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder if we should just put KNOWN_SYCL_IMPLEMENTATIONS
directly into the message instead of modifying it every time we change list of known implementations?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps CTS_GRADE_SYCL_IMPLEMENTATION
or something better, because we know quite many other implementations. :-)
tests/event/event.cpp
Outdated
#if SYCL_CTS_COMPILING_WITH_SIMSYCL | ||
SKIP("SimSYCL does not implement asynchronous execution."); | ||
#endif |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that from the conformance point of view the better to use DISABLED_FOR_TEST_CASE
macro so the test appears as failed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure this is even non-conformant. IIRC an implementation only needs to guarantee forward progress on an event when wait
is called on it, which does not happen in any of these test cases.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think this test requires a guarantee of forward progress, does it? It seems like it is just calling event::get_wait_list
and making sure the list contains the direct dependencies of a pending command. There is no expectation that the command has completed when get_wait_list
is called.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree, I don't understand why SimSYCL couldn't implement these features.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You're right, I read this code the wrong way! It needs the host task to launch asynchronously in the background at least once wait()
is called, and then start running once the delayed_host_event
is signalled. SimSYCL is fully synchronous and does not overlap command groups with the main thread, so the host task unconditionally deadlocks. This will probably never work. I changed the SKIP
to FAIL
to account for this.
tests/usm/CMakeLists.txt
Outdated
if(SYCL_IMPLEMENTATION STREQUAL SimSYCL) | ||
message(WARNING "SimSYCL does not provide true concurrency between host and device, disabling USM atomic tests") | ||
list(FILTER test_cases_list EXCLUDE REGEX usm_atomic_access_.*\\.cpp$) | ||
endif() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Piggybacking off of @AlexeySachkov's comment, these should preferably also be disabled using DISABLE_FOR_TEST_CASE(SimSYCL, ...
!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same as above - this needs a forward progress guarantee in SYCL to pass.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you point out which tests in particular you're worried about?
SYCL explicitly does not guarantee that the device will make forward progress unless a host thread calls wait
. So if a test currently requires that behavior, I'd argue that it shouldn't be in the CTS.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Removed the filter, we're explicitly FAIL
ing now inside the test case (see my comment on the review above this one).
Please also document SimSYCL in |
Ideally we should also include SimSYCL in our CI setup. I think it should be relatively straightforward to add, analogously to hipSYCL/AdaptiveCpp. |
How do we move on this? |
ae3274e
to
7d5f825
Compare
7d5f825
to
a330782
Compare
3c601ad
to
28402a6
Compare
28402a6
to
47b7b26
Compare
I've added a CI step for a SimSYCL build, but creating the docker image needs some secrets. Not sure how to proceed on this. |
I've launched a manual run to create the image here. It went through, but it looks like there's some issue while building the CTS itself! |
Thanks, it's building now! |
66159bf
to
8518e9a
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That looks good after merge conflict resolution.
atomicity_with_host_code requires a memory_scope argument (always `system`). atomic_ref<float> has no operator++ and must use fetch_add() instead.
8518e9a
to
acfbda6
Compare
acfbda6
to
613b97d
Compare
This adds SimSYCL as a supported SYCL implementation, complete with CMake integration and an exclude-list for the "fast" conformance build.
This should be merged after #870 #872 and #873, because the CTS it will not actually compile for many of the non-excluded cases unless those bugs are addressed.