Support scratch space without result space #1773
…ls.h Co-authored-by: Sergey Kopienko <[email protected]>
LGTM
Thanks for this patch. LGTM
LGTM. Thanks, this is useful.
This patch extends `__result_and_scratch_storage` to support cases where no result space is required. This removes the overhead of allocating result storage when it is not needed, such as for the dummy container introduced in #1770. The signature is kept intact to support the different scan kernels. There is also some code cleanup, and the usage in reduce is aligned with the recently merged scan. The container is also passed by reference to the reduce kernels to prevent copies.