Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable vectorized global loads for the reduction algorithms #1470

Merged
merged 30 commits into from
May 22, 2024
Merged
Show file tree
Hide file tree
Changes from 28 commits
Commits
Show all changes
30 commits
Select commit Hold shift + click to select a range
1f1ee5d
Dynamic number of items per work-item
julianmi Feb 6, 2024
eeb4120
Enable vectorization
julianmi Feb 27, 2024
41fd20e
Code cleanup
julianmi Feb 27, 2024
3630da8
Fix WG adjustment
julianmi Feb 27, 2024
c612f61
Restructure vectorized reduction
julianmi Feb 28, 2024
e029fd7
Add single item process path
julianmi Feb 29, 2024
f5e8271
Template vector width
julianmi Feb 29, 2024
530e13d
Reduce branch divergence
julianmi Mar 19, 2024
d36acbc
Enable 32-bit addressing
julianmi Mar 22, 2024
42c9aab
Fix merge issues
julianmi Mar 22, 2024
556c184
Centralize tuning parameters
julianmi Mar 25, 2024
b8d067a
Cleanup diff
julianmi Mar 27, 2024
5c9a43e
Fix CPU backend issue
julianmi Apr 3, 2024
14c44df
Address review feedback
julianmi Apr 12, 2024
6fc2581
Fix merge issue
julianmi Apr 15, 2024
1db434d
Address review comments
julianmi Apr 15, 2024
ebd0da3
Remove another inline statement
julianmi Apr 15, 2024
f7ee7f1
Remove unintentional formatting changes
julianmi Apr 17, 2024
2ef7d5b
Remove move statement and ::std
julianmi Apr 18, 2024
17d5bce
Address review feedback
julianmi Apr 22, 2024
291b1c6
Address review feedback
julianmi Apr 24, 2024
fdaf78d
Update is_device_copyable trait
julianmi Apr 25, 2024
27a01f4
Update transform_reduce signature also in test
julianmi Apr 25, 2024
70939d6
Add missing out-of-bounds check
julianmi Apr 26, 2024
0245bbd
Improve bounds check based on review comments
julianmi Apr 26, 2024
48bf347
Further bounds check improvements
julianmi Apr 26, 2024
90c308b
Add check for shorter addressing support
julianmi May 21, 2024
3d93a31
Use static assert instead
julianmi May 21, 2024
3ebc344
Address review comments
julianmi May 21, 2024
27fd437
Rename union storeage based on review discussion
julianmi May 21, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading
Loading