-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TTG device tasks with coroutines #265
TTG device tasks with coroutines #265
Commits on Aug 28, 2023
-
Most functionality is an extension/copy of the CUDA support. Also ported the bspmm example to HIP. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 31af820 - Browse repository at this point
Copy the full SHA 31af820View commit details -
Bump PaRSEC version to current master
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9b817f3 - Browse repository at this point
Copy the full SHA 9b817f3View commit details -
SPMM: move tiles in the non-cuda version
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 30d7ee0 - Browse repository at this point
Copy the full SHA 30d7ee0View commit details
Commits on Aug 30, 2023
-
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 81cca94 - Browse repository at this point
Copy the full SHA 81cca94View commit details
Commits on Sep 6, 2023
-
Try to send from accelerator device directly
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 975e878 - Browse repository at this point
Copy the full SHA 975e878View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2b7e67c - Browse repository at this point
Copy the full SHA 2b7e67cView commit details
Commits on Sep 22, 2023
-
Add have_hip_op to TT produced by make_tt
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7faf783 - Browse repository at this point
Copy the full SHA 7faf783View commit details
Commits on Sep 25, 2023
-
Add TTG_FORCE_DEVICE_COMM to force use of device-side comm buffers
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a423e87 - Browse repository at this point
Copy the full SHA a423e87View commit details -
Bump TA to support ROCM allocators
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a7fe69a - Browse repository at this point
Copy the full SHA a7fe69aView commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6d73621 - Browse repository at this point
Copy the full SHA 6d73621View commit details -
SPMM: restore write-back to Eigen matrix for verification
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 88f8da9 - Browse repository at this point
Copy the full SHA 88f8da9View commit details
Commits on Sep 28, 2023
-
We need one cublas/hipblas context per device
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3ba4a4f - Browse repository at this point
Copy the full SHA 3ba4a4fView commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b570108 - Browse repository at this point
Copy the full SHA b570108View commit details
Commits on Sep 29, 2023
-
Bump TA sha to pull in newer Umpire
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5afb32b - Browse repository at this point
Copy the full SHA 5afb32bView commit details -
SPMM: Flush out PaRSEC data between measurements
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3aff11c - Browse repository at this point
Copy the full SHA 3aff11cView commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3fa8001 - Browse repository at this point
Copy the full SHA 3fa8001View commit details -
[cmake] bump MADNESS tag to pull in m-a-d-n-e-s-s/madness#501
avoids CMake errors related to lapack including OpenMP_C dependence that MADNESS does not know anything about
Configuration menu - View commit details
-
Copy full SHA for 4d60c3f - Browse repository at this point
Copy the full SHA 4d60c3fView commit details
Commits on Oct 1, 2023
-
Add missing device::sendk and sendv routines
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4f21ccd - Browse repository at this point
Copy the full SHA 4f21ccdView commit details -
Add device-based chain task benchmark
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f91fa92 - Browse repository at this point
Copy the full SHA f91fa92View commit details -
Configuration menu - View commit details
-
Copy full SHA for d1a6ef3 - Browse repository at this point
Copy the full SHA d1a6ef3View commit details -
Work in progress: adding CUDA version of POTRF task in POTRF example …
…-- not compiling
Configuration menu - View commit details
-
Copy full SHA for 7576321 - Browse repository at this point
Copy the full SHA 7576321View commit details -
Further steps towards device-enabled POTRF
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4a23f42 - Browse repository at this point
Copy the full SHA 4a23f42View commit details
Commits on Oct 2, 2023
-
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 57faac4 - Browse repository at this point
Copy the full SHA 57faac4View commit details -
Add first full draft of device-enabled POTRF
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 194e93d - Browse repository at this point
Copy the full SHA 194e93dView commit details -
POTRF: Use the right precompiler guard for CUDA and HIP
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e41a4b1 - Browse repository at this point
Copy the full SHA e41a4b1View commit details -
POTRF: make it run, still fails factorization
Notable changes: - add ttg::device::current_device and ttg::device::current_stream to query current streams - Consistently start counting devices at 0 - The host has ID -2 - Fixes in the broadcast Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 18577f9 - Browse repository at this point
Copy the full SHA 18577f9View commit details
Commits on Oct 3, 2023
-
Fix release of data copies in buffer
This fixes a segfault in multi-gpu operations. We should not destroy the full data_t but only remove the host-side copy and release our reference on the data. The data will be destroyed once all gpu copies have been removed. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b60577c - Browse repository at this point
Copy the full SHA b60577cView commit details
Commits on Oct 4, 2023
-
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a2f4bbe - Browse repository at this point
Copy the full SHA a2f4bbeView commit details -
POTRF: add -n parameters to control number of iterations
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9654335 - Browse repository at this point
Copy the full SHA 9654335View commit details -
Configuration menu - View commit details
-
Copy full SHA for 410dc23 - Browse repository at this point
Copy the full SHA 410dc23View commit details
Commits on Oct 5, 2023
-
buffer: cannot capture this pointer for deleter and move, use raw poi…
…nter instead Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5b81ac4 - Browse repository at this point
Copy the full SHA 5b81ac4View commit details -
POTRF: construct whole TTG each iteration
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 403b06b - Browse repository at this point
Copy the full SHA 403b06bView commit details -
Add TTG_ENABLE_ASAN CMake option
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1998655 - Browse repository at this point
Copy the full SHA 1998655View commit details
Commits on Oct 6, 2023
-
Chain benchmark: Disable wait_kernel co_yield
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 548b65b - Browse repository at this point
Copy the full SHA 548b65bView commit details -
Configuration menu - View commit details
-
Copy full SHA for fdf6e2e - Browse repository at this point
Copy the full SHA fdf6e2eView commit details -
Remove TTG cublas_helper, superseeded by devblas helper in examples
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9d21225 - Browse repository at this point
Copy the full SHA 9d21225View commit details
Commits on Oct 11, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 24155bc - Browse repository at this point
Copy the full SHA 24155bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for ee82565 - Browse repository at this point
Copy the full SHA ee82565View commit details -
[cmake] introduce TTG_HAVE_DEVICE used to conditionally compile *port…
…able* device-specific code
Configuration menu - View commit details
-
Copy full SHA for 0993f79 - Browse repository at this point
Copy the full SHA 0993f79View commit details -
Configuration menu - View commit details
-
Copy full SHA for 366e60e - Browse repository at this point
Copy the full SHA 366e60eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 94f3a81 - Browse repository at this point
Copy the full SHA 94f3a81View commit details -
Merge pull request #7 from therault/potrf-cuda-wip
Add accelerator support for POTRF
Configuration menu - View commit details
-
Copy full SHA for fa4aedf - Browse repository at this point
Copy the full SHA fa4aedfView commit details
Commits on Oct 30, 2023
-
Add debugging capabilities for POTRF benchmark
Set DEBUG_TILES_VALUES=1 to enable norm2 check of tiles. This also fixes issues with the parsec data wrapper. This needs some more thought and might come back to haunt us if we try to use data with extrinsic tracking information. Also adds ttg::device::Device to query the ID and type of an instance of the current execution space (e.g., the currently assigned GPU). Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 148a3ad - Browse repository at this point
Copy the full SHA 148a3adView commit details -
Cleanup devblas support in rocm support
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 57f914e - Browse repository at this point
Copy the full SHA 57f914eView commit details -
Lazily compute norm on MatrixTile
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 162a0a1 - Browse repository at this point
Copy the full SHA 162a0a1View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9dbe648 - Browse repository at this point
Copy the full SHA 9dbe648View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fdc3c95 - Browse repository at this point
Copy the full SHA fdc3c95View commit details -
Fix bspmm-cuda after device and devblas helper changes
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7200e1c - Browse repository at this point
Copy the full SHA 7200e1cView commit details
Commits on Oct 31, 2023
-
POTRF: make tile argument to dispatcher const
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9d15ba1 - Browse repository at this point
Copy the full SHA 9d15ba1View commit details
Commits on Nov 3, 2023
-
Multi-GPU: Play nice with the PaRSEC accelerator
We need to a) push tiles back if they move from RW to RO tasks, and b) create a fake task-class that contains all the flows we actually need (i.e., that are part of to_device()). This way PaRSEC can select the right device and we get all the transfers in multi-GPU settings. This needs to be revisited in the PaRSEC backend. We should not need to do the push-back and create a fake task-class to select the correct device. That requires an overhaul of the device backend though... Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 166bdec - Browse repository at this point
Copy the full SHA 166bdecView commit details -
Bump PaRSEC and MADNESS tags for proper tear-down
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 33a5a8d - Browse repository at this point
Copy the full SHA 33a5a8dView commit details
Commits on Nov 6, 2023
-
Bump PaRSEC reference for TTG-specific fixes
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f9375c8 - Browse repository at this point
Copy the full SHA f9375c8View commit details
Commits on Nov 15, 2023
-
Bump PaRSEC ref for reworked GPU backend
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9903f25 - Browse repository at this point
Copy the full SHA 9903f25View commit details
Commits on Nov 16, 2023
-
Configuration menu - View commit details
-
Copy full SHA for e9271ff - Browse repository at this point
Copy the full SHA e9271ffView commit details -
Add level zero support and refactor device naming
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e20758a - Browse repository at this point
Copy the full SHA e20758aView commit details -
Merge branch 'ttg-device-support-master-coro-with-stream-tasks' of gi…
…thub.com:devreal/ttg into ttg-device-support-master-coro-with-stream-tasks
Configuration menu - View commit details
-
Copy full SHA for af6f684 - Browse repository at this point
Copy the full SHA af6f684View commit details -
Configuration menu - View commit details
-
Copy full SHA for 97d0bf6 - Browse repository at this point
Copy the full SHA 97d0bf6View commit details -
Merge branch 'ttg-device-support-master-coro-with-stream-tasks' of gi…
…thub.com:devreal/ttg into ttg-device-support-master-coro-with-stream-tasks
Configuration menu - View commit details
-
Copy full SHA for c2da1f5 - Browse repository at this point
Copy the full SHA c2da1f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6af1828 - Browse repository at this point
Copy the full SHA 6af1828View commit details -
Agnostic async copy and non-device support
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0be1212 - Browse repository at this point
Copy the full SHA 0be1212View commit details -
Add missing parsec backend header file to CMake
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a119828 - Browse repository at this point
Copy the full SHA a119828View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4f125ad - Browse repository at this point
Copy the full SHA 4f125adView commit details -
Merge branch 'ttg-device-support-master-coro-with-stream-tasks' of gi…
…thub.com:devreal/ttg into ttg-device-support-master-coro-with-stream-tasks
Configuration menu - View commit details
-
Copy full SHA for 67434d1 - Browse repository at this point
Copy the full SHA 67434d1View commit details -
Work on device_gpu_module instead of device_module
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 03b54bb - Browse repository at this point
Copy the full SHA 03b54bbView commit details -
Merge branch 'ttg-device-support-master-coro-with-stream-tasks' of gi…
…thub.com:devreal/ttg into ttg-device-support-master-coro-with-stream-tasks
Configuration menu - View commit details
-
Copy full SHA for a9e5959 - Browse repository at this point
Copy the full SHA a9e5959View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for dde416c - Browse repository at this point
Copy the full SHA dde416cView commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5569b57 - Browse repository at this point
Copy the full SHA 5569b57View commit details -
Return sycl queue as non-const
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 86b92db - Browse repository at this point
Copy the full SHA 86b92dbView commit details -
Use the right parsec device module types
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5aac5c2 - Browse repository at this point
Copy the full SHA 5aac5c2View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for afeda46 - Browse repository at this point
Copy the full SHA afeda46View commit details -
PaRSEC: Add have_level_zero_op()
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d1fb80f - Browse repository at this point
Copy the full SHA d1fb80fView commit details -
Add missing have_level_zero_op_non_type_t
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for bfa3fde - Browse repository at this point
Copy the full SHA bfa3fdeView commit details -
Fix bug in level zero detection of derived TT
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 68e4825 - Browse repository at this point
Copy the full SHA 68e4825View commit details -
Don't ever assume that 2 is the first device
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9afb6f8 - Browse repository at this point
Copy the full SHA 9afb6f8View commit details
Commits on Nov 20, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 961b0d3 - Browse repository at this point
Copy the full SHA 961b0d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 78876e5 - Browse repository at this point
Copy the full SHA 78876e5View commit details
Commits on Nov 27, 2023
-
Proper use of device ID conversion functions
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for cf757ee - Browse repository at this point
Copy the full SHA cf757eeView commit details -
Bump parsec commit to rebased fork
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d660b07 - Browse repository at this point
Copy the full SHA d660b07View commit details -
There is no allocators for level zero in TiledArray yet
Signed-off-by: Joseph Schuchart <[email protected]>
5Configuration menu - View commit details
-
Copy full SHA for 0ab8f47 - Browse repository at this point
Copy the full SHA 0ab8f47View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fb5f111 - Browse repository at this point
Copy the full SHA fb5f111View commit details -
ze: add the missing link-libraries to spmm
Signed-off-by: Aurelien Bouteiller <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 383d36d - Browse repository at this point
Copy the full SHA 383d36dView commit details -
Use std::size_t in parsec matrix wrapper
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f4b482a - Browse repository at this point
Copy the full SHA f4b482aView commit details -
MatrixTile: use std::size_t consistently
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0b1325e - Browse repository at this point
Copy the full SHA 0b1325eView commit details -
Fun with signed integers in parsec matrix wrapper
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1f3619a - Browse repository at this point
Copy the full SHA 1f3619aView commit details
Commits on Nov 29, 2023
-
Fix is_dummy name collision in parsec_ttg_task_base_t
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 81c25e3 - Browse repository at this point
Copy the full SHA 81c25e3View commit details
Commits on Dec 4, 2023
-
Bump PaRSEC version to get first set of eviction fixes
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for bf821a5 - Browse repository at this point
Copy the full SHA bf821a5View commit details
Commits on Dec 5, 2023
-
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6727faf - Browse repository at this point
Copy the full SHA 6727fafView commit details -
Fix usage of wrong device ID in buffer
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 049297e - Browse repository at this point
Copy the full SHA 049297eView commit details -
Bump PaRSEC tag for more memory oversubscription fixes
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8146481 - Browse repository at this point
Copy the full SHA 8146481View commit details
Commits on Dec 13, 2023
-
Map blas/solver handles to streams, not threads and streams
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b9727a4 - Browse repository at this point
Copy the full SHA b9727a4View commit details
Commits on Dec 18, 2023
-
Naming convetions: rename ttg::Buffer
Objects in TTG start with a capital letter so rename ttg::buffer to ttg::Buffer to be consistent. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 603af4e - Browse repository at this point
Copy the full SHA 603af4eView commit details -
Remove legacy device implementation files
Both view.h and make_device_tt.h were the initial implementation and are not used anymore. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9afe0c0 - Browse repository at this point
Copy the full SHA 9afe0c0View commit details -
Put all device functions into ttg::device namespace
Except for ttg::Buffer, which is separate and may be used by non-device tasks/data structures. The following were renamed: - to_device -> select - wait_kernel -> wait Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f424bcd - Browse repository at this point
Copy the full SHA f424bcdView commit details
Commits on Dec 19, 2023
-
Add device-related fwd-decl to madness backend
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3bf1e16 - Browse repository at this point
Copy the full SHA 3bf1e16View commit details
Commits on Dec 20, 2023
-
Include fwd.h instead of impl_selector.h
fwd.h provides the symbols from the implementation namespace Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 913f792 - Browse repository at this point
Copy the full SHA 913f792View commit details -
Bump catch2 version to latest release
This is needed to get the fix for catchorg/Catch2#2421 Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7c6044a - Browse repository at this point
Copy the full SHA 7c6044aView commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ed055fb - Browse repository at this point
Copy the full SHA ed055fbView commit details -
Forward keyT in make_tt overload
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0c7d291 - Browse repository at this point
Copy the full SHA 0c7d291View commit details -
Bump github CI Ubuntu to 22.04 to get boost 1.74 needed for TA
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b486a2f - Browse repository at this point
Copy the full SHA b486a2fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 555ae2f - Browse repository at this point
Copy the full SHA 555ae2fView commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5cf2502 - Browse repository at this point
Copy the full SHA 5cf2502View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 220c038 - Browse repository at this point
Copy the full SHA 220c038View commit details -
Fix ttvalue and buffer in parsec
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e9b3233 - Browse repository at this point
Copy the full SHA e9b3233View commit details -
Add derived_has_*_op() to madness backend
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 65ffd4c - Browse repository at this point
Copy the full SHA 65ffd4cView commit details -
Configuration menu - View commit details
-
Copy full SHA for e7bba1b - Browse repository at this point
Copy the full SHA e7bba1bView commit details -
Configuration menu - View commit details
-
Copy full SHA for b653b9b - Browse repository at this point
Copy the full SHA b653b9bView commit details
Commits on Dec 21, 2023
-
Enable madness serialization for madness backend buffer
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 82a7b2a - Browse repository at this point
Copy the full SHA 82a7b2aView commit details -
Only build device_coro test if device support is detected
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0d64297 - Browse repository at this point
Copy the full SHA 0d64297View commit details -
Fix possible use-after-free in TT::release
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5f876e3 - Browse repository at this point
Copy the full SHA 5f876e3View commit details -
Use key directly in task_snprintf instead of locals
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6f645df - Browse repository at this point
Copy the full SHA 6f645dfView commit details -
Remove legacy devicemock example
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ba6f7a1 - Browse repository at this point
Copy the full SHA ba6f7a1View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c899f07 - Browse repository at this point
Copy the full SHA c899f07View commit details -
Comment preliminary mra device code pieces
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for bd454dd - Browse repository at this point
Copy the full SHA bd454ddView commit details -
Gracefully handle missing TA allocators
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ec67c6b - Browse repository at this point
Copy the full SHA ec67c6bView commit details -
Use stock doxygen for relase test
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8ccab44 - Browse repository at this point
Copy the full SHA 8ccab44View commit details -
Fix const handling in reducer tasks
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for beed36a - Browse repository at this point
Copy the full SHA beed36aView commit details -
Only enable cuda op if cuda was detected
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f30f901 - Browse repository at this point
Copy the full SHA f30f901View commit details
Commits on Dec 22, 2023
-
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ece8cc3 - Browse repository at this point
Copy the full SHA ece8cc3View commit details -
Make sure new reducer target copies are marked mutable
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for db4f736 - Browse repository at this point
Copy the full SHA db4f736View commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4547d8c - Browse repository at this point
Copy the full SHA 4547d8cView commit details -
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f9900fe - Browse repository at this point
Copy the full SHA f9900feView commit details
Commits on Jan 19, 2024
-
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d42cb92 - Browse repository at this point
Copy the full SHA d42cb92View commit details
Commits on Jan 29, 2024
-
PaRSEC task: copy task locals to comply with profiling reqs
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7b9c248 - Browse repository at this point
Copy the full SHA 7b9c248View commit details -
SPMM: Make writing C back into Eigen optional
The Eigen format is not particularly suitable for thread-parallel writes so for benchmarks we drop the last part of pushing tiles back into the Eigen matrix. Tiles are still communicated and reduced properly. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 641c81f - Browse repository at this point
Copy the full SHA 641c81fView commit details -
SPMM: distribute A/B over all procs and throttle broadcasts
We distribute A and B over all three dimensions instead of just the first two. The 2.5D approach does not seem suitable for our approach since matrices will likely be distributed across all processes. Also, add the ability to throttle the number of parallel bcasts to avoid overloading the network. This should be revisited, since it seems to the an issue esp with UCX as a backend in OMPI. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d584e06 - Browse repository at this point
Copy the full SHA d584e06View commit details
Commits on Jan 30, 2024
-
btas::Tensor: do not fill with zeroes when constructing (e.g., in zer…
…o-copy serialization), unless necessary
Configuration menu - View commit details
-
Copy full SHA for a85c2c4 - Browse repository at this point
Copy the full SHA a85c2c4View commit details -
Merge branch 'ttg-device-support-master-coro-with-stream-tasks' of gi…
…thub.com:devreal/ttg into ttg-device-support-master-coro-with-stream-tasks
Configuration menu - View commit details
-
Copy full SHA for 7a5e2a5 - Browse repository at this point
Copy the full SHA 7a5e2a5View commit details
Commits on Jan 31, 2024
-
[cmake] bump MADNESS, BTAS, and TA tags to sync with latest; MADNESS …
…points to main repo now
Configuration menu - View commit details
-
Copy full SHA for 1f2a4fb - Browse repository at this point
Copy the full SHA 1f2a4fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1125f96 - Browse repository at this point
Copy the full SHA 1125f96View commit details -
Configuration menu - View commit details
-
Copy full SHA for 549ba94 - Browse repository at this point
Copy the full SHA 549ba94View commit details
Commits on Feb 1, 2024
-
can use bundled Boost.CallableTraits only if don't have real Boost an…
…d TTG_FETCH_BOOST is not set
Configuration menu - View commit details
-
Copy full SHA for f05530f - Browse repository at this point
Copy the full SHA f05530fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9765f75 - Browse repository at this point
Copy the full SHA 9765f75View commit details -
Configuration menu - View commit details
-
Copy full SHA for b70444c - Browse repository at this point
Copy the full SHA b70444cView commit details
Commits on Feb 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1ffaf6a - Browse repository at this point
Copy the full SHA 1ffaf6aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 41cc814 - Browse repository at this point
Copy the full SHA 41cc814View commit details -
Implement support for inlining of objects with SplitMetadata
For small objects it is beneficial of packing the data into the activation message instead of going through an extra GET. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8993e49 - Browse repository at this point
Copy the full SHA 8993e49View commit details -
SPMM: Allow changing the work distribution
Work may be performed by the owner of A, B, or C, or the default honoring the R parameter. Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for becdbf4 - Browse repository at this point
Copy the full SHA becdbf4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3dad2c1 - Browse repository at this point
Copy the full SHA 3dad2c1View commit details -
Enforce strict ordering of broadcasts in column/row
For matrix A, we make sure that tiles are sent out in column-order (left to right). For matrix B, we want to make sure that tiles are sent out in row-order (top to bottom). Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4416f3b - Browse repository at this point
Copy the full SHA 4416f3bView commit details -
Merge branch 'ttg-device-support-master-coro-with-stream-tasks' of gi…
…thub.com:devreal/ttg into ttg-device-support-master-coro-with-stream-tasks
Configuration menu - View commit details
-
Copy full SHA for 8fd7be9 - Browse repository at this point
Copy the full SHA 8fd7be9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 59779c0 - Browse repository at this point
Copy the full SHA 59779c0View commit details
Commits on Feb 14, 2024
-
Bump PaRSEC version for MPI backend instrumentation
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 94b2b75 - Browse repository at this point
Copy the full SHA 94b2b75View commit details
Commits on Feb 23, 2024
-
Add missing <numeric> header for CI
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c06654f - Browse repository at this point
Copy the full SHA c06654fView commit details -
Order broadcast keys so that we send to myrank+1 first
Signed-off-by: Joseph Schuchart <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b8c2cc8 - Browse repository at this point
Copy the full SHA b8c2cc8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 67b0fb3 - Browse repository at this point
Copy the full SHA 67b0fb3View commit details -
Configuration menu - View commit details
-
Copy full SHA for b741e78 - Browse repository at this point
Copy the full SHA b741e78View commit details -
Configuration menu - View commit details
-
Copy full SHA for ed270da - Browse repository at this point
Copy the full SHA ed270daView commit details -
Configuration menu - View commit details
-
Copy full SHA for 12d1c1f - Browse repository at this point
Copy the full SHA 12d1c1fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 27c03f7 - Browse repository at this point
Copy the full SHA 27c03f7View commit details -
Configuration menu - View commit details
-
Copy full SHA for fcbd2e8 - Browse repository at this point
Copy the full SHA fcbd2e8View commit details
Commits on Feb 24, 2024
-
unit test informational logging interferes with catch_discover_tests,…
… only produce if Catch's `verbosity` != `quiet`
Configuration menu - View commit details
-
Copy full SHA for 9cd39b8 - Browse repository at this point
Copy the full SHA 9cd39b8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8104b49 - Browse repository at this point
Copy the full SHA 8104b49View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1826821 - Browse repository at this point
Copy the full SHA 1826821View commit details -
Configuration menu - View commit details
-
Copy full SHA for 56ed942 - Browse repository at this point
Copy the full SHA 56ed942View commit details