COSMA-v2.6.0 #114
kabicm announced in Announcements
This release enables COSMA to take advantage of fast GPU-to-GPU interconnects like NVLink, to efficiently utilize modern multi-GPU systems. This is achieved in two ways:
- NCCL/RCCL libraries: by specifying the `-DCOSMA_WITH_NCCL=ON` cmake option.
- GPU-aware MPI: by specifying the `-DCOSMA_WITH_GPU_AWARE_MPI=ON` cmake option, as proposed here.

See README and INSTALL for more info on how to build.
In addition, the following performance improvements have been made:
- `all-gather` and `reduce-scatter` collectives are improved.
- `MPI_Reduce_scatterv` which we previously used. We padded all the blocks to be able to overcome this issue.

This discussion was created from the release COSMA-v2.6.0.
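The block padding mentioned in the release notes can be illustrated in isolation. The following is a minimal sketch, not COSMA's actual code: it assumes the issue is a reduce-scatter collective that only accepts equal-sized blocks, and it simulates the ranks inside a single process instead of calling MPI or NCCL. The function names `pad_blocks`, `reduce_scatter_block` and `padded_reduce_scatter` are hypothetical and exist only for this example.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Copy variable-size blocks into a buffer where every block occupies
// `max_block` elements; unused tail elements are zero (neutral for a sum).
// `data` holds the blocks back to back, `counts[i]` is the size of block i.
std::vector<double> pad_blocks(const std::vector<double>& data,
                               const std::vector<std::size_t>& counts,
                               std::size_t max_block) {
    std::vector<double> padded(counts.size() * max_block, 0.0);
    std::size_t offset = 0;
    for (std::size_t i = 0; i < counts.size(); ++i) {
        std::copy(data.begin() + offset, data.begin() + offset + counts[i],
                  padded.begin() + i * max_block);
        offset += counts[i];
    }
    return padded;
}

// In-process stand-in for an equal-block reduce-scatter with a sum reduction:
// rank r receives the element-wise sum of block r taken over all send buffers.
std::vector<std::vector<double>> reduce_scatter_block(
    const std::vector<std::vector<double>>& send, std::size_t block) {
    const std::size_t ranks = send.size();
    std::vector<std::vector<double>> recv(ranks,
                                          std::vector<double>(block, 0.0));
    for (std::size_t r = 0; r < ranks; ++r)
        for (const auto& buf : send)
            for (std::size_t k = 0; k < block; ++k)
                recv[r][k] += buf[r * block + k];
    return recv;
}

// Variable-size reduce-scatter emulated by padding every block to the
// largest block size. Assumes `counts` is non-empty and has one entry per rank.
std::vector<std::vector<double>> padded_reduce_scatter(
    const std::vector<std::vector<double>>& send,
    const std::vector<std::size_t>& counts) {
    const std::size_t max_block =
        *std::max_element(counts.begin(), counts.end());
    std::vector<std::vector<double>> padded;
    for (const auto& buf : send)
        padded.push_back(pad_blocks(buf, counts, max_block));
    auto recv = reduce_scatter_block(padded, max_block);
    // Drop the padding so each rank keeps only its real elements.
    for (std::size_t r = 0; r < recv.size(); ++r)
        recv[r].resize(counts[r]);
    return recv;
}
```

Calling `padded_reduce_scatter(send, counts)` with one send buffer per simulated rank returns, for each rank, the summed block of length `counts[rank]`. The trade-off in this sketch is extra memory and traffic: every block is sent at the size of the largest one, so the approach is cheapest when block sizes are close to uniform.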