
Optimize State Copy #116

Open
tkoskela opened this issue Jul 24, 2020 · 2 comments

tkoskela commented Jul 24, 2020

The point-to-point communications in State Copy can cause a significant slowdown when going off-node; see the example below. The algorithm should be optimized to avoid copying duplicate particles. At present the algorithm works roughly as follows (see the sketch after the permalink):

1. Work out which particles rank i has
2. Work out which particles rank i needs after resampling
3. Work out which rank has each particle rank i needs
4. Work out which ranks need the particles rank i has
5. Send each particle rank i has to every process that needs it
6. Receive each particle rank i needs from the process that has it

https://github.com/Team-RADDISH/TDAC.jl/blob/90a318dbbbd4f80e88e96d2a80522150745705bb/src/TDAC.jl#L452
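For reference, a minimal sketch of this exchange pattern in Julia, written against the positional MPI.jl API of the time. All names here (`copy_states!`, `particles`, `nprt_per_rank`, the block layout of particles over ranks) are invented for illustration; this is not the actual TDAC.jl implementation behind the permalink above.

```julia
using MPI

# Hypothetical sketch of steps 1-6 above, not the TDAC.jl code. `particles`
# holds one particle per column; global particle j lives on rank
# (j - 1) ÷ nprt_per_rank, and resampling_indices[k] is the source particle
# for global slot k after resampling. The source index doubles as the
# message tag, purely for illustration.
function copy_states!(particles::Matrix{Float64}, resampling_indices::Vector{Int},
                      nprt_per_rank::Int, comm::MPI.Comm)
    my_rank = MPI.Comm_rank(comm)
    offset = my_rank * nprt_per_rank
    owner(j) = (j - 1) ÷ nprt_per_rank

    reqs = MPI.Request[]
    new_particles = similar(particles)
    pending = Tuple{Int,Vector{Float64}}[]  # (local slot, receive buffer)

    # Steps 2, 3 and 6: one receive per local slot -- even when several
    # slots need the very same source particle.
    for islot in 1:nprt_per_rank
        isrc = resampling_indices[offset + islot]
        if owner(isrc) == my_rank
            new_particles[:, islot] = particles[:, isrc - offset]
        else
            buf = Vector{Float64}(undef, size(particles, 1))
            push!(pending, (islot, buf))
            push!(reqs, MPI.Irecv!(buf, owner(isrc), isrc, comm))
        end
    end

    # Steps 1, 4 and 5: one send per copy -- the same particle goes over
    # the network again for every slot that resampled it.
    for (gslot, isrc) in enumerate(resampling_indices)
        if owner(isrc) == my_rank && owner(gslot) != my_rank
            push!(reqs, MPI.Isend(particles[:, isrc - offset], owner(gslot), isrc, comm))
        end
    end

    MPI.Waitall!(reqs)
    for (islot, buf) in pending
        new_particles[:, islot] = buf
    end
    particles .= new_particles
    return particles
end
```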

This is essentially a brute-force solution. In practice it often happens that a single particle has an extremely high weight and is copied many times to many processes. The current algorithm sends the same particle over the network for every copy, even multiple times to the same process. A step should be added that identifies duplicate particles and replicates them locally once a single copy has been received.

[figure: timing example of the off-node slowdown]
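A hedged sketch of the proposed extra step, reusing the invented names from the sketch above: group the local slots by the source particle they need, move each distinct remote particle over the network at most once per destination rank, and fan the duplicates out as local memory copies.

```julia
# Hypothetical dedup variant (illustration only, not the TDAC.jl code).
# A particle resampled k times onto one rank now costs one message, not k.
function copy_states_dedup!(particles::Matrix{Float64}, resampling_indices::Vector{Int},
                            nprt_per_rank::Int, comm::MPI.Comm)
    my_rank = MPI.Comm_rank(comm)
    offset = my_rank * nprt_per_rank
    owner(j) = (j - 1) ÷ nprt_per_rank

    # Group local slots by the source particle they need.
    slots_by_source = Dict{Int,Vector{Int}}()
    for islot in 1:nprt_per_rank
        isrc = resampling_indices[offset + islot]
        push!(get!(slots_by_source, isrc, Int[]), islot)
    end

    reqs = MPI.Request[]
    recv_bufs = Dict{Int,Vector{Float64}}()
    # One receive per distinct remote source particle, not one per slot.
    for isrc in keys(slots_by_source)
        owner(isrc) == my_rank && continue
        buf = Vector{Float64}(undef, size(particles, 1))
        recv_bufs[isrc] = buf
        push!(reqs, MPI.Irecv!(buf, owner(isrc), isrc, comm))
    end

    # One send per (owned particle, destination rank) pair, not one per copy.
    sent = Set{Tuple{Int,Int}}()
    for (gslot, isrc) in enumerate(resampling_indices)
        owner(isrc) == my_rank || continue
        dest = owner(gslot)
        (dest == my_rank || (isrc, dest) in sent) && continue
        push!(sent, (isrc, dest))
        push!(reqs, MPI.Isend(particles[:, isrc - offset], dest, isrc, comm))
    end
    MPI.Waitall!(reqs)

    # Local fan-out: duplicates become plain memory copies, not messages.
    new_particles = similar(particles)
    for (isrc, slots) in slots_by_source
        src = owner(isrc) == my_rank ? particles[:, isrc - offset] : recv_bufs[isrc]
        for islot in slots
            new_particles[:, islot] = src
        end
    end
    particles .= new_particles
    return particles
end
```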

@tkoskela tkoskela added the enhancement New feature or request label Jul 24, 2020
@tkoskela tkoskela self-assigned this Jul 24, 2020
@tkoskela commented:

The root of the issue is that, by the nature of the algorithm, state_copy creates wild load imbalances that are hard to fix. They only cause a slowdown at the next collective call, which happens to be the MPI_Reduce in get_mean_and_var. The idea above may help, but is unlikely to fix the issue completely.

@tkoskela commented:

https://github.com/Team-RADDISH/TDAC.jl/blob/8c67e8c6c3e492267983628f56406daac0c29dfe/src/TDAC.jl#L458-L465 could be done on master, with only the results sent via MPI. This would probably save some time compared to broadcasting the whole resampling_indices vector. A sketch of the idea follows.
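A minimal sketch of that idea, assuming (hypothetically) that rank 0 holds the full resampling_indices vector after resampling; function and variable names are invented. The master works out each rank's slice and sends just that, instead of an MPI.Bcast! of the whole vector.

```julia
using MPI

# Hypothetical sketch: rank 0 derives each rank's part of the resampling
# plan and ships only that slice; the other ranks never see the full vector.
function distribute_plan(resampling_indices, nprt_per_rank::Int, comm::MPI.Comm)
    my_rank = MPI.Comm_rank(comm)
    my_plan = Vector{Int}(undef, nprt_per_rank)
    if my_rank == 0
        for dest in 1:MPI.Comm_size(comm) - 1
            chunk = resampling_indices[dest * nprt_per_rank + 1 : (dest + 1) * nprt_per_rank]
            MPI.Send(chunk, dest, 0, comm)
        end
        my_plan .= resampling_indices[1:nprt_per_rank]
    else
        MPI.Recv!(my_plan, 0, 0, comm)
    end
    return my_plan
end
```

Since every slice has the same length, a scatter collective would do the same job in a single call.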
