Heraklion
Release
This release includes the new version of differential checkpointing, a complete implementation of incremental checkpointing, full support for GPU checkpointing and full support for HDF5 checkpointing, including the option for checkpointing into a single file (N-1) and restarting with a different number of processes.
Changelog
- New major feature allowing users to checkpoint data allocated in the GPU device memory.
- New implementation of differential checkpointing that addresses performance issues for highly fragmented differential updates.
- New major feature allowing users to use incremental checkpointing for CPU and GPU data by adding one by one the variables to the checkpoint file.
- New major feature for DCPPosix allowing to recover from last non-corrupted checkpoint file.
- New examples in the examples/GPU directory that checkpoint GPU data.
- New major feature allowing to restart with a different number of processes using a shared HDF5 checkpoint file.
- New unitary tests for the new features.
- New configurable/flexible local test structure.
- Fixed Bug of RecoverVar.
- Fixed Bug on DCP recovery.
- Complete and full code documentation generated with Doxygen.