AOMP Release 15.0-3
These are the release notes for AOMP 15.0-3. This release uses modifications to the LLVM development trunk called the "amd-stg-open" branch. This is found at https://github.com/RadeonOpenCompute/llvm-project. The amd-stg-open branch is constantly changing as AMD merges upstream development trunk with its internal open development efforts. Some AMD modifications are experimental and/or under review for the LLVM upstream mono-repo. The AOMP release is a snapshot of amd-stg-open and supporting repositories to build various components.
For AOMP 15.0-3, the last trunk commit is 1f8ae9d7e7e4afcc4e76728b28e64941660ca3eb on Jul 26 2022. This is the fourth AOMP release for LLVM 15 development. The last amd-only commit is b745843ebcb77f55de887b5741197184e7d0dcbd on Aug 01 2022. This forms a frozen branch now called "aomp-15.0-3". See https://github.com/RadeonOpenCompute/llvm-project/tree/aomp-15.0-3
AOMP is a "standalone" build of all necessary ROCm components with the exception of the kernel module. The non llvm-project components for this release were built with ROCM 5.2.x sources.
The changes from 15.0-2 to 15.0-3 include:
- Use the new openmp DeviceRTL by default.
- New DeviceRTL APIs for optimized cross-team reduction.
- Clang codegen changes to use the optimized cross-team reduction APIs for a reduction clause in a device construct.
- Added support for classic flang to use the new DeviceRTL.
Known Issues:
- Flang has issues at -O0 when using the new DeviceRTL on GPUs other than gfx90a.