v1.11.1: Patch Release
Llama3 has been validated on Gaudi
Fix issue with pytest
The latest SynapseAI Docker images come with Pytest v8 already installed, which is incompatible with the Transformers library and leads to an error in a few non-test cases. As a temporary workaround, Pytest is pinned and moved as a hard dependency.
Other
- Fp8 merge fix #863 @libinta
- Fixed "reuse_cache" Bug #888 @Danielohayon
- Remove deprecated AOT_HPU_TRAINING_BACKEND #877 @astachowiczhabana
- Add mark step and inplace residual add in llama model code #833 @puneeshkhanna
- Enable Flash Attention in recompute and causal modes #862 @wszczurekhabana
- Add mark_step for llama inference #875 @libinta
Full Changelog: v1.11.0...v1.11.1