0.2.1
Bug Fixes and Other Changes
- Fix known caveat related to
XLA_ENABLE_MULTIPLE_STREAM=1
. The accuracy issue is fixed and no need to set this environment variable anymore. - Fix known caveat related to
MHA=0
. The crash error is fixed and no need to set this environment variable anymore. - Fix compatibility issue with upgraded Driver LTS release 2350.29
- Fix random accuracy issue caused by
AllToAll
collective. - Upgrade transformers used by examples to 4.36 to fix open CVE.
Known Caveats
- Device number is restricted as 2/4/6/8/10/12 in the experimental supported collectives in single node.
- Do not use collectives (e.g.
AllReduce
) in nestedpjit
, it may cause random accuracy issue. Please refer JAX UTtestAutodiff
to understand the error scenario better.
Full Changelog: 0.2.0...0.2.1