NNFusion v0.2 Release

jlxue released this 25 Dec 08:46

· 116 commits to master since this release

7c96540

Major Features

Support the use of Python interface to accelerate the training and inference of PyTorch model
Support low-precision and mixed-precision model compilation, e.g., fp16
Provide auto kernel tuner integration:
- Add Antares IR for 60+ ops
- Support auto tuning via Antares tuning service
Support parallel training via SuperScaler
Enable local kernel cache through kernel database

Others

Update related documentations
Some enhancements on user experiences and bug fix

中文版本说明快捷通道-->#105 (comment)

Assets 2