NNFusion v0.2 Release
Major Features
- Support the use of Python interface to accelerate the training and inference of PyTorch model
- Support low-precision and mixed-precision model compilation, e.g., fp16
- Provide auto kernel tuner integration:
- Support parallel training via SuperScaler
- Enable local kernel cache through kernel database
Others
- Update related documentations
- Some enhancements on user experiences and bug fix
中文版本说明快捷通道-->#105 (comment)