Version v0.2.8 Release Today!
What's Changed
Release
Format
- [format] applied code formatting on changed files in pull request 3300 (#3302) by github-actions[bot]
- [format] applied code formatting on changed files in pull request 3296 (#3298) by github-actions[bot]
Doc
- [doc] add ColossalChat news (#3304) by binmakeswell
- [doc] add ColossalChat (#3297) by binmakeswell
- [doc] fix typo (#3222) by binmakeswell
- [doc] update chatgpt doc paper link (#3229) by Camille Zhong
- [doc] add community contribution guide (#3153) by binmakeswell
- [doc] add Intel cooperation for biomedicine (#3108) by binmakeswell
Application
Chat
Coati
- [coati] fix inference profanity check (#3299) by ver217
- [coati] inference supports profanity check (#3295) by ver217
- [coati] add repetition_penalty for inference (#3294) by ver217
- [coati] fix inference output (#3285) by ver217
- [Coati] first commit (#3283) by Fazzie-Maqianli
Colossalchat
- [ColossalChat]add cite for datasets (#3292) by Fazzie-Maqianli
Examples
- [examples] polish AutoParallel readme (#3270) by YuliangLiu0306
- [examples] Solving the diffusion issue of incompatibility issue#3169 (#3170) by NatalieC323
Fx
- [fx] meta registration compatibility (#3253) by HELSON
- [FX] refactor experimental tracer and adapt it with hf models (#3157) by YuliangLiu0306
Booster
- [booster] implemented the torch ddd + resnet example (#3232) by Frank Lee
- [booster] implemented the cluster module (#3191) by Frank Lee
- [booster] added the plugin base and torch ddp plugin (#3180) by Frank Lee
- [booster] added the accelerator implementation (#3159) by Frank Lee
- [booster] implemented mixed precision class (#3151) by Frank Lee
Ci
- [CI] Fix pre-commit workflow (#3238) by Hakjin Lee
Api
- [API] implement device mesh manager (#3221) by YuliangLiu0306
- [api] implemented the checkpoint io module (#3205) by Frank Lee
Hotfix
- [hotfix] skip torchaudio tracing test (#3211) by YuliangLiu0306
- [hotfix] layout converting issue (#3188) by YuliangLiu0306
Chatgpt
- [chatgpt] add precision option for colossalai (#3233) by ver217
- [chatgpt] unnify datasets (#3218) by Fazzie-Maqianli
- [chatgpt] support instuct training (#3216) by Fazzie-Maqianli
- [chatgpt]add reward model code for deberta (#3199) by Yuanchen
- [chatgpt]support llama (#3070) by Fazzie-Maqianli
- [chatgpt] add supervised learning fine-tune code (#3183) by pgzhang
- [chatgpt]Reward Model Training Process update (#3133) by BlueRum
- [chatgpt] fix trainer generate kwargs (#3166) by ver217
- [chatgpt] fix ppo training hanging problem with gemini (#3162) by ver217
- [chatgpt]update ci (#3087) by BlueRum
- [chatgpt]Fix examples (#3116) by BlueRum
- [chatgpt] fix lora support for gpt (#3113) by BlueRum
- [chatgpt] type miss of kwargs (#3107) by hiko2MSP
- [chatgpt] fix lora save bug (#3099) by BlueRum
Lazyinit
- [lazyinit] combine lazy tensor with dtensor (#3204) by ver217
- [lazyinit] add correctness verification (#3147) by ver217
- [lazyinit] refactor lazy tensor and lazy init ctx (#3131) by ver217
Auto
Analyzer
- [Analyzer] fix analyzer tests (#3197) by YuliangLiu0306
Dreambooth
- [dreambooth] fixing the incompatibity in requirements.txt (#3190) by NatalieC323
Auto-parallel
Zero
Test
- [test] fixed torchrec registration in model zoo (#3177) by Frank Lee
- [test] fixed torchrec model test (#3167) by Frank Lee
- [test] add torchrec models to test model zoo (#3139) by YuliangLiu0306
- [test] added transformers models to test model zoo (#3135) by Frank Lee
- [test] added torchvision models to test model zoo (#3132) by Frank Lee
- [test] added timm models to test model zoo (#3129) by Frank Lee
Refactor
- [refactor] update docs (#3174) by Saurav Maheshkar
Tests
- [tests] model zoo add torchaudio models (#3138) by ver217
- [tests] diffuser models in model zoo (#3136) by HELSON
Docker
- [docker] Add opencontainers image-spec to
Dockerfile
(#3006) by Saurav Maheshkar
Dtensor
- [DTensor] refactor dtensor with new components (#3089) by YuliangLiu0306
Workflow
Autochunk
- [autochunk] support complete benchmark (#3121) by Xuanlei Zhao
Tutorial
- [tutorial] update notes for TransformerEngine (#3098) by binmakeswell
Nvidia
- [NVIDIA] Add FP8 example using TE (#3080) by Kirthi Shankar Sivamani
Full Changelog: v0.2.8...v0.2.7