v0.4.13
What's Changed
- Make 0.4.13.dev1 version by @rasbt in #1722
- Enable MPS support for LitGPT by @rasbt in #1724
- Simplify MPS support by @rasbt in #1726
- Add Chainlit Studio by @rasbt in #1728
- Fixing the tokenizer for slimpajama data preparation by @tomaslaz in #1734
- Add pretrain conversion by @rasbt in #1735
- Typo fix and formatting improvements in API Trainer docs by @rasbt in #1736
- bump macos to m1 by @t-vi in #1725
- Improve filepath handling in unit tests by @rasbt in #1737
- Add a more informative message in case text exceeds context size by @rasbt in #1738
- Update Thunder README.md by @rasbt in #1740
- Add sliding window attention to Mistral and Phi 3 by @rasbt in #1741
- Extend context length for sliding window tests by @rasbt in #1742
- Fix jsonargparse version by @rasbt in #1748
- Update RoPE tests by @rasbt in #1746
- Make json parsing more robust by @rasbt in #1749
- Support for optimizers which don't have "fused" parameter such as grokadamw and 8bit bnb by @mtasic85 in #1744
- Increase rtol and atol in Gemma 2 for macOS by @rasbt in #1751
- Repair json files by @rasbt in #1752
- Llama 3.2 weights by @rasbt in #1750
- Bump version to 0.4.13 for new release by @rasbt in #1753
- Temporarily take out thunder dependency for deployment by @rasbt in #1754
New Contributors
Full Changelog: v0.4.12...v0.4.13