v1.0.2
What's Changed
- Have snippets in Python/JavaScript in quicktour by @osanseviero in #809
- Added two more features in readme.md file by @sawanjr in #831
- Fix rope dynamic + factor by @Narsil in #822
- fix: LlamaTokenizerFast to AutoTokenizer at flash_llama.py by @dongs0104 in #619
- README edit -- running the service with no GPU or CUDA support by @pminervini in #773
- Fix
tokenizers==0.13.4
. by @Narsil in #838 - Update README.md by @adarshxs in #848
- Fixing watermark. by @Narsil in #851
- Misc minor improvements for InferenceClient docs by @osanseviero in #852
- "Fix" for rw-1b. by @Narsil in #860
- Upgrading versions of python client. by @Narsil in #862
- Adding Idefics multi modal model. by @Narsil in #842
- Add streaming guide by @osanseviero in #858
- Adding small benchmark script. by @Narsil in #881
New Contributors
- @sawanjr made their first contribution in #831
- @dongs0104 made their first contribution in #619
- @pminervini made their first contribution in #773
- @adarshxs made their first contribution in #848
Full Changelog: v1.0.1...v1.0.2