v1.4.2
Highlights
- Add support for Google Gemma models
What's Changed
- Fix mistral with length > window_size for long prefills (rotary doesn't create long enough cos, sin). by @Narsil in #1571
- improve endpoint support by @drbh in #1577
- refactor syntax to correctly include structs by @drbh in #1580
- fix openapi and add jsonschema validation by @OlivierDehaene in #1578
- add support for Gemma by @OlivierDehaene in #1583
Full Changelog: v1.4.1...v1.4.2