v0.5.1
What's Changed
- feat(openai): Apply chat template for GGUF loader by @drummerv in #312
- Calculate total memory usage by @sgsdxzy in #316
- chore: add new iMatrix quants by @AlpinDale in #320
- fix: optimize AQLM dequantization by @AlpinDale in #325
New Contributors
Full Changelog: v0.5.0...v0.5.1