v1.4.0

@OlivierDehaene OlivierDehaene released this 02 Jul 15:17
· 21 commits to main since this release
a0549e6

Notable Changes

  • CUDA support for the Qwen2 model architecture

What's Changed

  • feat(candle): support Qwen2 on Cuda by @OlivierDehaene in #316
  • fix(candle): fix last token pooling

Full Changelog: v1.3.0...v1.4.0