Skip to content

0.0.7post2

Compare
Choose a tag to compare
@bclavie bclavie released this 13 Feb 21:45
· 28 commits to main since this release
b7ae28a

Fixes & tweaks to the previous release:

  • Automatically adjust batch size on longer contexts (32 for 512 tokens, 16 for 1024, 8 for 2048, decreasing like this until a minimum of 1)
  • Apply dynamic max context length to reranking