Skip to content

v0.1.4

Latest
Compare
Choose a tag to compare
@UnicornChan UnicornChan released this 30 Aug 13:52
· 29 commits to main since this release
022b893

Bug fix

  1. Fix bug that ktransformers cannot offload whole layer in cpu.
  2. Update DeepseekV2‘s multi gpu yaml examples to evenly allocate layers.
  3. Update Docker file.
  4. Fix bug about Qwen2-57B can not loaded
  5. Fix bug with #66 , add requirements for uvicorn