Bug fix
- Fix bug that ktransformers cannot offload whole layer in cpu.
- Update DeepseekV2‘s multi gpu yaml examples to evenly allocate layers.
- Update Docker file.
- Fix bug about Qwen2-57B can not loaded
- Fix bug with #66 , add requirements for uvicorn
Bug fix