Commit
Recompute KV cache for Phi3 when switching from short to long factor (#1161)

Verified that this PR fixes the issue for:
1. Phi3.5 mini
2. Phi3 mini 128K
3. Phi3 small
4. Phi3 medium
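
For readers unfamiliar with the issue, here is a minimal sketch of the idea, not the code from this PR. It assumes the rotary scaling factor is chosen by comparing the total sequence length against a threshold, so cache entries built with the short factor become stale once the sequence grows past it. The names `ToyDecoder`, `original_max_len`, `cache_factor`, and `recomputes` are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ToyDecoder:
    """Toy model of a decoder that tracks which factor its KV cache was built with."""

    original_max_len: int                 # assumed threshold between short and long factor
    tokens: List[int] = field(default_factory=list)
    cache_factor: str = "short"           # factor used for the entries currently cached
    cached_positions: int = 0             # number of positions held in the cache
    recomputes: int = 0                   # how many times the cache was rebuilt

    def _factor_for(self, total_len: int) -> str:
        # Assumption: the long factor applies once the total length exceeds the threshold.
        return "long" if total_len > self.original_max_len else "short"

    def append(self, new_tokens: List[int]) -> None:
        self.tokens.extend(new_tokens)
        needed = self._factor_for(len(self.tokens))
        if needed != self.cache_factor:
            # Cached entries were built with the other factor, so drop them
            # and rebuild the cache over the whole sequence.
            self.cached_positions = 0
            self.cache_factor = needed
            self.recomputes += 1
        # A real decoder would run the model on tokens[cached_positions:] here.
        self.cached_positions = len(self.tokens)
```

In this sketch the rebuild happens exactly once, at the step where the sequence first crosses the threshold; later steps reuse the cache as usual.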