What's Changed
💥 Improvements
- use weights iterator while loading by @RunningLeon in #2886
🐞 Bug fixes
- [dlinfer] fix ascend qwen2_vl graph_mode by @yao-fengchen in #3045
- fix error in interactive api by @lvhan028 in #3074
- fix sliding window mgr by @grimoire in #3068
- More arguments in api_client, update docstrings by @AllentDan in #3077
🌐 Other
- [ci] add internlm3 into testcase by @zhulinJulia24 in #3038
- add internlm3 to supported models by @lvhan028 in #3041
- update pre-commit config by @lvhan028 in #2683
- [maca] add cudagraph support on maca backend. by @Reinerzhou in #2834
- bump version to v0.7.0.post1 by @lvhan028 in #3076
Full Changelog: v0.7.0...v0.7.0.post1