Highlights:
- fix incorrect device setting in autoround format inference by @WeiweiZhang1 in #383
- remove the dependency on AutoGPTQ for CPU by @XuehaoSun in #380
What's Changed
- support_llava_hf_vlm_example by @WeiweiZhang1 in #381
- fix block_name_to_quantize by @WeiweiZhang1 in #382
- fix incorrect device setting in autoround format inference by @WeiweiZhang1 in #383
- refine homepage, update model links by @WeiweiZhang1 in #385
- update eval basic usage by @n1ck-guo in #384
- refine error msg and dump more log in the tuning by @wenhuach21 in #386
- remove the dependency on AutoGPTQ for CPU and bump to V0.4.3 by @XuehaoSun in #380
Full Changelog: v0.4.2...v0.4.3