[WIP] support for more vlms #390

n1ck-guo · 2024-12-19T06:08:06Z

support for more vlms: solvlm, aria(wip), llama-3.2v-cot
vl 70b+ on single card
new processor for all hf-model
modify get_multimodal_block_names (find language/vision/...)

Signed-off-by: n1ck-guo <[email protected]>

wenhuach21 · 2024-12-24T01:20:48Z

auto_round/mllm/autoround_mllm.py

@@ -160,6 +160,9 @@ def __init__(
                self.template, model=model, tokenizer=tokenizer, processor=processor, image_processor=image_processor)
            dataset = self.template.default_dataset if dataset is None else dataset

+        if model.config.model_type == "deepseek_vl_v2":


the setting here is a little tricky. Could the quantizing-non-text-module still be supported?

support for more vlms

919995d

Signed-off-by: n1ck-guo <[email protected]>

n1ck-guo requested review from WeiweiZhang1 and wenhuach21 and removed request for WeiweiZhang1 December 19, 2024 06:08

Merge branch 'main' into hengguo/more_vlms_support

f1c9dff

wenhuach21 reviewed Dec 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] support for more vlms #390

[WIP] support for more vlms #390

n1ck-guo commented Dec 19, 2024 •

edited

Loading

wenhuach21 Dec 24, 2024

[WIP] support for more vlms #390

Are you sure you want to change the base?

[WIP] support for more vlms #390

Conversation

n1ck-guo commented Dec 19, 2024 • edited Loading

wenhuach21 Dec 24, 2024

Choose a reason for hiding this comment

n1ck-guo commented Dec 19, 2024 •

edited

Loading