add NormalizedConfig support for qwen, baichuan, chatglm #1490
Conversation
Signed-off-by: changwangss <[email protected]>
"mixformer-sequential": GPTBigCodeNormalizedTextConfig, | ||
"baichuan": NormalizedTextConfig, | ||
"qwen": NormalizedTextConfig, | ||
"chatglm": NormalizedTextConfig.with_args(num_layers="num_layers"), |
What about the vocab size? Shouldn't it be padded_vocab_size for ChatGLM models?
https://huggingface.co/THUDM/chatglm3-6b/blob/main/config.json#L32
ChatGLM has three model generations: I find that the original chatglm uses vocab_size, but chatglm2 and chatglm3 use padded_vocab_size. Could you help me deal with this situation? @echarlaix
https://huggingface.co/THUDM/chatglm-6b/blob/main/config.json#L27
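For reference, a minimal sketch of one way to handle the split, assuming the with_args remapping API already used by the chatglm entry in the diff above; the chatglm_normalized_config helper and the hasattr branching are illustrative, not part of this PR:

```python
from optimum.utils import NormalizedTextConfig

# Illustrative helper (not part of this PR): pick the vocabulary-size
# attribute per checkpoint, since the original chatglm config exposes
# vocab_size while chatglm2/chatglm3 expose padded_vocab_size.
def chatglm_normalized_config(model_config):
    vocab_attr = (
        "padded_vocab_size"
        if hasattr(model_config, "padded_vocab_size")
        else "vocab_size"
    )
    # with_args pre-binds the attribute-name remapping, mirroring the
    # chatglm entry in the diff above; calling the result wraps the config.
    return NormalizedTextConfig.with_args(
        num_layers="num_layers",
        vocab_size=vocab_attr,
    )(model_config)
```

The alternative would be a dedicated ChatGLM-specific NormalizedTextConfig subclass, but a per-checkpoint remap keeps the manager's mapping table flat.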
Phi and Mixtral have been added to transformers, so Phi and Mixtral are being added separately with #1625 first.
@@ -262,6 +262,11 @@ class NormalizedConfigManager:
"whisper": WhisperLikeNormalizedTextConfig,
"xlm-roberta": NormalizedTextConfig,
"yolos": NormalizedVisionConfig,
"mpt": MPTNormalizedTextConfig,
mpt is already in the list.
"baichuan": NormalizedTextConfig, | ||
"qwen": NormalizedTextConfig, | ||
"chatglm": NormalizedTextConfig.with_args(num_layers="num_layers"), |
qwen2 is now available in transformers. baichuan and chatglm are not.
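For context, a hedged usage sketch of how a registered entry is consumed downstream, assuming optimum's NormalizedConfigManager.get_normalized_config_class accessor and that the qwen remote config exposes the standard num_hidden_layers and num_attention_heads names (the checkpoint name is illustrative):

```python
from transformers import AutoConfig
from optimum.utils import NormalizedConfigManager

# Usage sketch: once a model type is registered in the manager's table,
# downstream code reads architecture attributes through one interface.
# Qwen checkpoints ship their config as remote code, hence the flag.
config = AutoConfig.from_pretrained("Qwen/Qwen-7B", trust_remote_code=True)
normalized_cls = NormalizedConfigManager.get_normalized_config_class(config.model_type)
normalized = normalized_cls(config)
print(normalized.num_layers, normalized.num_attention_heads)
```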
What does this PR do?
Fixes # (issue)
Before submitting