Loading the chatglm-6b model with strategy.model_init_context() reports a mismatch #3761
Unanswered
zhangyuanscall
asked this question in
Community | Q&A
Replies: 0 comments
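My loading code is shown below. world_size is 4, and running on a single machine with 4 GPUs reports a mismatch. transformer.word_embeddings.weight has shape [130528, 4096]; after chunk it becomes [130528, 1024] (the variable x in the code), but p has a dimension of 256 (the 4096 has been split down to 256). chatglm_model_path points to the Hugging Face weights for chatglm.
The error log is as follows: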
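For anyone following the numbers in the question, here is a minimal sketch of the shape arithmetic only. It is not the loading code or the log from the question, and it calls no library. The helper `chunk_last_dim` is a hypothetical name, and treating 256 as the last dimension of p is an assumption.

```python
def chunk_last_dim(shape, num_chunks):
    """Return the per-rank shape after splitting the last dimension evenly."""
    *lead, last = shape
    if last % num_chunks != 0:
        raise ValueError(f"{last} is not divisible by {num_chunks}")
    return (*lead, last // num_chunks)


world_size = 4                      # from the question
full_shape = (130528, 4096)         # transformer.word_embeddings.weight
p_last_dim = 256                    # assumed to be the last dimension of p

# One split across world_size ranks gives the shape reported for x.
x_shape = chunk_last_dim(full_shape, world_size)

# How many ways the 4096 dimension would have to be split to reach 256.
effective_split = full_shape[-1] // p_last_dim

print("x shape:", x_shape)
print("effective split for p:", effective_split)
print("equals world_size squared:", effective_split == world_size ** 2)
```

A 16-way split is what you get by dividing the 4096 dimension by world_size twice. That is one reading consistent with the reported numbers, but the question does not confirm it.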