[BugFix] fix_outputlayer.weight_distributed #9135
base: develop
Conversation
Thanks for your contribution!
@@ -1126,7 +1125,10 @@ def forward(self, hidden_states, return_last_logit=False):
         if self.config.sequence_parallel:
             hidden_states = GatherOp.apply(hidden_states)
             hidden_states = paddle.reshape_(hidden_states, [self.config.seq_length, -1, self.config.hidden_size])
-        logits = parallel_matmul(hidden_states, self.decoder_weight, self.config.tensor_parallel_output)
+        if self.config.tensor_parallel_degree > 1:
+            logits = parallel_matmul(hidden_states, self.weight, self.config.tensor_parallel_output)
Why do a parallel_matmul here? As far as I can tell, only the non-TP case enters this forward.
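For context on the reviewer's question, here is a rough, hypothetical sketch of what a tensor-parallel logits matmul like parallel_matmul typically does (this is an illustration of the idea, not PaddleNLP's actual implementation):

# Hypothetical sketch of a tensor-parallel logits matmul; an illustration
# of the idea only, not PaddleNLP's actual parallel_matmul.
import paddle
import paddle.distributed as dist
from paddle.distributed import fleet

def parallel_matmul_sketch(x, weight, tensor_parallel_output=True):
    # Each rank holds a vocab shard of the weight: [hidden, vocab / mp_degree],
    # so the local matmul produces only this rank's slice of the logits.
    logits = paddle.matmul(x, weight)
    hcg = fleet.get_hybrid_communicate_group()
    mp_group = hcg.get_model_parallel_group()
    if mp_group.nranks > 1 and not tensor_parallel_output:
        # Recover full-vocab logits on every rank by gathering the shards
        # and concatenating along the vocabulary axis.
        shards = []
        dist.all_gather(shards, logits, group=mp_group)
        logits = paddle.concat(shards, axis=-1)
    return logits

Under this reading, calling it in a forward path that only non-TP runs reach would be redundant, which is what the comment above is questioning.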
@@ -1238,9 +1240,9 @@ def forward(
         lm_logits = parallel_matmul(
Why manually call parallel_matmul here? The inner forward function already supports the tp > 1 case; it would be better to settle on a single layer at which parallel_matmul is applied.
Codecov Report
Attention: Patch coverage is …
Additional details and impacted files

@@            Coverage Diff             @@
##           develop    #9135      +/-   ##
===========================================
- Coverage    53.29%   53.28%   -0.02%
===========================================
  Files          652      652
  Lines       105483   105579      +96
===========================================
+ Hits         56222    56254      +32
- Misses       49261    49325      +64

☔ View full report in Codecov by Sentry.
PR types
Bug fixes
PR changes
Models
Description
When parameters are sharded manually, is_distributed must be set to True, so that the parameter broadcast in distributed_model skips these sharded parameters.
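As a minimal sketch of that mechanism (class name and shapes here are illustrative, not this PR's actual code):

# Hypothetical sketch: each tensor-parallel rank creates its own vocab shard
# of the output-layer weight, then marks it so fleet.distributed_model()
# skips it when broadcasting parameters from rank 0.
import paddle
from paddle import nn

class OutputLayerSketch(nn.Layer):
    def __init__(self, hidden_size, vocab_size, tensor_parallel_degree):
        super().__init__()
        vocab_shard = vocab_size // tensor_parallel_degree
        self.weight = self.create_parameter(
            shape=[hidden_size, vocab_shard],
            dtype=paddle.get_default_dtype(),
        )
        if tensor_parallel_degree > 1:
            # Without this flag, the broadcast would overwrite every rank's
            # shard with rank 0's values, corrupting the sharded layout.
            self.weight.is_distributed = True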
Initial loss drops from 11 to 1.63.