[LLM] Add deepseekv2 #9061

DrownFish19 · 2024-08-30T11:18:15Z

PR types

New features

PR changes

Models

Description

Add DeepSeekV2.

paddle-bot · 2024-08-30T11:18:21Z

Thanks for your contribution!

codecov · 2024-08-30T11:49:07Z

Codecov Report

Attention: Patch coverage is 14.64435% with 816 lines in your changes missing coverage. Please review.

Project coverage is 52.91%. Comparing base (8212b53) to head (c33429e).

Files with missing lines	Patch %	Lines
paddlenlp/transformers/deepseek_v2/modeling.py	14.15%	764 Missing ⚠️
...addlenlp/transformers/deepseek_v2/configuration.py	13.04%	40 Missing ⚠️
...ddlenlp/transformers/deepseek_v2/tokenizer_fast.py	29.41%	12 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9061      +/-   ##
===========================================
- Coverage    53.26%   52.91%   -0.35%     
===========================================
  Files          652      655       +3     
  Lines       105615   106571     +956     
===========================================
+ Hits         56254    56394     +140     
- Misses       49361    50177     +816

ZHUI · 2024-09-05T12:59:06Z

paddlenlp/transformers/deepseek_v2/modeling.py

+        if version != "0.0.0" and version <= "2.5.2":
+            attn_output, attn_weights = flash_attention(
+                query_states,
+                key_states,
+                value_states,
+                causal=True,
+                return_softmax=output_attentions,
+            )
+            attn_output *= (head_dim ** (0.5)) * softmax_scale
+            attn_weights *= (head_dim ** (0.5)) * softmax_scale
+        else:


Let's remove this branch; it should no longer be needed. It can be cleaned up everywhere in one pass.
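A side note on why a gate like `version <= "2.5.2"` is fragile even before it is removed: comparing version strings is lexicographic, so `"2.10.0"` sorts before `"2.5.2"`. The sketch below is illustrative only. `parse_version` and `needs_legacy_flash_attention` are hypothetical helpers, not part of this PR or of Paddle, and the reading of `"0.0.0"` as a development build is an assumption taken from how the diff special-cases it.

```python
def parse_version(version: str) -> tuple:
    """Turn '2.5.2' into (2, 5, 2), keeping only the leading digits of each part."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break  # stop at suffixes such as 'rc1'
        parts.append(int(digits) if digits else 0)
    return tuple(parts)


def needs_legacy_flash_attention(version: str, cutoff: str = "2.5.2") -> bool:
    """Hypothetical gate mirroring the condition in the diff, compared numerically."""
    # Assumption: "0.0.0" marks a development build and never takes the legacy path.
    if version == "0.0.0":
        return False
    return parse_version(version) <= parse_version(cutoff)


# The plain string comparison misclassifies a two-digit minor version.
string_says_legacy = "2.10.0" <= "2.5.2"
tuple_says_legacy = needs_legacy_flash_attention("2.10.0")
```

Dropping the branch, as suggested above, avoids the question entirely; the sketch only matters if a similar version gate is kept elsewhere.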

ZHUI · 2024-09-05T13:03:21Z

paddlenlp/transformers/deepseek_v2/modeling.py

+        if config.sequence_parallel and use_sequence_parallel:
+            mark_as_sequence_parallel_parameter(self.weight)
+
+    def forward(self, hidden_states):


Consider supporting fuse_rms_norm.
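For context, a fused RMSNorm kernel would be expected to produce the same values as the unfused computation, only in a single pass. The pure-Python reference below is a minimal sketch of that computation under the usual RMSNorm definition. The name `rms_norm` and the default `eps` are assumptions for illustration; this is not the PR's `forward` and not Paddle's fused operator.

```python
import math


def rms_norm(hidden_states, weight, eps=1e-6):
    """Reference RMSNorm over one vector: y_i = w_i * x_i / sqrt(mean(x^2) + eps)."""
    # Mean of squares over the hidden dimension.
    variance = sum(x * x for x in hidden_states) / len(hidden_states)
    inv_rms = 1.0 / math.sqrt(variance + eps)
    # Scale each element by the inverse RMS and the learned per-channel weight.
    return [w * x * inv_rms for x, w in zip(hidden_states, weight)]


normalized = rms_norm([3.0, 4.0], [1.0, 1.0], eps=0.0)
```

A reference like this can serve as the baseline in a numerical test when a fused path is added, comparing both outputs within a tolerance.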

CLAassistant · 2024-09-19T11:53:57Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ DrownFish19
❌ Mangodadada

DrownFish19 and others added 2 commits August 30, 2024 04:05

add deepseekv2

c1278c5

Merge branch 'PaddlePaddle:develop' into dev_20240819_add_deepseekv2

c0b6728

DrownFish19 and others added 8 commits September 2, 2024 11:55

update

875fc64

update softmax_scale and attn_weights

f798697

fix dtype during forward

37b4cf4

update create_parameter

2bbea4a

update

52ffa3b

update modeling

1d7ea20

Merge branch 'PaddlePaddle:develop' into dev_20240819_add_deepseekv2

ffa11c4

update

788290a

ZHUI reviewed Sep 5, 2024

DrownFish19 and others added 4 commits September 13, 2024 15:34

Merge branch 'PaddlePaddle:develop' into dev_20240819_add_deepseekv2

bb9e0df

Merge remote-tracking branch 'paddlenlp/develop' into dev_20240819_add_deepseekv2

f2acc7c

update flash_attention

c191adc

Merge branch 'PaddlePaddle:develop' into dev_20240819_add_deepseekv2

e27d423

DrownFish19 added 3 commits September 20, 2024 03:27

Merge branch 'dev_20240819_add_deepseekv2' of github.com:DrownFish19/PaddleNLP into dev_20240819_add_deepseekv2

29f227c

Merge remote-tracking branch 'paddlenlp/develop' into dev_20240819_add_deepseekv2

b00a9a6

Merge remote-tracking branch 'paddlenlp/develop' into dev_20240819_add_deepseekv2

c33429e
