[LLM] Add tools for parameters #9137

Hanyonggong · 2024-09-13T07:28:40Z

PR types

Others

PR changes

Others

Description

add some llm tools:

llm/tools/convert_gqa_to_mha.py: convert model from gqa to mha
llm/tools/split_weights.py: split model weight according to tensor parallel degree

paddle-bot · 2024-09-13T07:28:45Z

Thanks for your contribution!

codecov · 2024-09-13T08:01:45Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 51.98%. Comparing base (9806293) to head (a912ff6).
Report is 50 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9137      +/-   ##
===========================================
- Coverage    53.34%   51.98%   -1.36%     
===========================================
  Files          652      657       +5     
  Lines       105401   110297    +4896     
===========================================
+ Hits         56222    57342    +1120     
- Misses       49179    52955    +3776

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ZHUI · 2024-09-13T08:18:22Z

emm，你们搞得太复杂了。safetensor本来就支持多文件直接加载，还支持mmap切分load。不需要这么麻烦的。

ZHUI

@DesmonDay 看看我们已有的方法能不能帮推理直接切分参数。

DesmonDay · 2024-09-14T03:38:10Z

对safetensors格式的权重做TP切分，我们在 https://github.com/PaddlePaddle/PaddleNLP/blob/develop/paddlenlp/transformers/model_utils.py#L2770 这个函数中已经做了实现，直接调用该函数就行，不需要对safetensors合并后再拆分。 @Hanyonggong

DesmonDay · 2024-09-14T03:39:21Z

llm/tools/convert_gqa_to_mha.py

+    config_file = open(config_path, "r")
+    config_json = json.load(config_file)
+
+    model = paddle.load(gqa_model_path)


如果对于safetensors格式的权重，就不支持了？

DesmonDay · 2024-09-14T03:39:32Z

llm/tools/merge_satetensors.py

@@ -0,0 +1,65 @@
+# Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved.
+#


这个文件可以直接删掉。

DesmonDay · 2024-09-14T03:40:01Z

llm/tools/split_weights.py

+
+    model_state_dict = paddle.load(model_path)
+
+    state_dict = model.convert_tensor_parallel(


前面的切分TP权重这部分，直接调用https://github.com/PaddlePaddle/PaddleNLP/blob/develop/paddlenlp/transformers/model_utils.py#L2770

感觉你没有理解我的意思。我的意思是切分TP权重的这个函数，convert_tensor_parallel这个东西，不需要你自己调用了，前面的load_tp_checkpoint已经做了这个功能，拿到的state_dict就是切分后的。

CLAassistant · 2024-09-19T15:09:51Z

All committers have signed the CLA.

DesmonDay

LGTM

add tools

1bab416

ZHUI requested changes Sep 13, 2024

View reviewed changes

DesmonDay self-requested a review September 14, 2024 03:38

DesmonDay reviewed Sep 14, 2024

View reviewed changes

use load_tp_checkpoint

07fa343

Hanyonggong added 2 commits September 20, 2024 11:07

fix bugs

ecd10a1

mod file

a912ff6

DesmonDay approved these changes Oct 9, 2024

View reviewed changes

yuanlehome approved these changes Oct 9, 2024

View reviewed changes

ZHUI closed this Oct 9, 2024

ZHUI reopened this Oct 9, 2024

ZHUI approved these changes Oct 9, 2024

View reviewed changes

yuanlehome closed this Oct 10, 2024

yuanlehome reopened this Oct 10, 2024

ZHUI merged commit 37c211a into PaddlePaddle:develop Oct 10, 2024
12 of 16 checks passed

Hanyonggong mentioned this pull request Oct 15, 2024

[WeeklyReports] 2024.09.09~2024.09.23 周报汇总 PFCCLab/Camp#382

Open

26 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLM] Add tools for parameters #9137

[LLM] Add tools for parameters #9137

Hanyonggong commented Sep 13, 2024 •

edited by yuanlehome

Loading

paddle-bot bot commented Sep 13, 2024

codecov bot commented Sep 13, 2024 •

edited

Loading

ZHUI commented Sep 13, 2024

ZHUI left a comment

DesmonDay commented Sep 14, 2024 •

edited

Loading

DesmonDay Sep 14, 2024

DesmonDay Sep 14, 2024

Hanyonggong Sep 19, 2024

DesmonDay Sep 14, 2024

Hanyonggong Sep 19, 2024

DesmonDay Sep 20, 2024

Hanyonggong Sep 23, 2024

CLAassistant commented Sep 19, 2024 •

edited

Loading

DesmonDay left a comment

		@@ -0,0 +1,65 @@
		# Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved.
		#


		model_state_dict = paddle.load(model_path)

		state_dict = model.convert_tensor_parallel(

[LLM] Add tools for parameters #9137

[LLM] Add tools for parameters #9137

Conversation

Hanyonggong commented Sep 13, 2024 • edited by yuanlehome Loading

PR types

PR changes

Description

paddle-bot bot commented Sep 13, 2024

codecov bot commented Sep 13, 2024 • edited Loading

Codecov Report

ZHUI commented Sep 13, 2024

ZHUI left a comment

Choose a reason for hiding this comment

DesmonDay commented Sep 14, 2024 • edited Loading

DesmonDay Sep 14, 2024

Choose a reason for hiding this comment

DesmonDay Sep 14, 2024

Choose a reason for hiding this comment

Hanyonggong Sep 19, 2024

Choose a reason for hiding this comment

DesmonDay Sep 14, 2024

Choose a reason for hiding this comment

Hanyonggong Sep 19, 2024

Choose a reason for hiding this comment

DesmonDay Sep 20, 2024

Choose a reason for hiding this comment

Hanyonggong Sep 23, 2024

Choose a reason for hiding this comment

CLAassistant commented Sep 19, 2024 • edited Loading

DesmonDay left a comment

Choose a reason for hiding this comment

Hanyonggong commented Sep 13, 2024 •

edited by yuanlehome

Loading

codecov bot commented Sep 13, 2024 •

edited

Loading

DesmonDay commented Sep 14, 2024 •

edited

Loading

CLAassistant commented Sep 19, 2024 •

edited

Loading