How to combine ZeRO optimization + TP + PP #2181
Unanswered
yhcc
asked this question in
Community | Q&A
Replies: 1 comment 5 replies
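ZeRO2 is currently under development, and a ZeRO2+PP example will be added once it is ready. ZeRO3+PP is not recommended because it is relatively inefficient.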
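For context on the ZeRO2 vs ZeRO3 distinction mentioned above, here is a rough sketch of what each ZeRO stage shards across the data-parallel group. It uses the usual mixed-precision Adam accounting (2 bytes of fp16 parameters, 2 bytes of fp16 gradients, and 12 bytes of fp32 optimizer state per parameter). The function names are hypothetical and not part of ColossalAI, and the even split of parameters across TP and PP ranks is an assumption; activations, buffers, and fragmentation are ignored.

```python
def zero_model_state_bytes(num_params, dp_size, stage):
    """Approximate per-GPU bytes of model states under mixed-precision Adam.

    num_params: parameters held by this rank (hypothetical input)
    dp_size:    data-parallel group size that ZeRO shards across
    stage:      0 (no ZeRO), 1, 2, or 3
    """
    if stage not in (0, 1, 2, 3):
        raise ValueError("stage must be 0, 1, 2, or 3")
    if dp_size < 1:
        raise ValueError("dp_size must be >= 1")

    fp16_params = 2 * num_params
    fp16_grads = 2 * num_params
    # fp32 master weights + Adam momentum + Adam variance: 4 + 4 + 4 bytes
    optim_states = 12 * num_params

    if stage >= 1:  # stage 1 shards optimizer states
        optim_states /= dp_size
    if stage >= 2:  # stage 2 additionally shards gradients
        fp16_grads /= dp_size
    if stage >= 3:  # stage 3 additionally shards the parameters themselves
        fp16_params /= dp_size

    return fp16_params + fp16_grads + optim_states


def params_per_rank(total_params, tp_size, pp_size):
    """Assumption: TP and PP split the model's parameters evenly."""
    return total_params / (tp_size * pp_size)


if __name__ == "__main__":
    # Illustrative numbers only: a 1.5B-parameter model with TP=2, PP=2, DP=4.
    local = params_per_rank(1.5e9, tp_size=2, pp_size=2)
    for stage in range(4):
        gib = zero_model_state_bytes(local, dp_size=4, stage=stage) / 2**30
        print(f"ZeRO stage {stage}: ~{gib:.2f} GiB of model states per GPU")
```

Under this accounting, only stage 3 partitions the parameters themselves, so it is the only stage that has to gather them again at compute time; stage 2 keeps a full fp16 copy of the local parameters on every data-parallel rank.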
Hello, according to #1839, it seems that ZeroInitContext is no longer recommended and should be replaced with the approach used in https://github.com/hpcaitech/ColossalAI/blob/main/examples/language/gpt/train_gpt_demo.py. Also, as mentioned in https://colossalaiworkspace.slack.com/archives/C02NAJARJ9Y/p1665482115336439?thread_ts=1665444351.685429&cid=C02NAJARJ9Y, ZeroDDP has not been tested with PP. However, https://github.com/hpcaitech/ColossalAI-Examples/blob/main/language/gpt/train_gpt.py contains an example that uses ZeroInitContext together with TP+PP. If ZeroInitContext is no longer recommended, is there an example I can refer to for the updated approach?