How is stage 1 used in the next stage? #3365
Replies: 2 comments 1 reply
-
Hi @raihan0824 For 3 stages, you can refer to https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat#how-to-use |
Beta Was this translation helpful? Give feedback.
-
Hii everyone
It is mentioned that i need to used the pretrain from base |
Beta Was this translation helpful? Give feedback.
-
Hello, I am relatively new to this field, so I’m still confused a little about how the pipeline is structured. Stage 1 is supervised fine-tuning, the output is an sft model. My question is simple: How and where does the sft model used in the next stage?
Also, is there any interface/UI to generate the rlhf dataset? Because in my case, I want the model to be specific to my needs.
Also, what are the hardware requirements for training, and how long does it takes to complete the training?
Thank you!
Beta Was this translation helpful? Give feedback.
All reactions