You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
https://arxiv.org/abs/2303.11366 Is a really cool paper about reflection in LLMs
That is after training on like 20 samples for 50 epochs on my 3090 on the 7B model.
User: [Topic or question]
Assistant Hypothetical Response: [Brief or simplified answer to the topic or question]
Agent Reflection: [Critique of the hypothetical response, highlighting the limitations, inaccuracies, or areas that need improvement or expansion, while providing guidance on how to address these issues in the revised response]
Bot Actual Response: [The natural and contextually appropriate answer to the topic or question, as generated by the advanced language model, which incorporates the suggestions and improvements from the agent reflection for a more comprehensive and accurate response]
This + training sets generated with this frame work seem to really improve the generations of these models with fairly limited training sets. Just thought i would share.
The text was updated successfully, but these errors were encountered:
https://arxiv.org/abs/2303.11366 Is a really cool paper about reflection in LLMs
That is after training on like 20 samples for 50 epochs on my 3090 on the 7B model.
User: [Topic or question]
Assistant Hypothetical Response: [Brief or simplified answer to the topic or question]
Agent Reflection: [Critique of the hypothetical response, highlighting the limitations, inaccuracies, or areas that need improvement or expansion, while providing guidance on how to address these issues in the revised response]
Bot Actual Response: [The natural and contextually appropriate answer to the topic or question, as generated by the advanced language model, which incorporates the suggestions and improvements from the agent reflection for a more comprehensive and accurate response]
This + training sets generated with this frame work seem to really improve the generations of these models with fairly limited training sets. Just thought i would share.
The text was updated successfully, but these errors were encountered: