Not a problem - but like people should know #26

Atlas3DSS · 2023-03-26T22:07:33Z

https://arxiv.org/abs/2303.11366 Is a really cool paper about reflection in LLMs

That is after training on like 20 samples for 50 epochs on my 3090 on the 7B model.

User: [Topic or question]

Assistant Hypothetical Response: [Brief or simplified answer to the topic or question]

Agent Reflection: [Critique of the hypothetical response, highlighting the limitations, inaccuracies, or areas that need improvement or expansion, while providing guidance on how to address these issues in the revised response]

Bot Actual Response: [The natural and contextually appropriate answer to the topic or question, as generated by the advanced language model, which incorporates the suggestions and improvements from the agent reflection for a more comprehensive and accurate response]

This + training sets generated with this frame work seem to really improve the generations of these models with fairly limited training sets. Just thought i would share.

lxe · 2023-03-28T16:15:09Z

Nice to see you can get this from such a small sample set!

Atlas3DSS · 2023-03-28T18:04:24Z

I have been keeping track of my datasets if anyone else wants to play they are here
https://docs.google.com/spreadsheets/d/1QSwJFiyzUQ6H1CloDmJWcHJfYiT7SVxfwBDOOcbvFEo/edit?usp=sharing

Thank you again for making this lovely tool.

lxe added the documentation Improvements or additions to documentation label Mar 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Not a problem - but like people should know #26

Not a problem - but like people should know #26

Atlas3DSS commented Mar 26, 2023

lxe commented Mar 28, 2023

Atlas3DSS commented Mar 28, 2023

Not a problem - but like people should know #26

Not a problem - but like people should know #26

Comments

Atlas3DSS commented Mar 26, 2023

lxe commented Mar 28, 2023

Atlas3DSS commented Mar 28, 2023