This repository has been archived by the owner on Jun 24, 2024. It is now read-only.
Better generation stats #331
Labels
meta:help-wanted
Extra attention is needed
meta:maintenance
Changes that will make it easier for us to maintain code
topic:api-design
API design considerations, including new functionality and changes
I'm currently facing an issue where the generation on a gpu sometimes slows down and its very hard to determine why. (see #325)
It would be great if we could have an option to get more detailed information from the generation process. Maybe we could divide the per token times into the following categories:
evaluate
function of the modelThe text was updated successfully, but these errors were encountered: