Better generation stats #331

LLukas22 · 2023-06-25T15:20:39Z

I'm currently facing an issue where the generation on a gpu sometimes slows down and its very hard to determine why. (see #325)

It would be great if we could have an option to get more detailed information from the generation process. Maybe we could divide the per token times into the following categories:

Forward pass: Raw time spend in the evaluate function of the model
Sampler: Time spend sampling the tokens
Decoding: Time taken by the tokenizer to decode the tokens
Printing: Time spend invoking the callback and printing to the CLI

The text was updated successfully, but these errors were encountered:

jafioti · 2023-06-25T17:39:27Z

It would also be helpful to see the max and min time of each category, alongside the mean

philpax · 2023-06-25T20:31:14Z

Sounds good to me, would anyone be interested in doing this?

LLukas22 · 2023-06-26T11:48:34Z

I could give it a try but im still kinda bussy with the CUDA/OpenCL stuff and i have no idea how i would implement performance metrics and loggin correctly in rust 😬

philpax · 2023-06-26T23:27:13Z

You can probably just use std::time::Instant - it should be precise enough for this application. Just create some Instants at each measurement point, then call .elapsed() on them to find the amount of time that has passed since that instant.

LLukas22 added meta:help-wanted Extra attention is needed meta:maintenance Changes that will make it easier for us to maintain code topic:api-design API design considerations, including new functionality and changes labels Jun 25, 2023

LLukas22 mentioned this issue Jul 14, 2023

feat(tracing): add tracing to llm and llm-base crates #367

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better generation stats #331

Better generation stats #331

LLukas22 commented Jun 25, 2023

jafioti commented Jun 25, 2023

philpax commented Jun 25, 2023

LLukas22 commented Jun 26, 2023

philpax commented Jun 26, 2023

Better generation stats #331

Better generation stats #331

Comments

LLukas22 commented Jun 25, 2023

jafioti commented Jun 25, 2023

philpax commented Jun 25, 2023

LLukas22 commented Jun 26, 2023

philpax commented Jun 26, 2023