Switch over logprobs to work with post-sampling #66

Merged (2 commits) on Feb 15, 2024

bdashore3 (Member) commented Feb 12, 2024

Description
Change logprobs to be derived from post-sampling token probabilities instead of pre-sampling raw logits, so the reported percentages match the distribution the token was actually sampled from.
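To illustrate why the two can differ, here is a minimal sketch in plain Python. It is not the exllamav2 sampler: the function names, the temperature and top-k settings, and the example logits are all assumptions chosen for illustration. It compares a softmax over raw logits with the distribution left after temperature scaling and top-k filtering.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def post_sampling_probs(logits, temperature=1.0, top_k=0):
    # Hypothetical stand-in for a sampler pipeline: temperature, then top-k.
    probs = softmax([x / temperature for x in logits])
    if top_k > 0:
        order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
        kept = set(order[:top_k])
        # Zero out everything outside the top-k and renormalize.
        probs = [p if i in kept else 0.0 for i, p in enumerate(probs)]
        total = sum(probs)
        probs = [p / total for p in probs]
    return probs

logits = [2.0, 1.0, 0.5, -1.0]  # made-up values
pre = softmax(logits)
post = post_sampling_probs(logits, temperature=0.7, top_k=2)
```

With these settings the top token's post-sampling probability is higher than its pre-sampling one, and the filtered tokens drop to zero, so logprobs taken from the raw logits would misreport what the sampler actually chose from.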

Also switch the generator to the streaming_ex generation call, which returns its results as a more flexible dictionary.

Blocked on the release of exllamav2 v0.0.13.post2.

Previously, pre-sampling logprobs were used from the raw logits,
but newer versions of exl2 allow for returning token probs post-sampling.
Convert these to logprobs and send to the user.
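The conversion step described in this commit message could look roughly like the sketch below. The helper name `probs_to_logprobs`, the token-to-probability dictionary shape, and the sample values are assumptions for illustration, not code from this PR or from exllamav2.

```python
import math

def probs_to_logprobs(token_probs):
    # Map each token's post-sampling probability to its natural log.
    # A probability of 0 (token filtered out by the sampler) maps to -inf.
    return {
        token: (math.log(p) if p > 0.0 else float("-inf"))
        for token, p in token_probs.items()
    }

# Made-up post-sampling probabilities for three candidate tokens.
top = {"Hello": 0.8, " Hi": 0.2, "Hey": 0.0}
logprobs = probs_to_logprobs(top)
```

Note that `-inf` is not valid in strict JSON, so a server returning these values would need some convention for zero-probability tokens; how this PR handles that case is not stated here.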

Signed-off-by: kingbri <[email protected]>
Update to v0.0.13.post2

Signed-off-by: kingbri <[email protected]>
bdashore3 merged commit ea00a6b into main on Feb 15, 2024
1 check passed
bdashore3 deleted the better-logprobs branch on February 16, 2024 05:38