The user can optionally pass a suppressed_tokens arg to GPT-2/BART's generate() function. We will set the probability of these tokens to 0 (i.e., set their logits to -infinity) so that they are never generated. This is very useful for avoiding special tokens (like the BOS token) during generation.
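For concreteness, the masking itself amounts to something like the following (a minimal standalone NumPy sketch of the idea, not KerasNLP code; suppress is a hypothetical helper name):

```python
import numpy as np

def suppress(logits, suppressed_tokens):
    # Setting a logit to -inf gives that token a softmax probability of
    # exactly 0, so it can never be sampled or chosen greedily.
    logits = logits.copy()
    logits[..., suppressed_tokens] = -np.inf
    return logits

logits = np.array([2.0, 1.0, 0.5])
masked = suppress(logits, [0])
probs = np.exp(masked) / np.exp(masked).sum()
print(probs)  # token 0 now has probability 0.0
```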
I think the general problem we need to think about is how we transform logit outputs. We need to be able to ship default transformations (e.g., for Whisper logit suppression), and potentially allow users a hook to provide their own.
One option is adding more and more sampler config (#978). A few other options to consider:

- Users can pass in an arbitrary logit transformation somewhere, e.g. a logit_transform_fn=None argument (see the sketch after this list).
- Users need to subclass a language model and override a method to do custom logit transformation.
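A minimal sketch of what these two options might look like; logit_transform_fn and transform_logits are hypothetical names used here for illustration, not existing KerasNLP API:

```python
import tensorflow as tf
import keras_nlp

def make_suppress_fn(suppressed_ids):
    """Build a logit transform that masks out the given token ids."""
    def transform(logits):
        # Scatter -inf onto the suppressed positions so that softmax
        # assigns them zero probability at every decoding step.
        vocab_size = tf.shape(logits)[-1]
        mask = tf.reduce_sum(
            tf.one_hot(
                suppressed_ids,
                depth=vocab_size,
                on_value=float("-inf"),
                off_value=0.0,
            ),
            axis=0,
        )
        return logits + mask
    return transform

# Option 1 (hypothetical argument name):
# gpt2_lm.generate("prompt", logit_transform_fn=make_suppress_fn([0]))

# Option 2 (hypothetical override point the generation loop would call
# on each decoding step):
class NoBosGPT2(keras_nlp.models.GPT2CausalLM):
    def transform_logits(self, logits):
        return make_suppress_fn([0])(logits)
```

The hook approach keeps generate() as the single entry point for one-off tweaks, while the subclass approach keeps generate()'s signature small at the cost of requiring a documented override point.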