[DRAFT] Generation refactor #1425

mattdangerw · 2024-02-06T22:01:36Z

Move the compiled while loop to the task base class.
Move as much common generative code to the task base classes.
Expose separate prefill() and decode() functions, which can be overridden from subclass.

This will preserve all high level usages (generate(), compile(sampler="top-k"), etc), and the way to subclass a Sampler. However it will break compat on the way you have to call a sampler--that's kinda the point of the pr. Should be an improvement overall, but definitely a friction there.

Will continue to flesh this out and add a colab demo.

We will update our samplers in the near future to push the backend specific compilation details out: keras-team#1425 Also in general, we want our documentation to reflect the main usage of our classes, which is using them with Seq2SeqLM and CausalLM classes. So with that in mind, this updates our sampler docs to show the practical usage of the sampling classes with our modeling classes. For the base class, we show the main use case of overriding the `get_next_token()` function.

We will update our samplers in the near future to push the backend specific compilation details out: #1425 Also in general, we want our documentation to reflect the main usage of our classes, which is using them with Seq2SeqLM and CausalLM classes. So with that in mind, this updates our sampler docs to show the practical usage of the sampling classes with our modeling classes. For the base class, we show the main use case of overriding the `get_next_token()` function.

We will update our samplers in the near future to push the backend specific compilation details out: keras-team#1425 Also in general, we want our documentation to reflect the main usage of our classes, which is using them with Seq2SeqLM and CausalLM classes. So with that in mind, this updates our sampler docs to show the practical usage of the sampling classes with our modeling classes. For the base class, we show the main use case of overriding the `get_next_token()` function.

divyashreepathihalli · 2024-05-01T00:24:45Z

keras_nlp/models/causal_lm.py

@@ -373,11 +543,11 @@ def postprocess(x):
                inputs = inputs.prefetch(tf.data.AUTOTUNE)
            else:
                # Fast path for non-dataset, single-batch input.
-                inputs = [preprocess(x) for x in inputs]


this is for list inputs correct?

We will update our samplers in the near future to push the backend specific compilation details out: keras-team/keras-hub#1425 Also in general, we want our documentation to reflect the main usage of our classes, which is using them with Seq2SeqLM and CausalLM classes. So with that in mind, this updates our sampler docs to show the practical usage of the sampling classes with our modeling classes. For the base class, we show the main use case of overriding the `get_next_token()` function.

mattdangerw force-pushed the generation-refactor branch 5 times, most recently from 5dce72d to f96b5f2 Compare February 14, 2024 03:12

mattdangerw mentioned this pull request Feb 16, 2024

Update our sampler documentation to reflect usage #1444

Merged

mattdangerw mentioned this pull request Feb 23, 2024

Fix BytePair special tokens tokenization #1447

Closed

mattdangerw mentioned this pull request Mar 11, 2024

Expose Task and Backbone #1506

Merged

mattdangerw force-pushed the generation-refactor branch from f96b5f2 to 52b3d77 Compare April 13, 2024 01:04

divyashreepathihalli reviewed May 1, 2024

View reviewed changes

divyashreepathihalli approved these changes May 1, 2024

View reviewed changes

Generation refactor

071f352

mattdangerw force-pushed the generation-refactor branch from 52b3d77 to 071f352 Compare July 25, 2024 02:21

mattdangerw changed the title ~~Generation refactor~~ [DRAFT] Generation refactor Aug 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DRAFT] Generation refactor #1425

[DRAFT] Generation refactor #1425

mattdangerw commented Feb 6, 2024 •

edited

Loading

divyashreepathihalli May 1, 2024

[DRAFT] Generation refactor #1425

Are you sure you want to change the base?

[DRAFT] Generation refactor #1425

Conversation

mattdangerw commented Feb 6, 2024 • edited Loading

divyashreepathihalli May 1, 2024

Choose a reason for hiding this comment

mattdangerw commented Feb 6, 2024 •

edited

Loading