use fast softmax only on prefill #1159

jaygala223 · 2024-07-25T09:49:26Z

use fast softmax only on prefill

yafshar · 2024-07-26T14:03:19Z

@jaygala223 can you point to the source for this change? or is there an issue? From literature, fast softmax can be used in prefill and can be beneficial in any context where the softmax operation is a computational bottleneck.

yafshar · 2024-07-29T16:13:29Z

If there is an accuracy issue raised by using fast softmax, can you point out to the issue, or bring it up here for the reference. It could be beneficial for other models and prevent others to root causing in similar cases.

use fast softmax only on prefill huggingface#1159

jaygala223 · 2024-08-02T08:48:15Z

@yafshar apologies for the delayed response. There was a performance regression which got introduced in a patch revert for context manager. This PR fixes it

yafshar · 2024-08-02T11:54:41Z

@yafshar apologies for the delayed response. There was a performance regression which got introduced in a patch revert for context manager. This PR fixes it

@jaygala223 thanks, does this affect other models? Should we consider the same change for other models?

yafshar

LGTM!

@regisss please look at this PR

yafshar · 2024-08-02T12:10:24Z

@libinta please correct the PR label

jaygala223 · 2024-08-02T12:13:33Z

@jaygala223 thanks, does this affect other models? Should we consider the same change for other models?

No, it does not affect other models

regisss · 2024-08-04T17:34:44Z

Please run make style

HuggingFaceDocBuilderDev · 2024-08-04T17:35:02Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

use fast softmax only on prefill (HabanaAI#306)

b3c450a

jaygala223 requested review from mandy-li, libinta and dvarshney-habana as code owners July 25, 2024 09:49

dvarshney-habana approved these changes Jul 25, 2024

View reviewed changes

libinta added the synapse1.17 PR that should be available along with Synapse 1.17 but have no dependency on Synapse 1.17 content. label Jul 26, 2024

astachowiczhabana mentioned this pull request Jul 29, 2024

reland - use fast softmax only on prefill HabanaAI/optimum-habana-fork#306

Merged

3 tasks

vidyasiv pushed a commit to emascarenhas/optimum-habana that referenced this pull request Aug 1, 2024

Merge branch 'jay-oh-softmax' into syn1.17tr4.43

ef9ad51

use fast softmax only on prefill huggingface#1159

vidyasiv added a commit to emascarenhas/optimum-habana that referenced this pull request Aug 2, 2024

Merge branch '1159' into syn1.17tr4.43

6a0993f

use fast softmax only on prefill huggingface#1159

yafshar approved these changes Aug 2, 2024

View reviewed changes

yafshar mentioned this pull request Aug 2, 2024

Use fast softmax only on prefill #1180

Closed

libinta added the run-test Run CI for PRs from external contributors label Aug 4, 2024

regisss approved these changes Aug 4, 2024

View reviewed changes

astachowiczhabana mentioned this pull request Aug 5, 2024

reland - use fast softmax only on prefill HabanaAI/optimum-habana-fork#311

Merged

3 tasks

make style

d338f55

regisss approved these changes Aug 5, 2024

View reviewed changes

regisss merged commit fb72aac into huggingface:main Aug 5, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use fast softmax only on prefill #1159

use fast softmax only on prefill #1159

jaygala223 commented Jul 25, 2024

yafshar commented Jul 26, 2024 •

edited

Loading

yafshar commented Jul 29, 2024

jaygala223 commented Aug 2, 2024

yafshar commented Aug 2, 2024

yafshar left a comment

yafshar commented Aug 2, 2024

jaygala223 commented Aug 2, 2024

regisss commented Aug 4, 2024

HuggingFaceDocBuilderDev commented Aug 4, 2024

use fast softmax only on prefill #1159

use fast softmax only on prefill #1159

Conversation

jaygala223 commented Jul 25, 2024

yafshar commented Jul 26, 2024 • edited Loading

yafshar commented Jul 29, 2024

jaygala223 commented Aug 2, 2024

yafshar commented Aug 2, 2024

yafshar left a comment

Choose a reason for hiding this comment

yafshar commented Aug 2, 2024

jaygala223 commented Aug 2, 2024

regisss commented Aug 4, 2024

HuggingFaceDocBuilderDev commented Aug 4, 2024

yafshar commented Jul 26, 2024 •

edited

Loading