[BUGFIX] Llama3.2 fa crash fix #274

Merged
merged 2 commits into from
Nov 13, 2024

Conversation

maleksan85

In the cross-attention case, K and V might be non-contiguous, which was causing an int32 overflow and crashing the FA kernel. In addition, there is no need to trim K and V in this case.
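
To illustrate the kind of overflow described above, here is a minimal pure-Python sketch. It is not the code from this PR. The stride and index values are hypothetical, and `wrap_int32` and `element_offset` are helper names invented for the example. The sketch assumes that the offset of an element is computed as `index * stride`, and that a non-contiguous view can inherit a large stride from its parent buffer, so the product may exceed the int32 range even when the view itself is small.

```python
INT32_MAX = 2**31 - 1


def wrap_int32(value: int) -> int:
    """Emulate two's-complement wraparound of a signed 32-bit integer."""
    value &= 0xFFFFFFFF
    return value - 2**32 if value >= 2**31 else value


def element_offset(index: int, stride: int, use_int64: bool) -> int:
    """Return index * stride, wrapped to int32 unless use_int64 is set."""
    product = index * stride
    return product if use_int64 else wrap_int32(product)


# Hypothetical numbers: a non-contiguous K view whose batch stride
# (in elements) comes from a much larger parent buffer.
batch_stride = 40_000_000
batch_index = 60

offset_64 = element_offset(batch_index, batch_stride, use_int64=True)
offset_32 = element_offset(batch_index, batch_stride, use_int64=False)

print(offset_64 > INT32_MAX)  # the true offset does not fit in int32
print(offset_32 < 0)          # the wrapped offset points before the buffer
```

With 64-bit arithmetic the offset stays correct, while the wrapped 32-bit value becomes negative, which in a real kernel would turn into an out-of-bounds memory access.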

@maleksan85 maleksan85 requested a review from gshtras November 12, 2024 23:26
@maleksan85 maleksan85 changed the base branch from main to develop November 12, 2024 23:30
@gshtras gshtras merged commit 9a46e97 into develop Nov 13, 2024
1 check passed
@gshtras gshtras deleted the llama3.2_FA_crash_fix branch November 13, 2024 17:02
maleksan85 added a commit that referenced this pull request Nov 13, 2024
Co-authored-by: Aleksandr Malyshev <[email protected]>
(cherry picked from commit 9a46e97)
maleksan85 added a commit that referenced this pull request Nov 13, 2024
Co-authored-by: Aleksandr Malyshev <[email protected]>
(cherry picked from commit 9a46e97)
maleksan85 added a commit that referenced this pull request Nov 19, 2024
Co-authored-by: Aleksandr Malyshev <[email protected]>
(cherry picked from commit 9a46e97)
shajrawi pushed a commit that referenced this pull request Dec 3, 2024
* corrected types for strides in triton FA (#274) (#276)

Co-authored-by: Aleksandr Malyshev <[email protected]>
(cherry picked from commit 9a46e97)

* fused_moe configs for MI325X

New fused_moe configs for Mixtral-8x7B and Mixtral-8x22B with
TP=1,2,4,8 for both FP8 and FP16 on the recently announced MI325X.

---------

Co-authored-by: Aleksandr Malyshev <[email protected]>
gshtras pushed a commit that referenced this pull request Dec 9, 2024
Co-authored-by: Aleksandr Malyshev <[email protected]>
(cherry picked from commit 9a46e97)