Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve performance of MHA #62

Merged
merged 1 commit into from
Sep 21, 2023
Merged

Improve performance of MHA #62

merged 1 commit into from
Sep 21, 2023

Conversation

cbalioglu
Copy link
Contributor

This PR refactors the implementation of MHA to make it more efficient. With the changes in this PR, LLaMA 7B inference is about ~%10 faster than the official LLaMA implementation.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 21, 2023
@cbalioglu cbalioglu merged commit a012f91 into main Sep 21, 2023
17 checks passed
@cbalioglu cbalioglu deleted the mha branch September 21, 2023 18:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants