Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add FP8 rocblas gemm support #2473

Merged
merged 97 commits into from
Dec 6, 2023
Merged

Add FP8 rocblas gemm support #2473

merged 97 commits into from
Dec 6, 2023

Conversation

umangyadav
Copy link
Member

Depends on #2455

  • Checks if BETA API for FP8 rocblas gemm exists.
  • If it does and if the hardware is supported (>=gfx940) then calls into FP8 API.
  • If hardware is not supported or BETA API doesn't exist then it uses eliminate_fp8 pass that adds converts around unsupported operator to emulate them into Float and convert back result to FP8.
  • Adds verify tests for FP8
  • tested this on gfx942, it passes verification.

src/targets/gpu/target.cpp Outdated Show resolved Hide resolved
src/eliminate_data_type.cpp Outdated Show resolved Hide resolved
src/eliminate_data_type.cpp Outdated Show resolved Hide resolved
src/eliminate_data_type.cpp Outdated Show resolved Hide resolved
@umangyadav umangyadav requested a review from pfultz2 December 5, 2023 15:05
@TedThemistokleous
Copy link
Collaborator

If we're not concerned about the hipRTC because thats expected right now. run looks good.

@umangyadav
Copy link
Member Author

Opend issue #2514 for the CK failures.

CK failure is not related to this PR. therefore disabling those for now.

@causten causten merged commit 6d0b6bc into develop Dec 6, 2023
44 checks passed
@causten causten deleted the rocblas_fp8 branch December 6, 2023 01:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
FP8 issues related to FP8 implemenation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants