Delete FP8 Scaling Factors in GEMM Rewriter #16841

philipphack · 2024-09-05T20:34:19Z

Removes the scaling factors of C and D (matrix bias and result) from FP8 Custom Calls created in the GEMM rewriter when their data types are not FP8. See #15795.
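As an illustration only (this is not code from the PR), the rule described above can be sketched as a small standalone C++ model. The names `DType`, `IsFp8`, and `ScaleOperands` are hypothetical and are not XLA identifiers. The sketch assumes the scales of A and B are always kept and that the scales of C and D are kept only when the corresponding type is FP8.

```cpp
#include <cassert>  // for assert-based checks when trying the sketch out
#include <string>
#include <vector>

// Hypothetical stand-in for an element type; this is not XLA's PrimitiveType.
enum class DType { kF8E4M3FN, kF8E5M2, kBF16, kF16, kF32 };

// True for the two FP8 formats modeled in this sketch.
bool IsFp8(DType t) {
  return t == DType::kF8E4M3FN || t == DType::kF8E5M2;
}

// Returns the names of the scale operands a toy FP8 GEMM call would carry.
// Assumption: A and B scales are always present; C and D scales are only
// attached when the matrix bias (C) or the result (D) is itself FP8.
std::vector<std::string> ScaleOperands(DType c_type, DType d_type) {
  std::vector<std::string> scales = {"a_scale", "b_scale"};
  if (IsFp8(c_type)) scales.push_back("c_scale");
  if (IsFp8(d_type)) scales.push_back("d_scale");
  return scales;
}
```

Under these assumptions, `ScaleOperands(DType::kBF16, DType::kBF16)` yields only `a_scale` and `b_scale`, while an FP8 result type adds `d_scale`.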

reedwm · 2024-09-26T22:47:55Z

xla/service/gpu/transforms/gemm_rewriter.cc

+      one = instr->AddInstruction(
+          HloInstruction::CreateConstant(LiteralUtil::One(F32)));


Why not unconditionally create this, as was done before? Even before, it's possible this was unused. I worry this may be accidentally used without being initialized in the future.
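To make the trade-off in this comment concrete, here is a minimal standalone sketch, not taken from the PR and not using XLA types. `Node`, `Graph`, and `LazyOne` are hypothetical names. It shows one possible middle ground between the two options discussed: the constant is created on first use, so it is never added when unused and can never be read while still null.

```cpp
#include <cassert>  // for assert-based checks when trying the sketch out
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical stand-in for an instruction; not HloInstruction.
struct Node {
  float value;
};

// Hypothetical owner of nodes, loosely analogous to a computation.
class Graph {
 public:
  Node* AddConstant(float v) {
    nodes_.push_back(std::make_unique<Node>(Node{v}));
    return nodes_.back().get();
  }
  std::size_t size() const { return nodes_.size(); }

 private:
  std::vector<std::unique_ptr<Node>> nodes_;
};

// Creates the constant 1.0 on first request and caches it afterwards.
// Callers go through get(), so no code path can observe a null pointer.
class LazyOne {
 public:
  explicit LazyOne(Graph* graph) : graph_(graph) {}

  Node* get() {
    if (one_ == nullptr) one_ = graph_->AddConstant(1.0f);
    return one_;
  }

 private:
  Graph* graph_;
  Node* one_ = nullptr;
};
```

With this pattern the graph stays empty until `get()` is first called, and repeated calls return the same node. Whether such a helper is preferable to creating the constant unconditionally is a judgment call for the reviewers.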

Imported from GitHub PR #16841
Removes the scaling factors of C and D (matrix bias and result) from FP8 Custom Calls created in the GEMM rewriter when their data types are not FP8. See #15795.
Copybara import of the project:
-- fd9750f by Philipp Hack <[email protected]>:
Removes superfluous FP8 scaling factors in GEMM rewriter.
Merging this change closes #16841
FUTURE_COPYBARA_INTEGRATE_REVIEW=#16841 from philipphack:u_fp8_scales_xla fd9750f
PiperOrigin-RevId: 679766659

Imported from GitHub PR openxla/xla#16841
Removes the scaling factors of C and D (matrix bias and result) from FP8 Custom Calls created in the GEMM rewriter when their data types are not FP8. See openxla/xla#15795.
Copybara import of the project:
-- fd9750fa8474fe72fe641c7b3bc005ff30396e0a by Philipp Hack <[email protected]>:
Removes superfluous FP8 scaling factors in GEMM rewriter.
Merging this change closes #16841
FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#16841 from philipphack:u_fp8_scales_xla fd9750fa8474fe72fe641c7b3bc005ff30396e0a
PiperOrigin-RevId: 679766659

Imported from GitHub PR openxla/xla#16841
Removes the scaling factors of C and D (matrix bias and result) from FP8 Custom Calls created in the GEMM rewriter when their data types are not FP8. See openxla/xla#15795.
Copybara import of the project:
-- fd9750fa8474fe72fe641c7b3bc005ff30396e0a by Philipp Hack <[email protected]>:
Removes superfluous FP8 scaling factors in GEMM rewriter.
Merging this change closes #16841
PiperOrigin-RevId: 679784586

openxla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.

Imported from GitHub PR #18062
#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Copybara import of the project:
-- be4da5b by Harsha HS <[email protected]>:
[ROCm] Fix gemm_rewriter_test for AMD GCN Arch
#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Merging this change closes #18062
FUTURE_COPYBARA_INTEGRATE_REVIEW=#18062 from ROCm:ci_fix_gemm_rewriter_fp8_tests_20241008 be4da5b
PiperOrigin-RevId: 685632239

Imported from GitHub PR openxla/xla#18062
openxla/xla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Copybara import of the project:
-- be4da5b8de0785d43e18dbdb0773307870084e32 by Harsha HS <[email protected]>:
[ROCm] Fix gemm_rewriter_test for AMD GCN Arch
openxla/xla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Merging this change closes #18062
FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#18062 from ROCm:ci_fix_gemm_rewriter_fp8_tests_20241008 be4da5b8de0785d43e18dbdb0773307870084e32
PiperOrigin-RevId: 685632239

Imported from GitHub PR #18062
#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Copybara import of the project:
-- be4da5b by Harsha HS <[email protected]>:
[ROCm] Fix gemm_rewriter_test for AMD GCN Arch
#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Merging this change closes #18062
FUTURE_COPYBARA_INTEGRATE_REVIEW=#18062 from ROCm:ci_fix_gemm_rewriter_fp8_tests_20241008 be4da5b
PiperOrigin-RevId: 688022433

Imported from GitHub PR openxla/xla#18062
openxla/xla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Copybara import of the project:
-- be4da5b8de0785d43e18dbdb0773307870084e32 by Harsha HS <[email protected]>:
[ROCm] Fix gemm_rewriter_test for AMD GCN Arch
openxla/xla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Merging this change closes #18062
FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#18062 from ROCm:ci_fix_gemm_rewriter_fp8_tests_20241008 be4da5b8de0785d43e18dbdb0773307870084e32
PiperOrigin-RevId: 688022433

Imported from GitHub PR #18062
#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Copybara import of the project:
-- be4da5b by Harsha HS <[email protected]>:
[ROCm] Fix gemm_rewriter_test for AMD GCN Arch
#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Merging this change closes #18062
COPYBARA_INTEGRATE_REVIEW=#18062 from ROCm:ci_fix_gemm_rewriter_fp8_tests_20241008 be4da5b
PiperOrigin-RevId: 688034088

Imported from GitHub PR openxla/xla#18062
openxla/xla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Copybara import of the project:
-- be4da5b8de0785d43e18dbdb0773307870084e32 by Harsha HS <[email protected]>:
[ROCm] Fix gemm_rewriter_test for AMD GCN Arch
openxla/xla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.
Merging this change closes #18062
PiperOrigin-RevId: 688034088

reedwm self-requested a review September 6, 2024 00:04

NaiyerRizz self-assigned this Sep 6, 2024

reedwm requested changes Sep 26, 2024

Removes superfluous FP8 scaling factors in GEMM rewriter.

fd9750f

philipphack force-pushed the u_fp8_scales_xla branch from 2ef615e to fd9750f Compare September 27, 2024 23:20

reedwm approved these changes Sep 27, 2024

copybara-service bot mentioned this pull request Sep 27, 2024

PR #16841: Delete FP8 Scaling Factors in GEMM Rewriter #17731

Merged

copybara-service bot mentioned this pull request Sep 27, 2024

PR #16841: Delete FP8 Scaling Factors in GEMM Rewriter tensorflow/tensorflow#76693

Merged

copybara-service bot closed this in 3c5c920 Sep 28, 2024

hsharsha added a commit to ROCm/xla that referenced this pull request Oct 8, 2024

[ROCm] Fix gemm_rewriter_test for AMD GCN Arch

85619f7

openxla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.

hsharsha mentioned this pull request Oct 8, 2024

[ROCm] Fix gemm_rewriter_test for AMD GCN Arch #18062

Closed

hsharsha added a commit to ROCm/xla that referenced this pull request Oct 8, 2024

[ROCm] Fix gemm_rewriter_test for AMD GCN Arch

be4da5b

openxla#16841 removes scaling factor constants in gemm_rewriter for FP8 data types. This patch addresses the same in the gemm_rewriter_test.

copybara-service bot mentioned this pull request Oct 14, 2024

PR #18062: [ROCm] Fix gemm_rewriter_test for AMD GCN Arch #18275

Open

copybara-service bot mentioned this pull request Oct 14, 2024

PR #18062: [ROCm] Fix gemm_rewriter_test for AMD GCN Arch tensorflow/tensorflow#77851

Draft

copybara-service bot mentioned this pull request Oct 21, 2024

PR #18062: [ROCm] Fix gemm_rewriter_test for AMD GCN Arch #18533

Merged

copybara-service bot mentioned this pull request Oct 21, 2024

PR #18062: [ROCm] Fix gemm_rewriter_test for AMD GCN Arch tensorflow/tensorflow#78418

Merged
