
FP8 2D forward convolution using rocMLIR #2507

Merged
merged 128 commits into from
Dec 7, 2023
Conversation

umangyadav
Member

@umangyadav umangyadav commented Dec 3, 2023

FP8 convolutions that use rocMLIR.

Uses rocMLIR at SHA 5085343bca363109ae9ebabb7ca2b65c52bc861c.

ROCm/rocMLIR#1336

The regular convolution takes both inputs as FP8 and generates FP8 output. Internally, the hardware accumulates in FP32, and the final result is downcast back to FP8.

quant_convolution takes both inputs as FP8 and generates FP32 output. This version can use the QDQ quantization scheme, applying scales to downcast the FP32 output to FP8.
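As a rough illustration of the QDQ downcast step described above: divide the FP32 output by a scale, saturate to the FP8 dynamic range, and rescale. This is a hypothetical sketch, not MIGraphX or rocMLIR code; it assumes the OCP e4m3 format (max finite value 448) and only models saturation, not actual FP8 rounding, which the hardware performs.

```python
import numpy as np

# Max finite magnitude of OCP e4m3 FP8 (assumption for illustration;
# AMD's e4m3fnuz variant has a different range).
FP8_E4M3_MAX = 448.0

def qdq_downcast(fp32_out: np.ndarray, scale: float) -> np.ndarray:
    """Simplified QDQ: scale FP32 values toward FP8 range, saturate,
    then rescale back. Real FP8 rounding is omitted."""
    scaled = fp32_out / scale
    saturated = np.clip(scaled, -FP8_E4M3_MAX, FP8_E4M3_MAX)
    return saturated * scale

x = np.array([1000.0, -3.5, 0.25], dtype=np.float32)
print(qdq_downcast(x, scale=2.0))  # 1000/2 = 500 saturates to 448 -> 896.0
```

A well-chosen scale keeps most values inside the representable range so only outliers saturate.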

rocMLIR FP8 convolutions are limited to 2D forward convolutions, so tests have been added only for those.
Tests for 3D convolutions, 1D convolutions, and backward convolutions (transposed convolutions) are therefore not enabled.

Convert fusion with mlir-convolution does not compile on non-MI300 hardware, so it has been disabled.

For now, I've kept both versions of the convolution (quant and regular). If there turns out to be no use for the regular FP8 convolution, it can be removed later.

Testing: make check passes on MI300.

depends on #2473

@migraphx-bot
Collaborator

migraphx-bot commented Dec 3, 2023


     ✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance

     ✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance

     ✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance

     ✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance

     ✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance

     ✅ torchvision-inceptionv3_1: PASSED: MIGraphX meets tolerance

     ✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance

     ✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance

     ✅ slim-vgg16_1: PASSED: MIGraphX meets tolerance

     ✅ slim-mobilenet_1: PASSED: MIGraphX meets tolerance

     ✅ slim-inceptionv4_1: PASSED: MIGraphX meets tolerance

     ✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance

     ✅ agentmodel: PASSED: MIGraphX meets tolerance

     ✅ unet: PASSED: MIGraphX meets tolerance

     ✅ resnet50v1: PASSED: MIGraphX meets tolerance

🔴bert_base_cased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output


     ✅ bert_large_uncased_fp16: PASSED: MIGraphX meets tolerance

     ✅ bert_large: PASSED: MIGraphX meets tolerance

🔴distilgpt2_fp16: FAILED: MIGraphX is not within tolerance - check verbose output

requirements.txt Outdated
@umangyadav umangyadav changed the base branch from develop to rocblas_fp8 December 6, 2023 00:07
Base automatically changed from rocblas_fp8 to develop December 6, 2023 01:20
@TedThemistokleous
Collaborator

LGTM

@causten causten merged commit 6a72e8f into develop Dec 7, 2023
44 checks passed
@causten causten deleted the rocblas_mlir_fp8 branch December 7, 2023 02:49
Labels
FP8 issues related to FP8 implementation
Projects
None yet
Development


5 participants