Add f8E4M3 and f8E3M4 types support #2482

apivovarov · 2024-08-09T00:35:21Z

This PR adds f8E4M3 and f8E3M4 types support.

f8E4M3 and f8E3M4 types follow IEEE 754 convention.

f8E4M3 (IEEE 754)
- Exponent bias: 7
- Maximum stored exponent value: 14 (binary 1110)
- Maximum unbiased exponent value: 14 - 7 = 7
- Minimum stored exponent value: 1 (binary 0001)
- Minimum unbiased exponent value: 1 − 7 = −6
- Precision specifies the total number of bits used for the significand (mantisa), 
    including implicit leading integer bit = 3 + 1 = 4
- Follows IEEE 754 conventions for representation of special values
- Has Positive and Negative zero
- Has Positive and Negative infinity
- Has NaNs

Additional details:
- Max exp (unbiased): 7
- Min exp (unbiased): -6
- Infinities (+/-): S.1111.000
- Zeros (+/-): S.0000.000
- NaNs: S.1111.{001, 010, 011, 100, 101, 110, 111}
- Max normal number: S.1110.111 = +/-2^(7) x (1 + 0.875) = +/-240
- Min normal number: S.0001.000 = +/-2^(-6)
- Max subnormal number: S.0000.111 = +/-2^(-6) x 0.875 = +/-2^(-9) x 7
- Min subnormal number: S.0000.001 = +/-2^(-6) x 0.125 = +/-2^(-9)

f8E3M4 (IEEE 754)
- Exponent bias: 3
- Maximum stored exponent value: 6 (binary 110)
- Maximum unbiased exponent value: 6 - 3 = 3
- Minimum stored exponent value: 1 (binary 001)
- Minimum unbiased exponent value: 1 − 3 = −2
- Precision specifies the total number of bits used for the significand (mantissa), 
    including implicit leading integer bit = 4 + 1 = 5
- Follows IEEE 754 conventions for representation of special values
- Has Positive and Negative zero
- Has Positive and Negative infinity
- Has NaNs

Additional details:
- Max exp (unbiased): 3
- Min exp (unbiased): -2
- Infinities (+/-): S.111.0000
- Zeros (+/-): S.000.0000
- NaNs: S.111.{0,1}⁴ except S.111.0000
- Max normal number: S.110.1111 = +/-2^(6-3) x (1 + 15/16) = +/-2^3 x 31 x 2^(-4) = +/-15.5
- Min normal number: S.001.0000 = +/-2^(1-3) x (1 + 0) = +/-2^(-2)
- Max subnormal number: S.000.1111 = +/-2^(-2) x 15/16 = +/-2^(-2) x 15 x 2^(-4) = +/-15 x 2^(-6)
- Min subnormal number: S.000.0001 = +/-2^(-2) x 1/16 =  +/-2^(-2) x 2^(-4) = +/-2^(-6)

Related PRs:

LLVM PR-97179 [APFloat] Add support for f8E4M3 IEEE 754 type (Merged)
LLVM PR-97118 [MLIR] Add f8E4M3 IEEE 754 type (Merged)
LLVM PR-99698 [APFloat] Add support for f8E3M4 IEEE 754 type (Merged)
LLVM PR-101230 [MLIR] Add f8E3M4 IEEE 754 type (Merged)
StableHLO PR-2486 [RFC] Add f8E4M3 and f8E3M4 types support
ml_dtypes PR-161 Add float8_e4m3 (Merged)
ml_dtypes PR-171 Add float8_e3m4 (Merged)
XLA PR-16585 Add support for float8_e4m3

GleasonK · 2024-08-09T15:08:30Z

Hello! Could you convert the PR description into a MD file in rfcs/, similar to:
https://github.com/GleasonK/stablehlo/blob/main/rfcs/20230321-fp8_fnuz.md

Then we can share that RFC on OpenXLA Discuss. Opset / type changes require an RFC / short waiting period (~2w, unless very non-controversial) prior to merge.

apivovarov · 2024-08-09T21:51:38Z

Opened - StableHLO PR-2486 [RFC] Add f8E4M3 and f8E3M4 types support

### Summary This is a proposal to add `Float8E4M3` and `Float8E3M4` floating point types to StableHLO. Feedback welcome, see [RFC: Float8E4M3 and Float8E3M4](https://github.com/apivovarov/stablehlo/blob/rfc_f8E4M3_f8E3M4/rfcs/20240808-f8E4M3_f8E3M4.md) for more details. ### References and Links - LLVM [PR-97179](llvm/llvm-project#97179) [APFloat] Add support for f8E4M3 IEEE 754 type (Merged) - LLVM [PR-97118](llvm/llvm-project#97118) [MLIR] Add f8E4M3 IEEE 754 type (Merged) - LLVM [PR-99698](llvm/llvm-project#99698) [APFloat] Add support for f8E3M4 IEEE 754 type (Merged) - LLVM [PR-101230](llvm/llvm-project#101230) [MLIR] Add f8E3M4 IEEE 754 type (Merged) - [RFC: FP8 in StableHLO](https://github.com/openxla/stablehlo/blob/main/rfcs/20221031-fp8.md) - [RFC: Float8E4M3FNUZ and Float8E5M2FNUZ](https://github.com/openxla/stablehlo/blob/main/rfcs/20230321-fp8_fnuz.md) - StableHLO [PR-2482](#2482) Add f8E4M3 and f8E3M4 types support - [Amazon EC2 Trn1 Instances](https://aws.amazon.com/ec2/instance-types/trn1/) - ml_dtypes [PR-161](jax-ml/ml_dtypes#161) Add float8_e4m3 (Merged) - ml_dtypes [PR-171](jax-ml/ml_dtypes#171) Add float8_e3m4 (Merged) - XLA [PR-16585](openxla/xla#16585) Add support for float8_e4m3

stablehlo/tests/vhlo/stablehlo_legalize_to_vhlo.mlir

stablehlo/dialect/VhloBytecode.cpp

apivovarov force-pushed the f8E4M3_f8E3M4 branch from 08b0641 to 2707975 Compare August 9, 2024 00:39

apivovarov force-pushed the f8E4M3_f8E3M4 branch 2 times, most recently from 90631ba to cf35625 Compare August 9, 2024 18:23

apivovarov mentioned this pull request Aug 9, 2024

[RFC] Add f8E4M3 and f8E3M4 types support #2486

Merged

apivovarov force-pushed the f8E4M3_f8E3M4 branch from cf35625 to 4433e5b Compare August 9, 2024 22:58

apivovarov force-pushed the f8E4M3_f8E3M4 branch 4 times, most recently from 8b45661 to 5861863 Compare August 27, 2024 21:00

apivovarov mentioned this pull request Aug 28, 2024

Add support for float8_e4m3 and float8_e3m4 types openxla/xla#16585

Open

apivovarov force-pushed the f8E4M3_f8E3M4 branch from 3800a6b to 356dc4b Compare August 30, 2024 00:27

GleasonK approved these changes Sep 3, 2024

View reviewed changes

stablehlo/tests/vhlo/stablehlo_legalize_to_vhlo.mlir Show resolved Hide resolved

GleasonK reviewed Sep 3, 2024

View reviewed changes

stablehlo/dialect/VhloBytecode.cpp Show resolved Hide resolved

Add f8E4M3 and f8E3M4 types support

679bfdd

apivovarov force-pushed the f8E4M3_f8E3M4 branch from 526cebc to 679bfdd Compare September 3, 2024 20:28

GleasonK approved these changes Sep 4, 2024

View reviewed changes

GleasonK merged commit ed8c91e into openxla:main Sep 4, 2024
10 checks passed

apivovarov mentioned this pull request Sep 12, 2024

Add float8_e4m3 and float8_e3m4 types support jax-ml/jax#23585

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add f8E4M3 and f8E3M4 types support #2482

Add f8E4M3 and f8E3M4 types support #2482

apivovarov commented Aug 9, 2024 •

edited

Loading

GleasonK commented Aug 9, 2024

apivovarov commented Aug 9, 2024 •

edited

Loading

Add f8E4M3 and f8E3M4 types support #2482

Add f8E4M3 and f8E3M4 types support #2482

Conversation

apivovarov commented Aug 9, 2024 • edited Loading

GleasonK commented Aug 9, 2024

apivovarov commented Aug 9, 2024 • edited Loading

apivovarov commented Aug 9, 2024 •

edited

Loading

apivovarov commented Aug 9, 2024 •

edited

Loading