[CUDA] Fix NumericLimits #22738

tianleiwu · 2024-11-05T21:56:18Z

Fix NumericLimits<float> that used infinity as max, which is not consistent with std::numeric_limits<float>::max()
In Windows, (float)(1e+300) is used for INFINITY, which causes compiler error in Visual Studio 2022 v17.12 Preview 5.
Rename NumericLimits<T>::Min to Lowest to be consistent with std::numeric_limits
Fix topk implementation: use NumericLimits<CudaT> instead of NumericLimits<T> in kernel. That could avoid defining a confusing defintion of NumericLimits<MLFloat16> that returns half instead of MLFloat16.
Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly, which is faster than converting float to half.

Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3 and __nv_fp8_e5m2 right now.

Fix NumericLimits

ce702b4

tianleiwu marked this pull request as draft November 5, 2024 22:16

refine

d27100f

tianleiwu marked this pull request as ready for review November 6, 2024 00:27

tianleiwu requested review from yufenglee and snnn November 6, 2024 01:22

snnn approved these changes Nov 6, 2024

View reviewed changes

tianleiwu merged commit d993ec3 into main Nov 6, 2024
91 checks passed

tianleiwu deleted the tlwu/fix_numeric_limits branch November 6, 2024 17:53

Provide feedback