Fix Clang ASAN issue by handling float to integer overflow in convert operator #3071

nives-vukovic · 2024-05-10T16:24:46Z

No description provided.

nives-vukovic · 2024-05-10T17:01:20Z

Explanation can be found in the link: https://clang.llvm.org/docs/UndefinedBehaviorSanitizer.html
-fsanitize=float-cast-overflow: Conversion to, from, or between floating-point types which would overflow the destination. Because the range of representable values for all floating-point types supported by Clang is [-inf, +inf], the only cases detected are conversions from floating point to integer types.
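
To make the quoted rule concrete, here is a minimal sketch of when a floating-point to integer cast is well defined. The helper name `cast_is_defined` is hypothetical and not part of MIGraphX or Clang. It assumes integer targets of at most 32 bits, so that every bound is exactly representable as a `double`.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <type_traits>

// Hypothetical helper (not MIGraphX or Clang API): returns true when
// static_cast<To>(x) is well defined, i.e. when x truncated toward zero
// is a value of To.
template <class To>
bool cast_is_defined(double x)
{
    static_assert(std::is_integral<To>::value, "To must be an integer type");
    // Assumption of this sketch: bounds of wider types are not all exact in double.
    static_assert(sizeof(To) <= 4, "sketch only covers integers up to 32 bits");
    const double lo = static_cast<double>(std::numeric_limits<To>::min());
    const double hi = static_cast<double>(std::numeric_limits<To>::max());
    // Truncation discards the fraction, so the open interval (lo - 1, hi + 1)
    // is safe. Comparisons with NaN are false, so NaN is rejected too.
    return x > lo - 1.0 && x < hi + 1.0;
}

inline void cast_is_defined_usage()
{
    assert(cast_is_defined<std::uint8_t>(255.9));  // truncates to 255
    assert(!cast_is_defined<std::uint8_t>(256.0)); // would overflow uint8_t
}
```

Values for which this returns false are the ones a float-cast-overflow check would be expected to report when they are cast directly.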

codecov · 2024-05-10T17:34:53Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.92%. Comparing base (30cab64) to head (4572474).
Report is 158 commits behind head on develop.

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #3071   +/-   ##
========================================
  Coverage    91.92%   91.92%           
========================================
  Files          489      489           
  Lines        19275    19278    +3     
========================================
+ Hits         17719    17722    +3     
  Misses        1556     1556


migraphx-bot · 2024-05-10T18:41:31Z

Test	Batch	Rate new 457247	Rate old 30cab6	Diff	Compare
torchvision-resnet50	64	1,713.75	1,713.85	-0.01%	✅
torchvision-resnet50_fp16	64	3,809.82	3,810.00	-0.00%	✅
torchvision-densenet121	32	1,452.84	1,453.61	-0.05%	✅
torchvision-densenet121_fp16	32	2,436.81	2,437.45	-0.03%	✅
torchvision-inceptionv3	32	883.48	883.14	0.04%	✅
torchvision-inceptionv3_fp16	32	1,412.60	1,415.16	-0.18%	✅
cadene-inceptionv4	16	407.14	407.50	-0.09%	✅
cadene-resnext64x4	16	413.67	413.59	0.02%	✅
slim-mobilenet	64	3,823.06	3,820.83	0.06%	✅
slim-nasnetalarge	64	97.00	97.03	-0.04%	✅
slim-resnet50v2	64	1,651.45	1,651.24	0.01%	✅
bert-mrpc-onnx	8	591.46	590.73	0.12%	✅
bert-mrpc-tf	1	288.99	289.90	-0.31%	✅
pytorch-examples-wlang-gru	1	333.00	351.58	-5.28%	🔴
pytorch-examples-wlang-lstm	1	295.34	299.62	-1.43%	✅
torchvision-resnet50_1	1	451.64	455.92	-0.94%	✅
cadene-dpn92_1	1	244.62	244.63	-0.00%	✅
cadene-resnext101_1	1	189.05	187.95	0.59%	✅
onnx-taau-downsample	1	204.07	204.11	-0.02%	✅
dlrm-criteoterabyte	1	22.28	22.30	-0.07%	✅
dlrm-criteoterabyte_fp16	1	41.63	41.61	0.05%	✅
agentmodel	1	6,119.61	6,092.92	0.44%	✅
unet_fp16	2	33.73	33.74	-0.05%	✅
resnet50v1_fp16	1	560.36	570.97	-1.86%	✅
resnet50v1_int8	1	463.11	463.87	-0.16%	✅
bert_base_cased_fp16	64	620.83	620.81	0.00%	✅
bert_large_uncased_fp16	32	193.79	193.83	-0.02%	✅
bert_large_fp16	1	103.89	103.97	-0.08%	✅
distilgpt2_fp16	16	1,186.27	1,189.15	-0.24%	✅
yolov5s	1	298.12	298.14	-0.01%	✅
tinyllama	1	23.33	23.33	0.01%	✅
vicuna-fastchat	1	133.82	133.18	0.49%	✅
whisper-tiny-encoder	1	241.43	240.81	0.26%	✅
whisper-tiny-decoder	1	245.67	245.89	-0.09%	✅

This build is not recommended to merge 🔴

migraphx-bot · 2024-05-10T18:41:32Z

✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance

✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance

✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance

✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance

✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance

✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance

✅ agentmodel: PASSED: MIGraphX meets tolerance

✅ unet: PASSED: MIGraphX meets tolerance

✅ resnet50v1: PASSED: MIGraphX meets tolerance

✅ bert_base_cased_fp16: PASSED: MIGraphX meets tolerance

🔴bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output

✅ bert_large: PASSED: MIGraphX meets tolerance

✅ yolov5s: PASSED: MIGraphX meets tolerance

✅ tinyllama: PASSED: MIGraphX meets tolerance

✅ vicuna-fastchat: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-encoder: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-decoder: PASSED: MIGraphX meets tolerance

✅ distilgpt2_fp16: PASSED: MIGraphX meets tolerance

causten · 2024-05-14T13:51:16Z

Jenkinsfile

@@ -165,7 +165,8 @@ rocmtest clang_debug: rocmnode('mi100+') { cmake_build ->
 }, clang_asan: rocmnode('nogpu') { cmake_build ->
    stage('Clang ASAN') {
        def sanitizers = "undefined,address"
-        def debug_flags = "-g -O2 -fno-omit-frame-pointer -fsanitize=${sanitizers} -fno-sanitize-recover=${sanitizers}"
+        def sanitizers_disabled = "float-cast-overflow"


Doing it this way means we lose asan coverage of float-cast-overflow on the entire code base. Is there a way to target just the function/file?

Agree with Chris.

@nives-vukovic can you try disabling the sanitizer just on the convert op's compute function and see if that works?
https://clang.llvm.org/docs/AddressSanitizer.html#disabling-instrumentation-with-attribute-no-sanitize-address

The issue is reported in AMDMIGraphX/src/include/migraphx/shape.hpp. When I add __attribute__((no_sanitize("float-cast-overflow"))) above:

type operator()(U u) const { return type(u); }

in the 'as' struct, the issue is no longer reported on our system. However, this is a common function used in many places, so I don't know whether this would be a satisfactory solution for you.
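
For readers unfamiliar with the attribute mentioned above, a minimal sketch of how it is applied to a single function follows. The macro `NO_FLOAT_CAST_CHECK` and the function `unchecked_to_u8` are hypothetical names for illustration, not MIGraphX code. The macro expands to nothing on non-Clang compilers, which is an assumption made to keep the sketch portable.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical macro: apply the attribute only when compiling with Clang.
#if defined(__clang__)
#define NO_FLOAT_CAST_CHECK __attribute__((no_sanitize("float-cast-overflow")))
#else
#define NO_FLOAT_CAST_CHECK
#endif

// Hypothetical stand-in for a generic conversion function.
// The attribute only turns off the sanitizer check inside this function;
// an out-of-range argument is still undefined behaviour.
NO_FLOAT_CAST_CHECK inline std::uint8_t unchecked_to_u8(float x)
{
    return static_cast<std::uint8_t>(x);
}

inline void unchecked_to_u8_usage()
{
    // In-range input: the fraction is truncated toward zero.
    assert(unchecked_to_u8(42.5f) == 42);
}
```

Because the attribute silences the report without changing the conversion, every caller of the annotated function loses the check.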

I do not think it is a good idea to disable the sanitizer on type operator()(U u) either.

We can conditionally handle floating point to integer conversion inside the convert operator itself.

shape::visit(type, [&](auto as) {
    // clamping value between target_type's max and min doesn't work for NaNs,
    if(std::isnan(static_cast<double>(x)))
    {
        y = as.nan();
    }
    ---------------------> Here
    // if "type" is integer and "x" has floating point then, first do the clamping and then do the conversion.
    ------------------------
    else
    {
        // clamp overflowing/underflowing values to min()/max() instead of +/-infinity
        // during downcasting
        y = std::min(std::max(as(x), as.min()), as.max());
    }
});

Doing this works inside convert.hpp's apply() function.

else if(shape::is_integral(type) and std::is_floating_point_v<decltype(x)>)
{
    // for the floating point to integer conversion, clamp first and then convert to
    // avoid undefined behaviour
    y = as(std::min(std::max(static_cast<double>(x), static_cast<double>(as.min())),
                    static_cast<double>(as.max())));
}
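
The clamp-then-convert idea from the snippet above can be tried outside MIGraphX with a small standalone sketch. The function name `saturating_convert` is hypothetical, and returning 0 for NaN is an arbitrary choice for this example (the snippet above handles NaN in a separate branch). The sketch is limited to integer targets of at most 32 bits: for 64-bit targets, `static_cast<double>(max)` rounds up to a value outside the integer range, so the same clamp would not be sufficient there.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <limits>
#include <type_traits>

// Hypothetical helper (not MIGraphX API): convert floating point to integer,
// saturating at the target's limits instead of invoking undefined behaviour.
template <class To, class From>
To saturating_convert(From x)
{
    static_assert(std::is_integral<To>::value, "To must be an integer type");
    static_assert(std::is_floating_point<From>::value, "From must be floating point");
    // Assumption of this sketch: all bounds are exactly representable in double.
    static_assert(sizeof(To) <= 4, "sketch only covers integers up to 32 bits");

    // NaN has no integer counterpart; 0 is an arbitrary choice here.
    if(std::isnan(x))
        return To{0};

    const double lo = static_cast<double>(std::numeric_limits<To>::min());
    const double hi = static_cast<double>(std::numeric_limits<To>::max());
    // Clamp in the floating-point domain first, then cast the in-range value.
    return static_cast<To>(std::min(std::max(static_cast<double>(x), lo), hi));
}

inline void saturating_convert_usage()
{
    assert(saturating_convert<std::uint8_t>(300.0f) == 255); // clamped to max
    assert(saturating_convert<std::uint8_t>(-5.0f) == 0);    // clamped to min
}
```

Clamping before the cast is the key ordering: clamping the already-converted integer, as in the original `as(x)` expression, happens after the undefined conversion has taken place.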

It was compiled as : g++ -fsanitize=address

Try adding -fsanitize=undefined as well

(Yes that option does highlight the issue. Thanks.)

One answer is: https://clang.llvm.org/docs/UndefinedBehaviorSanitizer.html#disabling-instrumentation-with-attribute-no-sanitize-undefined

It has been tried, but that would disable the sanitizer on the entire type-cast function in MIGraphX, which is not desired. Therefore it needs to be handled inside convert.hpp itself.

Okay. Thanks.
Maybe try, on a per-test-case basis, passing an environment flag/variable:
https://clang.llvm.org/docs/UndefinedBehaviorSanitizer.html#runtime-suppressions

Fix Clang ASAN issue by disabling float-cast-overflow check in sanitizer

29c51b6

nives-vukovic requested a review from umangyadav May 13, 2024 12:57

nives-vukovic marked this pull request as ready for review May 13, 2024 15:44

nives-vukovic requested a review from causten as a code owner May 13, 2024 15:44

causten requested a review from pfultz2 May 14, 2024 13:36

causten reviewed May 14, 2024

nives-vukovic added 2 commits May 15, 2024 16:41

Merge remote-tracking branch 'origin/develop' into clang_asan_uint8_fix

0e6e32c

Add conidition in convert apply function to handle float to integer c…

a7e03ba

…ast overflow

nives-vukovic changed the title ~~Fix Clang ASAN issue by disabling float-cast-overflow check in sanitizer~~ Fix Clang ASAN issue by handling float to integer overflow in covert operator May 15, 2024

umangyadav approved these changes May 16, 2024

umangyadav self-requested a review May 16, 2024 14:27

lakhinderwalia changed the title ~~Fix Clang ASAN issue by handling float to integer overflow in covert operator~~ Fix Clang ASAN issue by handling float to integer overflow in convert operator May 21, 2024

Merge branch 'develop' into clang_asan_uint8_fix

4572474

causten merged commit 5fcf86e into develop Jun 10, 2024
46 of 47 checks passed

causten deleted the clang_asan_uint8_fix branch June 10, 2024 13:35

causten pushed a commit that referenced this pull request Jun 26, 2024

Fix Clang ASAN issue by handling float to integer overflow in convert…

f8fd6dd

… operator (#3071)

lajagapp pushed a commit to lajagapp/AMDMIGraphX that referenced this pull request Jul 8, 2024

Fix Clang ASAN issue by handling float to integer overflow in convert…

32ca32b

… operator (ROCm#3071)


causten May 14, 2024

umangyadav May 14, 2024

nives-vukovic May 14, 2024

umangyadav May 15, 2024

umangyadav May 15, 2024

lakhinderwalia May 21, 2024

umangyadav May 21, 2024

lakhinderwalia May 21, 2024

umangyadav May 21, 2024

lakhinderwalia May 21, 2024
