Driver quantize fp8 update #3715

CharlieL7 · 2024-12-13T22:46:52Z

Updates the quantization to always quantize fp8 to the OCP fp8e4m3fn type
Removes running simplify_qdq and optimize_module during the quantization so that the ocp_to_fnuz conversion pass can work properly
Don't merge this until FP8 OCP to FP8 FNUZ on hardware with only FP8 FNUZ support #3684 is done.

…_quantize_fp8_update

codecov · 2024-12-13T23:02:21Z

Codecov Report

Attention: Patch coverage is 50.00000% with 1 line in your changes missing coverage. Please review.

Project coverage is 92.25%. Comparing base (3c36b9b) to head (b62a304).

Files with missing lines	Patch %	Lines
src/quantization.cpp	50.00%	1 Missing ⚠️

Additional details and impacted files

@@               Coverage Diff               @@
##           ocp_to_fnuz    #3715      +/-   ##
===============================================
+ Coverage        92.23%   92.25%   +0.02%     
===============================================
  Files              517      517              
  Lines            21819    21808      -11     
===============================================
- Hits             20124    20120       -4     
+ Misses            1695     1688       -7

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

pfultz2 · 2024-12-13T23:19:22Z

src/targets/gpu/include/migraphx/gpu/context.hpp

@@ -311,7 +311,6 @@ struct context
        value result;
        result["events"]  = events.size();
        result["streams"] = current_device->nstreams();
-        result["gfx_name"] = get_current_device().get_gfx_name();


Why is this removed on serialization?

I added this earlier when getting FP8 OCP in to query the gfx number from the driver. We could keep it, but it would not be used anywhere anymore.

CharlieL7 added 6 commits December 10, 2024 15:18

initial

0318f32

temporary

3b48242

disable simpilify_qdq in quantization_8bits

b373d10

revert

28aab5f

disable extra passes after quantize_8bits

7e0142f

Merge branch 'ocp_to_fnuz' of github.com:ROCm/AMDMIGraphX into driver…

b62a304

…_quantize_fp8_update

CharlieL7 self-assigned this Dec 13, 2024

CharlieL7 requested a review from causten as a code owner December 13, 2024 22:46

pfultz2 reviewed Dec 13, 2024

View reviewed changes

causten requested a review from TedThemistokleous December 16, 2024 20:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Driver quantize fp8 update #3715

Driver quantize fp8 update #3715

CharlieL7 commented Dec 13, 2024 •

edited

Loading

codecov bot commented Dec 13, 2024

pfultz2 Dec 13, 2024

CharlieL7 Dec 16, 2024

Driver quantize fp8 update #3715

Are you sure you want to change the base?

Driver quantize fp8 update #3715

Conversation

CharlieL7 commented Dec 13, 2024 • edited Loading

codecov bot commented Dec 13, 2024

Codecov Report

pfultz2 Dec 13, 2024

Choose a reason for hiding this comment

CharlieL7 Dec 16, 2024

Choose a reason for hiding this comment

CharlieL7 commented Dec 13, 2024 •

edited

Loading