
[CoreML MLProgram] Support Float16 (1/N) #22068

Open · wants to merge 24 commits into base: main
Conversation

wejoncy (Contributor) commented Sep 12, 2024

Description

Support Float16 for the CoreML MLProgram EP.
Supported operations: Unary/Binary/Activation/Pool/Shape/Gemm/Conv

Motivation and Context

@@ -257,6 +257,98 @@ TEST(CoreMLExecutionProviderTest, TestNameSanitization) {
// TensorRT does not support Clip opset 11 yet.
test.Run(OpTester::ExpectResult::kExpectSuccess, "", {kTensorrtExecutionProvider});
}

TEST(CoreMLExecutionProviderTest, TestBinaryFp16) {
Contributor:

Do we need to add separate tests here or could we update onnxruntime\test\providers\cpu\math\element_wise_ops_test.cc to run the MLFloat16 tests it has for CoreML?

We're also going to add some xnnpack fp16 kernels so the more common test code that is used the better.

wejoncy (Contributor Author):

Good idea. Refactored some of the elementwise UTs. It's still a bit messy for the FP16/BF16 tests.
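(Editorial aside: most of the messiness in sharing one test body across fp32 and fp16 comes down to tolerance. A minimal self-contained sketch of the effect, assuming nothing from ORT; `RoundToFp16` is a toy helper, not `MLFloat16`:)

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Toy helper (not ORT's MLFloat16): round a float to the nearest
// fp16-representable value. Normal range only; no denormal/inf handling.
inline float RoundToFp16(float f) {
  uint32_t u;
  std::memcpy(&u, &f, sizeof(u));
  uint32_t sign = u & 0x80000000u;
  uint32_t exp = (u >> 23) & 0xFFu;
  uint32_t mant = u & 0x7FFFFFu;
  uint32_t keep = mant >> 13;     // keep fp16's 10 mantissa bits
  uint32_t rem = mant & 0x1FFFu;  // bits being discarded
  if (rem > 0x1000u || (rem == 0x1000u && (keep & 1u))) ++keep;  // round to nearest even
  if (keep == 0x400u) { keep = 0; ++exp; }                       // mantissa overflow
  uint32_t out = sign | (exp << 23) | (keep << 13);
  float r;
  std::memcpy(&r, &out, sizeof(r));
  return r;
}
```

For example, rounding 0.1f and 0.2f to fp16, adding, and re-rounding lands about 2e-4 away from 0.3f, so a shared test body needs a per-type tolerance: an fp32-grade rtol of 1e-6 fails for fp16 while 1e-3 passes.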

@wejoncy wejoncy marked this pull request as ready for review September 20, 2024 09:07
onnxruntime/test/providers/cpu/math/gemm_test.cc (outdated, resolved)
test.AddAttribute("beta", 1.0f);
test.AddInput<MLFloat16>("A", {2, 4}, f_A);
test.AddInput<MLFloat16>("B", {4, 3}, f_B, true);
f_C.resize(3);
Contributor:

nit: create a new local variable so that a developer doesn't have to track the changes made to f_C across multiple different tests.

would also be good to capture in a comment what exactly is being tested by using different input sizes for f_C and why.

may also be good to use something other than all 1's for the bias input as that could potentially hide an issue in the handling.
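(Editorial aside: the all-ones-bias point can be illustrated with a hedged, self-contained sketch. `Gemm2x3` and its shapes are made up for illustration, not the code under review: a broadcast bug that indexes the bias by row instead of by column is invisible when every bias element is 1.)

```cpp
#include <array>

// Hypothetical GEMM: C[2x3] = A[2x4] * B[4x3] + bias[3], bias broadcast
// across rows. `buggy_broadcast` injects a bug that indexes the bias by
// row (m) instead of by column (n).
std::array<float, 6> Gemm2x3(const std::array<float, 8>& A,
                             const std::array<float, 12>& B,
                             const std::array<float, 3>& bias,
                             bool buggy_broadcast) {
  std::array<float, 6> C{};
  for (int m = 0; m < 2; ++m) {
    for (int n = 0; n < 3; ++n) {
      float acc = buggy_broadcast ? bias[m] : bias[n];  // bug: bias[m]
      for (int k = 0; k < 4; ++k) acc += A[m * 4 + k] * B[k * 3 + n];
      C[m * 3 + n] = acc;
    }
  }
  return C;
}
```

With bias {1, 1, 1} the correct and buggy versions produce identical output, so the test passes either way; with bias {1, 2, 3} they diverge and the bug is caught.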

wejoncy (Contributor Author):

Done

@@ -37,6 +37,12 @@ void TestConvFp16Op(const ConvOpAndTestAttributes& attributes,
OpTester::ExpectResult expect_result = OpTester::ExpectResult::kExpectSuccess,
const std::string& err_str = "",
int opset = 11) {
#if !defined(MLAS_F16VEC_INTRINSICS_SUPPORTED)
// a `return` after tester will make binary crash
Contributor:

Is the reason that if you do not have MLAS_F16VEC_INTRINSICS_SUPPORTED there's no support for the custom NhwcFusedConv attribute? And now that we hit this code when the CoreML EP is enabled we need to do an early exit in that case as no EP can handle the node?

wejoncy (Contributor Author) commented Sep 25, 2024:

I think I misunderstood it. I just need to exclude the CoreML EP when an activation is fused.

Contributor:

My question was more about making the comment explain 'why' rather than 'what' will happen. If you just say 'it will crash', the next developer to look at the code has to start from scratch to figure out the reason in order to decide whether it applies to their potential changes or not.

I think it comes down to the activation attribute being something that the FusedConv/NhwcFusedConv operators have that the ONNX Conv doesn't. As the CoreML EP doesn't support those contrib ops, if the CPU EP can't handle it due to the lack of MLAS support, the OpTester won't find any EP that can run the node and fails.

i.e. if attributes.activation is not empty we're using a contrib op that only a small number of EPs support.

I'm surprised this check isn't also required for the QNN EP as I don't see FusedConv or NhwcFusedConv in the ops it supports.
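(Editorial aside: a hedged sketch of that skip condition. The function name and signature are hypothetical, not ORT's API; it only mirrors the reasoning above.)

```cpp
#include <string>

// Hypothetical condition: a non-empty `activation` attribute means the test
// builds a FusedConv/NhwcFusedConv contrib op, which the CoreML EP does not
// implement; without MLAS fp16 intrinsics the CPU EP cannot run it either,
// so OpTester would find no capable EP and fail, hence the early exit.
inline bool ShouldSkipFp16FusedConvTest(const std::string& activation,
                                        bool mlas_f16vec_supported) {
  return !activation.empty() && !mlas_f16vec_supported;
}
```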

wejoncy (Contributor Author) commented Sep 26, 2024:

Got it. Comments added.

onnxruntime/core/providers/coreml/builders/model_builder.h (outdated, resolved)