Is there any plan to make NNAPI EP up to date? #17464
-
Since Android API Level 29, NNAPI has introduced many new APIs such as device discovery, burst execution, etc. Is there any plan to support these features in the NNAPI EP?
Replies: 3 comments
-
Not sure how that is relevant. The NNAPI EP converts ONNX operators to equivalent NNAPI operators to execute the ONNX model. Anything like device discovery seems orthogonal to that. The 'burst' mode sounds like it does some form of batching, but as the NNAPI EP is executing within the context of the overall ONNX model there's not really a way to surface that.
-
Thanks a lot! Meanwhile, I think the new APIs such as ANeuralNetworksCompilation_createForDevices could be used to determine whether certain accelerating hardware should be used or whether to just use the CPU. For my ORT model, on some specific devices the CPU EP is even faster than the NNAPI EP, so I need to know at runtime whether to use the NNAPI EP. And I don't think I'm the only one encountering this case.
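For reference, here is a rough sketch (not part of the ORT API, helper name `HasNnapiAccelerator` is made up for illustration) of how the NDK NeuralNetworks device-enumeration calls available since API level 29 could be used to check whether a non-CPU NNAPI device is even reported, before deciding to enable the NNAPI EP:

```cpp
// Minimal sketch, assuming API level 29+: enumerate NNAPI devices via the NDK
// NeuralNetworks API and report whether a GPU/accelerator device is present.
#include <android/NeuralNetworks.h>
#include <android/log.h>

// Hypothetical helper: returns true if at least one non-CPU NNAPI device is reported.
bool HasNnapiAccelerator() {
  uint32_t device_count = 0;
  if (ANeuralNetworks_getDeviceCount(&device_count) != ANEURALNETWORKS_NO_ERROR) {
    return false;
  }
  for (uint32_t i = 0; i < device_count; ++i) {
    ANeuralNetworksDevice* device = nullptr;
    if (ANeuralNetworks_getDevice(i, &device) != ANEURALNETWORKS_NO_ERROR) {
      continue;
    }
    int32_t type = ANEURALNETWORKS_DEVICE_UNKNOWN;
    ANeuralNetworksDevice_getType(device, &type);
    const char* name = nullptr;
    ANeuralNetworksDevice_getName(device, &name);
    __android_log_print(ANDROID_LOG_INFO, "nnapi", "device %u: %s (type %d)", i, name, type);
    if (type == ANEURALNETWORKS_DEVICE_GPU || type == ANEURALNETWORKS_DEVICE_ACCELERATOR) {
      return true;
    }
  }
  return false;
}
```

Note that presence of an accelerator alone doesn't guarantee the NNAPI EP will be faster for a given model, since the vendor's driver still decides how well each operation runs.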
-
We already use that API.

NNAPI performance is hugely dependent on the individual device. Basically, NNAPI is an abstraction layer and the hardware vendor (GPU/NPU) implements the actual operations. If the vendor has not implemented an operation, NNAPI falls back to a reference CPU implementation (i.e. the simplest, most basic way to do it, with no optimization). If the vendor implements something badly, you're also stuck with that.

Due to this, our recommendation is to always test on-device whether enabling the NNAPI EP is faster or not, save the result of that test, and use that option going forward. If your model uses 32-bit float data, I'd recommend also testing with the XNNPACK execution provider.

If it were me, the first time my app ran I'd try the CPU execution provider, the NNAPI execution provider, and optionally the XNNPACK execution provider (create an InferenceSession for each test and free it before the next one) with some representative input, and save the execution time from one or more Run calls. Ignore the first call - that will always be slower because we cache information from it.

If the NNAPI execution time is in the same ballpark as CPU or XNNPACK, I'd choose NNAPI. It will be using the GPU or NPU and will most likely have better power consumption. If not, pick whichever of XNNPACK or CPU is better. Both are CPU based - XNNPACK just uses different assembly instructions when the model is 32-bit float.
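A minimal sketch of that on-device test using the ORT C++ API is below. The model path, input/output names, and input shape are placeholders for illustration; a real app would use its own model's values (or query them from the session). The XNNPACK provider option shown is the documented `intra_op_num_threads` string key.

```cpp
// Sketch: time each EP on a representative input and pick the fastest,
// ignoring the first Run (it is slower due to caching).
#include <chrono>
#include <numeric>
#include <vector>

#include <onnxruntime_cxx_api.h>
#include <nnapi_provider_factory.h>  // OrtSessionOptionsAppendExecutionProvider_Nnapi

enum class Ep { kCpu, kNnapi, kXnnpack };

// Builds a session for the given EP, runs it a few times with dummy input,
// and returns the average latency in milliseconds (first run excluded).
double TimeEp(Ort::Env& env, const char* model_path, Ep ep) {
  Ort::SessionOptions so;
  if (ep == Ep::kNnapi) {
    Ort::ThrowOnError(OrtSessionOptionsAppendExecutionProvider_Nnapi(so, /*nnapi_flags=*/0));
  } else if (ep == Ep::kXnnpack) {
    so.AppendExecutionProvider("XNNPACK", {{"intra_op_num_threads", "2"}});
  }
  Ort::Session session(env, model_path, so);

  // Placeholder input: one float tensor named "input" of shape {1, 3, 224, 224}.
  std::vector<int64_t> shape{1, 3, 224, 224};
  std::vector<float> data(1 * 3 * 224 * 224, 0.5f);
  auto mem_info = Ort::MemoryInfo::CreateCpu(OrtArenaAllocator, OrtMemTypeDefault);
  Ort::Value input = Ort::Value::CreateTensor<float>(mem_info, data.data(), data.size(),
                                                     shape.data(), shape.size());
  const char* input_names[] = {"input"};
  const char* output_names[] = {"output"};

  std::vector<double> ms;
  for (int i = 0; i < 6; ++i) {
    auto start = std::chrono::steady_clock::now();
    session.Run(Ort::RunOptions{nullptr}, input_names, &input, 1, output_names, 1);
    auto end = std::chrono::steady_clock::now();
    if (i > 0) {  // skip the first call as recommended above
      ms.push_back(std::chrono::duration<double, std::milli>(end - start).count());
    }
  }
  // The session is destroyed on return, freeing it before the next EP is tested.
  return std::accumulate(ms.begin(), ms.end(), 0.0) / ms.size();
}
```

You'd call `TimeEp` once per EP on first launch, persist which one won, and prefer NNAPI whenever its time is in the same ballpark as the CPU-based options.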