Is 0.4.3 NNOM c backend slower than CMSIS-5.9.0? #182

bfs18 · 2023-03-02T07:56:44Z

I test the mnist-simple model in rt-thread environment and run model predict for 1000 times.
using CMSIS 5.9.0 backend, it consumes 1423 ms
using NNOM 0.4.3 C backend, it consumes 1392 ms
The document says that replacing c backend with CMSIS would bring 5 times speed-up, however, the performance is similar according to my test. What's the reason? NNOM updated or I didn't set CMSIS macros correctly?

majianjia · 2023-03-03T11:25:36Z

Probably you havent enable CMSIS's NN acceleration, i.e. you are running CMSIS on their C backend instead of SIMD assemblys.
You may check CMSIS instruction or the note under HWC format

bfs18 · 2023-03-04T09:19:22Z

UPDATE: When -Ofast is added, using CMSIS 5.9.0 backend, reduces to 369 ms.

Hi Jia, thanks for your advice. I add

ARM_MATH_DSP,ARM_MATH_CM7,__FPU_PRESENT=1

using CMSIS 5.9.0 backend, reduces to 1055 ms.
I used the rt-thread qemu-vexpress-a9 bsp. Is this normal?

Probably you havent enable CMSIS's NN acceleration, i.e. you are running CMSIS on their C backend instead of SIMD assemblys. You may check CMSIS instruction or the note under HWC format

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is 0.4.3 NNOM c backend slower than CMSIS-5.9.0? #182

Is 0.4.3 NNOM c backend slower than CMSIS-5.9.0? #182

bfs18 commented Mar 2, 2023

majianjia commented Mar 3, 2023

bfs18 commented Mar 4, 2023 •

edited

Loading

Is 0.4.3 NNOM c backend slower than CMSIS-5.9.0? #182

Is 0.4.3 NNOM c backend slower than CMSIS-5.9.0? #182

Comments

bfs18 commented Mar 2, 2023

majianjia commented Mar 3, 2023

bfs18 commented Mar 4, 2023 • edited Loading

bfs18 commented Mar 4, 2023 •

edited

Loading