Inference performances decrease as the inference thread(also process) increase #6937

mysjo7789 · 2021-03-08T11:48:03Z

On my server, I run multiple ONNX inference processes and measure how long each process takes per inference. As the number of inference processes increases, the time ONNX Runtime takes per inference also increases. How can I configure ONNX Runtime so that per-process performance does not degrade?

snnn (Collaborator) · 2021-03-08T17:39:38Z

By default, each process assumes it can use all the CPUs. If you run multiple processes, the CPUs are oversubscribed and performance suffers. If you control the number of processes, divide the number of CPUs by the number of processes to get how many CPUs each process should use, then use the ONNX Runtime session options to set the number of threads in each process accordingly.
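A minimal Python sketch of that division, in case it helps. The helper names `threads_per_process` and `make_session` are made up for illustration, and setting `inter_op_num_threads` to 1 is my own assumption rather than something stated above. Note that `os.cpu_count()` reports logical CPUs, so you may want to pass the physical core count yourself.

```python
import os


def threads_per_process(num_processes, total_cpus=None):
    """Split the available CPUs evenly across processes (at least 1 each)."""
    if num_processes < 1:
        raise ValueError("num_processes must be at least 1")
    if total_cpus is None:
        # Logical CPU count; falls back to 1 if it cannot be determined.
        total_cpus = os.cpu_count() or 1
    return max(1, total_cpus // num_processes)


def make_session(model_path, num_processes):
    """Hypothetical helper: build a CPU session limited to this process's share of threads."""
    # Imported lazily so threads_per_process works without onnxruntime installed.
    import onnxruntime as ort

    opts = ort.SessionOptions()
    # Threads used to parallelize work inside a single operator.
    opts.intra_op_num_threads = threads_per_process(num_processes)
    # Assumption: operators run one after another, so one inter-op thread is enough.
    opts.inter_op_num_threads = 1
    return ort.InferenceSession(
        model_path,
        sess_options=opts,
        providers=["CPUExecutionProvider"],
    )
```

Each worker process would call something like `make_session("model.onnx", num_processes=4)` once at startup, where the model path and process count are placeholders for your own values.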
