How to run multiple cuda sessions in parallel #8559

sunhmy · 2021-07-30T09:05:12Z

sunhmy
Jul 30, 2021

Hi,

I'm new to OnnxRuntime and trying to increase performance by running 2 GPU sessions in parallel. I'd assume I can do it as simple as:

create 2 sessions and 2 cuda streams.
assign 1st stream to 1st session by OrtCUDAProviderOptions, same for the 2nd stream to the 2nd session
create 2 threads, each for one of the sessions above.

When running it, I do see that lots of the time the 2 GPU streams are running in parallel, however, from profiling the GPU perf, I'm seeing a lot of pthread_mutex_lock causing huge latency. So that now running 2 GPU streams in parallel is as slow as running them sequentially for me...

Below is a screenshot showing those pthread_mutex_lock causing the extra dependency(false dependency?).

1)Are those inserted by OnnxRuntime?
2) Is there any way to get rid of those locks if they are false dependency?
3) Is there any other way to achieve running multiple cuda sessions in parallel?

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to run multiple cuda sessions in parallel #8559

{{title}}

Replies: 0 comments

Select a reply

How to run multiple cuda sessions in parallel #8559

sunhmy Jul 30, 2021

Replies: 0 comments

sunhmy
Jul 30, 2021