[RFC] "Multi-Streaming" in CUDA Graphs #4318
ezhulenev
started this conversation in
Show and tell
Replies: 2 comments 2 replies
-
Any progress? |
Beta Was this translation helpful? Give feedback.
0 replies
-
It's behind the flag today: https://github.com/openxla/xla/blob/main/xla/xla.proto#L615-L616, we are working on more explicit multi-streaming + cuda graphs support on XLA, it's here: https://github.com/openxla/xla/blob/main/xla/service/gpu/execution_stream_assignment.h |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi everyone,
We are working on adding multi-streaming to CUDA graphs in XLA. The high level plan is capture in this document.
Beta Was this translation helpful? Give feedback.
All reactions