Replies: 1 comment 1 reply
-
The GPU is not a silver bullet when it comes to performance. There are so many reasons why the CPU could be faster than the GPU when running a query. Often this can be fixed with proper tuning. https://nvidia.github.io/spark-rapids/docs/FAQ.html#why-is-my-query-in-gpu-mode-slower-than-cpu-mode can help you start to look/debug why it might be slower. But there are some situations where the GPU inherently will not be faster than the CPU.
It is also possible that you found a performance bug where there is an operator that we have implemented badly for some cases. If you could share with us details about the query and your data we can probably help you debug why it is slower. You can also look at the Spark UI and compare a CPU run to a GPU run. You can try to match up stages and see which ones are taking longer on the GPU vs the CPU to debug what is happening. We do have a few tools that can help with some of this, https://nvidia.github.io/spark-rapids/docs/spark-profiling-tool.html but it is not perfect. |
Beta Was this translation helpful? Give feedback.
-
Hi, I plugged in spark 3.1 with rapids, and everything setup.
GPU: Tesla T4 16gb
the overall spark flow is giving less completion time on cpu as compared to gpu, I want to know what I am doing wrong because gpu is supposed to be faster.
Beta Was this translation helpful? Give feedback.
All reactions