Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with GPU Kernel Execution in Unified Memory Mode #128

Closed
DDantalion opened this issue Dec 30, 2024 · 1 comment
Closed

Issue with GPU Kernel Execution in Unified Memory Mode #128

DDantalion opened this issue Dec 30, 2024 · 1 comment

Comments

@DDantalion
Copy link

I'm running the sample implementation of PageRank with the following command:
./pagerank -timing -trace-mem -unified-gpus 1,2,3,4 -use-unified-memory -report-all
However, I noticed that only GPU1 shows kernel execution time, while GPUs 2, 3, and 4 remain idle. According to the code, workgroups should have been distributed across all four GPUs.

pr_unified
@DX990307
Copy link
Contributor

The size of the benchmark is not enough to drive all GPUs. You can increase the size of the benchmark like

var numNode = flag.Int("node", 16, "The number of nodes")
change the node counts from 16 to 512.

@DX990307 DX990307 closed this as completed Jan 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants