an illegal memory access was encountered #39
Comments
Sorry for the late response. Could you show me your Python file?
Of course. I guess the ThreadblockPolicy I'm using is ThreadblockPolicy.auto.
Yeah, please try the manual policy first. The auto policy has not been maintained for a while and may be deprecated in the future.
When I tested the hierarchical_allreduce.py file that comes with msccl-tools, the command I used was: python ./hierarchical_allreduce.py --protocol=Simple --schedule=manual 4 2 1 > hierarch_Simple_4_2_1.xml. After running the mpirun command, the following error occurred. Why is that? @Binyang2014 hierarch_Simple_4_2_1.xml: xmlFile.txt
@Binyang2014 Excuse me, do you have time to answer my question?
Sorry, I haven't had time to go through your case in recent weeks. One thing I'd suggest is using the Simple protocol instead of LL. LL doubles the buffer, which may cause some issues. I may be able to find time to check the error next week.
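For readers wondering about the "LL doubles the buffer" remark, here is a minimal sketch of the arithmetic. It assumes the usual NCCL-style LL layout, in which each 8-byte store carries 4 bytes of data plus a 4-byte flag. The function names are hypothetical and not part of msccl or msccl-tools.

```python
# Sketch only: assumes LL packs 4 data bytes + 4 flag bytes into each 8-byte store.
# The names below are illustrative, not msccl APIs.
LL_DATA_BYTES = 4
LL_FLAG_BYTES = 4


def ll_buffer_bytes(payload_bytes: int) -> int:
    """Staging-buffer bytes needed to carry payload_bytes under the assumed LL layout."""
    slots = -(-payload_bytes // LL_DATA_BYTES)  # ceiling division: partial slots round up
    return slots * (LL_DATA_BYTES + LL_FLAG_BYTES)


def simple_buffer_bytes(payload_bytes: int) -> int:
    """Simple protocol: the buffer holds the payload with no per-word flags."""
    return payload_bytes


if __name__ == "__main__":
    payload = 1 << 20  # 1 MiB of data
    print("Simple:", simple_buffer_bytes(payload), "bytes")
    print("LL:    ", ll_buffer_bytes(payload), "bytes")
```

Under this assumption, a schedule whose chunks fit the buffer with Simple can need twice the space with LL, which is one reason to rule out LL first when chasing an illegal memory access.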
Of course, thank you for your suggestions and response.
What could be the possible reasons for the following issue?
And my XML file:
xml.txt