Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

coolgpus Driver Mismatch error #13

Open
datacubeR opened this issue Feb 26, 2021 · 2 comments
Open

coolgpus Driver Mismatch error #13

datacubeR opened this issue Feb 26, 2021 · 2 comments

Comments

@datacubeR
Copy link

Hi guys,
Iḿ having trouble to make this module work. Is there any restrictions to use this?
I have an RTX 3090, with the following drivers:
image
I'm using this as a server connected through ssh for training deep learning models purposes. I just assembled this server a couple of weeks ago, and during these days I started intensive training load getting not so high temperatures (around 65C full load) but I notice fan speed is close to 70%.

I installed coolgpus but I got a Driver mismatch error with code exit 255. I got hell of a scare, because nvidia-smi stopped working giving this message NVML driver mismatch and I thought I ruined my drivers, and needed to reinstall over everything in my server... So I quickly uninstall this and reboot and now it is working, but I really like the idea of using a pip installable for controlling fan speed.

Any help to make this work will be deeply appreciated.

Alfonso

@andyljones
Copy link
Owner

andyljones commented Feb 26, 2021

There's some general debugging steps here. Run the script manually, let it crash, look at the logs and see which command's raising the error. Then run that command manually and see if you get the same error.

Incidentally if that Xorg instance and gnome-shell is because you have a monitor attached, coolgpus will straight-up not work.

@datacubeR
Copy link
Author

In my case I'm connected to a laptop that through SSH connects the server, that is why I can see the output of nvidia-smi. I will check the steps you provided.

Thanks,

Alfonso

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants