How to Mine on Linux (Debian 11) with Nvidia GPUs. A Step-by-step Guide.

Here is a step-by-step guide on setting up and overclocking GPUs on Debian Linux with useful links and tips. I spent around a week searching for a stable solution, and below is what works for me.

Pre-requisits

First, you have to install Debian with the graphical desktop manager (I prefer GNOME). The reason for that is simple: to overclock Nvidia GPUs, you need an X-Server running.
If you have newly installed Debian 11, you need to add the user to the sudo group to run commands with admin privileges. To do that, run

su
/usr/bin/adduser <username> sudo

If you want to mine with multiple GPUs, it is recommended to switch PCIE[slot-number] speed from "Auto" to "Gen2" in your BIOS to detect all the GPUs correctly. For instance, in ASRock motherboard, it is done in the "Advanced" -> "Chipset config" section.

I also set the primary monitor to PCIE[slot-number], where the physical monitor is attached to GPU on the rig.

Tip. By default Debian with graphical desktop goes to suspend mode after several minutes of inactivity. To prevent it, go to GNOME settings, and choose

Keep display switched on in "Power Management" settings.
Enable autologin for the user under the "Users" setting to avoid suspending the login screen and running startup scripts after GNOME is loaded.

Nvidia Drivers Installation

Here is a beatiful article describing the installation process. Please avoid installing the default driver included in apt package manager. Instead, choose the manual installation with the latest driver available.

The idea of manual installation is following:

Navigate Nvidia Driver Download Page to choose correct driver
Download the driver to your home directory wget hppt://link/to/driver_name.run
Regenerate kernel's initramfs sudo update-initramfs -u
Reboot in no-graphic mode sudo systemctl set-default multi-user.target && sudo reboot
Run installer bash name_of_driver.run
Finally reboot into graphical target sudo systemctl set-default graphical.target && sudo reboot

I have tested different drivers and noticed that the x495 driver gives a better hash rate for LHR cards.

P.S. If you are not familiar with LHR concept, please have a look at this article

Tip. For the Nvidia drivers to start and overclock correctly, you need a real monitor to be switched on and connected to one of the GPUs on the rig. You need it at least for the first time while you are tuning your GPUs.

P.P.S. If for some reason you want to completely uninstall drivers, simply run sudo apt-get --purge remove "*nvidia*" and sudo update-initramfs -u in no-graphical mode. Then reboot into graphical mode as shown above.

The Nvidia X-Server configuration

Concept. X Server is a daemon that allows rendering graphics. Nvidia X Server is a set of additional configurations to X Server to allow Nvidia GPU render graphics/video.

By default, Nvidia protects its GPUs from accidental overlocking. For instance, if you call "nvidia-settings" GUI utility, you will notice no options to change clock offsets.

Fortunately, you may add a few lines to the X-Server config file to change that. An even better solution is to use "nvidia-xsettings" command, which adds necessary lines to the config file for you.

Run:

sudo nvidia-xconfig --enable-all-gpus --cool-bits=12

It will update/create /etc/X11/xorg.conf file. You may notice that the new lines with the text "Option" "Coolbits 12" were added to the config file. This directive says "nvidia-settings" utility to allow overclocking options.

Now, if you run nvidia-settings in your rig's terminal, you will see the Nvidia Setting GUI window, which has a new option: Editable performance levels (in PowerMizer section). Ok, now you can overclock.

Miner installation

Before proceeding with overclocking, it is a good idea to install the miner. I prefer to test my overlock setting in the actual mining process to estimate the effectiveness of my tuning.

My personal preference is NBMiner

When writing this article, NBMiner allows unlocking LHRv2 Nvidia cards up to 74% of their typical performance. Again, if you are not familiar with LHR concept, please read this article

With the NBMiner installed, you can run the default script to test how your GPUs perform. Later on, you will adjust the miner script once you are happy with your overclock settings.

For instance, to test your performance on the epherium network, go to the miner's directory and run ./start_eth.sh. You will see the current hash rate, power consumption, and GPU's temperature. These are the most important KPIs to control.

[18:39:26] INFO - ===================== [nbminer v40.1] Summary 2021-12-01 18:39:26 =====================
[18:39:26] INFO - |ID|Device|Hashrate| LHR|Accept|Reject|Inv|Powr|CTmp|MTmp|Fan|CClk|GMClk|MUtl|Eff/Watt|
[18:39:26] INFO - | 0|3060ti| 42.19 M|  74|     6|     0|  0| 112|  58|    | 61|1350| 7525| 100| 376.7 K|
[18:39:26] INFO - |------------------+----+------+------+---+----+--------------------------------------|
[18:39:26] INFO - |    Total: 42.19 M|    |     6|     0|  0| 112| Uptime:  0D 00:14:40        CPU:  0% |
[18:39:26] INFO - =======================================================================================

Here is the miner's output:

ID0 means GPU 0
LHR 74 means the miner unlocked LHR GPU up to 74% of its performance.
CTmp 58 - GPU temperature 58 degrees
Fan 61 - GPU fan gets 61% of its maximum speed
GMClk 7525 - Memory Clock is 7525 MHz (factory's default is 7100 according to nvidia-smi -q -d CLOCK | less. That is an easy way to check if your GPU is overclocked)
GClk 1351 - GPU Clock is is 1351 MHz
MUtl 100 - GPU Memory gets 100% of its performance.

Nvidia-setting and overclocking

When nvidia-settings starts, it reads the current settings from its configuration file (typically ~/.nvidia-settings-rc) and sends those settings to the X server. Then, it displays a graphical user interface (GUI) for configuring the current settings. When nvidia-settings exits, it queries the current settings from the X server and saves them to the configuration file.

Nvidia-settings over ssh

As you see, nvidia-setting utility runs as a GUI application, even if you run it as a command-line directive with some parameters.

It means that you need to run it locally in front of the display attached to the rig.

If you prefer to use ssh connection and run graphical utility remotely, you need:

Install X-Server on your local machine if you are on Mac/Windows. It allows you to run GUI applications remotely. To check if it works, ssh to your rig (ssh -X <your_rig_username>192.168.xx.xx), type firefox, and watch X window opening. If you are on Linux, you have X server installed by default.
Check your current display in ssh session by typing echo $DISPLAY in ssh terminal. Typically display has number 10, which means you are using 6010 port to run X Window apps via ssh (Do not forget to use -X parameter ssh -X ... to enable X-session over ssh).
Now, instead of running nvidia-settings locally on the rig, run nvidia-settings --display=:10 --ctrl-display=:0 over ssh. Here
- "10" means the monitor where the X Window is shown (your local computer which initiates ssh connection to the rig).
- "0" means the physical monitor attached to the rig. In my case, I have the rig's monitor connected to GPU0, so I use 0.

Overclocking

The text below is my understanding of how overclocking works. Of course, it doesn't necessarily mean I am right, but I realized the following during my research.

Depending on the load, the GPU changes its frequency. A sudden change of load causes a change in frequency, leading to power surges, instability and computational error.

                    p2
p                  _________
o        p1 ______/ |     ||
w       ___/.......>|     || 
e    __/  |........>|     || 
r  _/     |........>|     || 
-----frequency------+----(max)-->

The best option would be to lock the processor frequency at a certain level - this will increase the stability of the GPU.

p           locked _________
o           ______/       ||
w       ___/   |          ||
e    __/       |          ||
r  _/          |          ||
-----frequency-+---------(max)-->

The higher the locked frequency of the processor, the higher the power consumption and the higher the computational performance.

Another critical parameter is the frequency of RAM. It determines the amount of information that the GPU can download/upload per second.

Higher RAM frequencies are not as power-greedy as GPU's frequencies.

Therefore, it is often a wise decision to increase the RAM frequency while fixing the GPU frequency in the area where it consumes a little power. You will increase the hash rate with a minor power consumption increase.

Some mining algorithms are more sensitive to high GPU frequency, another - to high RAM frequency. If you increase GPU frequency, you are at risk of overheating your GPU due to high power consumption:

Great performance
but with high temp.

                  locked freq.
p .................____|____
o           ______/    |  ||
w       ___/           |  ||
e    __/               |  ||
r  _/                  |  ||
-----frequency-+-------+-(max)-->

Here is overclock comes to help. If you shift the maximum limit of your frequency, the whole function Power_consumption(frequency) will change.

                 locked freq.
p .....................   _____________
o .............._______|_/           ||
w         _____/       |             ||
e     ___/             |             ||
r  __/                 |             ||
-----frequency-+-------+-(old max)--(max)-->

You can see that you can decrease power consumption on a higher frequency by shifting the maximum frequency up. However, be cautious: setting a new maximum may lead to instability of GPU.

Some miners who have good cooling systems may prefer to set a new GPU frequency beyond a factory maximum limit:

                             locked freq.
p                         _________|____
o               _________/         |  ||
w         _____/                   |  ||
e     ___/                         |  ||
r  __/                             |  ||
-----frequency-+-------+-(old max)-+-(max)-->

The locked frequency may be a certain point (in MHz) or a narrow corridor:

p          corridor_________
o           ______/       ||
w       ___/ |..|         ||
e    __/     |..|         ||
r  _/        |..|         ||
-----frequency----------(max)-->

Remember, if the frequency you set requires power beyond a factory limit, then the changes will not take effect - you will get the maximum allowable (real) frequency instead of desired:

                desired freq.
p       real freq. __|_____
..limit.....|_____/--|---||
w       ___/|--------|---||
e    __/    |- N/A --|---||
r  _/       |--------|---||
-----frequency-------+--(max)-->

Nvidia provides us with tools to play with these settings.

You can lock GPU frequencies and set up maximum power consumption limits with nvidia-smi utility. If you want to extend the factory limits of the frequency, use nvidia-settings GUI utility.

From the command line

To overclock your memory frequency (extend the upper limit), send a command to nvidia utility:

nvidia-settings --display=:10 --ctrl-display=:0 -a '[gpu:0]/GPUMemoryTransferRateOffsetAllPerformanceLevels=1200'

The result applies to Nvidia X Server imidiately. Here

--display=:10 --ctrl-display=:0 says nvidia utility to run via ssh (see section above)
-a - apply immidiately
'[gpu:0]/GPUMemoryTransferRateOffsetAllPerformanceLevels=1200' - extend factory maximum RAM frequency by 1200 MHz.

You can see the result of this command by querying Nvidia X Server back:

nvidia-settings --display=:10 --ctrl-display=:0 -q '[gpu:0]/GPUMemoryTransferRateOffset[3]'

It is also good idea to set up "maximum performance mode" by typing

nvidia-settings --display=:10 --ctrl-display=:0 -a [gpu:0]/GPUPowerMizerMode=1

This command requires GPU to stay on maximum alloable frequencies, even if there is no computational load. It encreases stability.

Interesting fact The command line directives are not well documented and may change from one GPU generation to another. It is a good idea to ask for the right/actual command on Nvidia Developers Forum

From GUI

It is much easier to call Nvidia-utility nvidia-settings --display=:10 --ctrl-display=:0 and to change memory clock offset from there (GPU#XX -> PowerMizer). But in my case, the GUI utility didn't perform as well as the command line. So I have opened an issue after discussion with Nvidia technicians.

Undervoltage by GPU frequency

As I mentioned earlier, locking the GPU frequency will reduce power consumption. If you lock it at too low frequencies, you'll drop down your hash rate. If you lock it at a very high level, you will not add essential results, since the RAM frquency may have much bigger impact on hash rate. So your aim is to find a "golden middle"

The sudo nvidia-smi command may lock the GPU frequencies in the narrow corridor. For instance, the commands below set the persistence mode and lock GPU#0 frequency in 1350 MHz.

nvidia-smi -pm 1
nvidia-smi --id=0 --lock-gpu-clocks=1350

If I run the miner again, I notice that the power consumption decreased from 200W to 115 +/- 10 W. The GPU temperature dropped to 58-60 degrees Celsium. At the same time the hash rate is the same.

Undervoltage by Power Limit

It always good idea to put the limits for power consumption as a protection measure (in case you accidentally lock the GPU frequencies at a very high frequency). The following command limits the power consumption by 120W:

sudo nvidia-smi --id=0 -pl 120

Loading your configuration on boot

Once you are happy with the results, it is time to apply your configuration on load.

Power management config on load

The power consumption config is set up with nvidia-smi utility, which requires root privileges and doesn't require X Server running. Hence the simplest way to apply this config is to write these commands in root's crontab by typing sudo crontob -e and adding your config like this:

@reboot nvidia-smi -pm 1
@reboot nvidia-smi --id=0 --lock-gpu-clocks=1350,1350

Overclock settings on load

The overclock settings need to be auto-load after Nvidia X server starts, i.e., after GNOME (or another desktop environment) is loaded.

Option 1

Not elegant, but the working solution is to create a startup application that GNOME runs on user login. Remember, we set a user autologin option in GNOME so that we can use this method.

To make a startup application in GNOME, you need to create a file in ~/.config/autostart/ with .desktop extension with following content:

[Desktop Entry]
Name=Nvidia Setup
Encodinf=UTF-8
Type=Application
Exec=sh -c "/home/aborealis/start_miner_overclock.sh"
Hidden=false
X-GNOME-Autostart-enabled=true
Terminal=false
Comment=NVIDIA GPU Settings

Change "/home/aborealis/start_miner_overclock.sh" to your actial autoload script.

In your script, write the command you want to send to X Server on load. In my case, this is:

#!/bin/bash
nvidia-settings -a '[gpu:0]/GPUMemoryTransferRateOffsetAllPerformanceLevels=1450'
...
### Example of commands sent to separate screen:
screen -d -m -S miner
sleep 5
screen -S miner -X -p 0 stuff "/home/aborealis/miners/NBMiner_Linux/start_eth.sh^M"

Explanation:

First, I run the command to set up a 0th GPU memory offset to +1450 MHz,
Similarly I write other directives to X Server,
Then, I run the miner in a new screen session. It allows me to log in to the rig via ssh and switch to the running miner to see how it goes by typing screen -r. I use this approach withe the option 1 above. More about screen utility is here

screen -d -m -S miner starts a new screen named "miner" in detached mode
sleep 5 I deliberately wait for 5 seconds to allow all Nvidia settings to apply (not an elegant solution, I know)
screen -S miner -X -p 0 stuff "/home/aborealis/miners/NBMiner_Linux/start_eth.sh^M" - 5 seconds later I send a command to new screen to start a miner. Here is "/home/aborealis/miners/NBMiner_Linux" is where NBMiner lives.

Finally, do not forget to make both

the startup script and
your desktop app executable (type sudo chmod +x <filename>)

Option 2

The right way is to start via systemd service. Create a file /etc/systemd/system/miner.service with following content

[Unit]
Description=start miner script
Wants=network-online.target
After=network-online.target


[Service]
Environment="DISPLAY=:0"
Environment="XAUTHORITY=/run/user/1000/gdm/Xauthority"
ExecStart=/home/aborealis/start_miner_overclock-service.sh

[Install]
WantedBy=graphical.target

Here I use environment variable DISPLAY (it should be 0) and XAUTHORITY. You van find the XAUTHORIY value for your OS by typing ps a |grep X

Change "/home/aborealis/start_miner_overclock.sh" to your actial autoload script.

In your script, write the command you want to send to X Server on load. In my case, this is:

#!/bin/bash
nvidia-settings -a '[gpu:0]/GPUMemoryTransferRateOffsetAllPerformanceLevels=1450'
...
/home/aborealis/miners/NBMiner_Linux/start_eth.sh

Explanation:

First, I run the command to set up a 0th GPU memory offset to +1450 MHz,
Similarly I write other directives to X Server,
Finally, I start the script as a root

P.S. If you are luckier than me and your nvidia-setting utility reads/writes the ~/.nvidia-confog-rc configuration file as expected,

you may save your overclock configuration setting right into the config file (by adjusting settings in nvidia-config GUI app and then exiting the window).
load the settings on startup with one command: nvidia-settings --load-config-only instead of sending sets of commands in the script above.

Configuring the miner

Now it is time to configure your miner to use your desired pool and wallet. In my case I use NBMiner and it is loaded by "/home/aborealis/miners/NBMiner_Linux/start_eth.sh" script.

/home/aborealis/miners/NBMiner_Linux/nbminer --log-cycle 20 -a ethash -o stratum+ssl://eu1.ethermine.org:5555 -u <...my wallet..>.<my rig name> --api 127.0.0.1:22333 -log

Note. I have added --api 127.0.0.1:22333 option to access the miner statistics via http (I'll explain this later)

In your case, the script will be different, depending on your mining preferences.

Remote access to the rig behind a router

If your rig is behind a home router/NAT server, you may access it via remote ssh tunneling. Here is an article, which explains the principle.

As you see from the article, you need a separate server (www server) with a public I.P. address which you will use as a connect-point to access your rig from the outer world. You may rent such a virtual web host at Digital Ocean, Hetzner, or any other hosting provider. It is really cheap.

The basic steps are following

Rent a web host with public IP.
Create ssh keys to log in remotely to the web host from the rig without the need to enter a password:

ssh-keygen -t rsa
ssh-copy-id -i ~/.ssh/id_rsa.pub username@webserver

The first command creates the pair of the public (id_rsa.pub) and private (id_rsa) keys and puts them into ~/.ssh directory
The second command copies the content of your id_rsa.pub file and writes it to the web host into ~/.ssh/authorized_keys file

Create a ~/.ssh/config file on your rig:

host my_web_host
  HostName <webhost_public_ip>
  port 22
  IdentityFile ~/.ssh/id_rsa
  User <username>

now you can ssh from rig to remote web host by simply typing ssh my_web_host

Do the same procedure to connect from your local laptop/desktop/mobile terminal app to the webserver.
Now, create a reversed ssh tunnel between the rig and web host. From your rig type:

ssh -R -N -f 10022:localhost:22 my_web_host

This command asks to redirect all connections coming to the web host's localhost:10022 to your rig's 22nd port.

You can easily check how it works. Ssh to your web server and type there:

ssh -p 10022 <your_rig_username>@localhost

Option -p 10022 means that you connect to the 10022nd port to the localhost on the webserver. It will immediately redirect you back to the rig's terminal.

To avoid asking for a password each time, create the ssh key on a webserver to connect to the rig as described in steps 2 & 3. 6. You may now connect to the rig remotely from your desktop/laptop/mobile app terminal by typing:

ssh -t my_web_host 'ssh -p 10022 <your_rig_username>@localhost'

Load Remote SSH Access on Load

To make this connection resistant to failure and to load it automatically, use autossh utility (sudo apt install autossh) and write the following lines in your user's crontab:

@reboot /usr/bin/autossh -v -N -o "CheckHostIP=no" -o "ExitOnForwardFailure=yes" -o "ServerAliveInterval=10" -o "PubkeyAuthentication=yes" -o "PasswordAuthentication=no" -i /root/.ssh/id_rsa -R my_web_host:10022:localhost:22 -f my_web_host

LAN Remote web monitoring

As I mentioned before, I have added --api 127.0.0.1:22333 option to the mining script. So from now on, one can open firefox on the rig and go to http://127.0.0.1:22333 to see real-time web monitoring of NBMiner's work. But at this point, you can only access this web monitoring page from the rig.

To make it visible in the local network,

Install and enable Nginx server
Write a simple config file instead of the default one in /etc/nginx/nginx.conf

user www-data;

events {
        worker_connections 768;
}

http {

        server {
          listen 80;
          server_name  localhost;

          location / {
            proxy_pass http://127.0.0.1:22333;
          }
        }
}

Restart Nginx sudo systemctl restart nginx

Now you can access the web-monitoring page from anywhere in your local network. For example, if you go to the http://192.168.xx.xx (input your rig's IP address), you will see a web monitoring page.

WWW Remote web monitoring

Now, if you run the command below, you will create another reversed ssh tunnel.

ssh -R _N -f 8486:127.0.0.1:80 my_web_host

From now on, all requests to localhost:8486 on the webserver will be redirected to the rig's 80th port. As you remember, Nginx listens to this post and returns an HTML page in response.

Let's check it. Ssh to your web host and type:

wget -O- 127.0.0.1:8486

You will see the HTML content of the web monitoring page. To load your ssh tunnel on startup, add:

@reboot /usr/bin/autossh -o "CheckHostIP=no" -o "ExitOnForwardFailure=yes" -o "ServerAliveInterval=10" -o "PubkeyAuthentication=yes" -o "PasswordAuthentication=no" -i /root/.ssh/id_rsa -R 8486:127.0.0.1:80 -N -f my_web_host

to your crontab.

Now you can:

Assign the domain to the webserver (not covered in this guide)
Configure the Nginx server on the remote host to redirect all requests to 443 port (https://) to the localhost:8486. (not covered in this guide)

Once you have done it, you can access web monitoring statistics via the domain you created.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to Mine on Linux (Debian 11) with Nvidia GPUs. A Step-by-step Guide.

Pre-requisits

Nvidia Drivers Installation

The Nvidia X-Server configuration

Miner installation

Nvidia-setting and overclocking

Nvidia-settings over ssh

Overclocking

From the command line

From GUI

Undervoltage by GPU frequency

Undervoltage by Power Limit

Loading your configuration on boot

Power management config on load

Overclock settings on load

Option 1

Option 2

Configuring the miner

Remote access to the rig behind a router

Load Remote SSH Access on Load

LAN Remote web monitoring

WWW Remote web monitoring

About

Releases

Packages

aborealis/linux-mining-debian

Folders and files

Latest commit

History

Repository files navigation

How to Mine on Linux (Debian 11) with Nvidia GPUs. A Step-by-step Guide.

Pre-requisits

Nvidia Drivers Installation

The Nvidia X-Server configuration

Miner installation

Nvidia-setting and overclocking

Nvidia-settings over ssh

Overclocking

From the command line

From GUI

Undervoltage by GPU frequency

Undervoltage by Power Limit

Loading your configuration on boot

Power management config on load

Overclock settings on load

Option 1

Option 2

Configuring the miner

Remote access to the rig behind a router

Load Remote SSH Access on Load

LAN Remote web monitoring

WWW Remote web monitoring

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages