You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
start a cluster normally with ray up golem-cluster.dev.yaml
ray attach golem-cluster.dev.yaml
pip install numba
Collecting numba
Downloading https://pypi.dev.golem.network/packages/73/d5/d359cece32302442c8ea9742b1324c4eda689fd54281eb3144f520c81f6d/numba-0.59.0-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata
- 2.7 kB ? 0:00:00
Collecting llvmlite<0.43,>=0.42.0dev0 (from numba)
Downloading https://pypi.dev.golem.network/packages/2b/01/764489e364948f52aa7cb958a91a8dafd489357d2401f66946542bbc1764/llvmlite-0.42.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata
- 4.8 kB ? 0:00:00
Collecting numpy<1.27,>=1.22 (from numba)
Downloading https://pypi.dev.golem.network/packages/4b/d7/ecf66c1cd12dc28b4040b15ab4d17b773b87fa9d29ca16125de01adb36cd/numpy-1.26.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata
- 61.0 kB 271.3 MB/s 0:00:00
Downloading https://pypi.dev.golem.network/packages/73/d5/d359cece32302442c8ea9742b1324c4eda689fd54281eb3144f520c81f6d/numba-0.59.0-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
\ 3.7 MB 25.9 MB/s 0:00:00
Downloading https://pypi.dev.golem.network/packages/2b/01/764489e364948f52aa7cb958a91a8dafd489357d2401f66946542bbc1764/llvmlite-0.42.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- 43.8 MB 24.4 MB/s 0:00:02
Downloading https://pypi.dev.golem.network/packages/4b/d7/ecf66c1cd12dc28b4040b15ab4d17b773b87fa9d29ca16125de01adb36cd/numpy-1.26.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
/ 18.2 MB 31.4 MB/s 0:00:00
Installing collected packages: numpy, llvmlite, numba
ERROR: Could not install packages due to an OSError: [Errno 28] No space left on device
suggested solution: we should change the VM image to use a VOLUME and ensure that pip is configured to keep at least the temporary files but preferably all the installed packages on an external volume
The text was updated successfully, but these errors were encountered:
when running any payloads on ray on golem, if there are artifacts written to the current working directory, they'll pretty quickly fill up the remaining space
when one installs packages inside the VM, the downloaded package files and then packages themselves also take up space on the current filesystem, which also means that you cannot install anything itself larger than the available space
when anything happens to the ray node on the VM, to pull the logs we now need to log through SSH but it would be easier to just snatch those logs from the machine with regular exescript transfers - but afair/k, they require the files to be transferred residing on a volume...
so, it all boils down to the fact that we need to rework the images so that they use VOLUMES appropriately to alleviate the above issues...
steps to reproduce:
ray up golem-cluster.dev.yaml
ray attach golem-cluster.dev.yaml
pip install numba
suggested solution: we should change the VM image to use a VOLUME and ensure that pip is configured to keep at least the temporary files but preferably all the installed packages on an external volume
The text was updated successfully, but these errors were encountered: