Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

make link_nvidia_host_libraries.sh script a bit more robust, in case target of host_injections directory is a non-existing directory #437

Conversation

boegel
Copy link
Contributor

@boegel boegel commented Dec 21, 2023

If the target of the host_injections variant symlink, you get the following puzzling error:

$ ls -l ${EESSI_CVMFS_REPO}/host_injections
lrwxrwxrwx 1 cvmfs cvmfs 23 Oct  3 13:51 /cvmfs/software.eessi.io/host_injections -> /tmp/opt/eessi

$ scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
Found ldconfig in the following locations:
- /sbin/ldconfig
- /usr/sbin/ldconfig
Using first version
Found NVIDIA GPU driver version 535.129.03
Found host CUDA version 12.2
Creating /cvmfs/software.eessi.io/host_injections/nvidia/x86_64/host (real path /tmp/opt/eessi/nvidia/x86_64/host) failed with:
 mkdir: cannot create directory ‘/cvmfs/software.eessi.io/host_injections’: File exists
ERROR: No write permissions to directory /cvmfs/software.eessi.io/host_injections/nvidia/x86_64/host

works fine if the target does exist:

$ mkdir /tmp/opt/eessi
$ scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
Found ldconfig in the following locations:
- /sbin/ldconfig
- /usr/sbin/ldconfig
Using first version
Found NVIDIA GPU driver version 535.129.03
Found host CUDA version 12.2
Downloading latest version of nvliblist.conf from Apptainer to /tmp/tmp.Ko6VDH8wev/nvliblist.conf
Host NVIDIA gpu drivers linked successfully for EESSI

The extra block of code with host_injections_target makes sure the target of the host_injections variant symlink is an existing directory.

If you don't have permissions, you'll also get a clearer error now (though the confusing one is still there):

Creating /opt/eessi (real path /opt/eessi) failed with:
 mkdir: cannot create directory ‘/opt/eessi’: Permission denied
Creating /cvmfs/software.eessi.io/host_injections/nvidia/x86_64/host (real path /opt/eessi/nvidia/x86_64/host) failed with:
 mkdir: cannot create directory ‘/cvmfs/software.eessi.io/host_injections’: File exists
ERROR: No write permissions to directory /cvmfs/software.eessi.io/host_injections/nvidia/x86_64/host

I've also added some extra informative messages, like mentioning GPU driver version that was found, etc., and muted the useless download progress output produced by curl by using --silent.

…target of host_injections directory is a non-existing directory
@boegel boegel added the 2023.06-software.eessi.io 2023.06 version of software.eessi.io label Dec 21, 2023
Copy link

eessi-bot bot commented Dec 21, 2023

Instance eessi-bot-mc-aws is configured to build:

  • arch x86_64/generic for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/generic for repo eessi-hpc.org-2023.06-software
  • arch x86_64/generic for repo eessi.io-2023.06-compat
  • arch x86_64/generic for repo eessi.io-2023.06-software
  • arch x86_64/intel/haswell for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi-hpc.org-2023.06-software
  • arch x86_64/intel/haswell for repo eessi.io-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi.io-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi.io-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi.io-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi.io-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi.io-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi.io-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi.io-2023.06-software
  • arch aarch64/generic for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/generic for repo eessi-hpc.org-2023.06-software
  • arch aarch64/generic for repo eessi.io-2023.06-compat
  • arch aarch64/generic for repo eessi.io-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi-hpc.org-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi.io-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi.io-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi-hpc.org-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi.io-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi.io-2023.06-software

@boegel boegel requested a review from ocaisa December 21, 2023 19:57
ocaisa
ocaisa previously approved these changes Dec 21, 2023
@ocaisa
Copy link
Member

ocaisa commented Dec 21, 2023

bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/skylake_avx512

Copy link

eessi-bot bot commented Dec 21, 2023

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Dec 21, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-skylake_avx512 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2742

date job status comment
Dec 21 20:58:03 UTC 2023 submitted job id 2742 awaits release by job manager
Dec 21 20:58:45 UTC 2023 released job awaits launch by Slurm scheduler
Dec 21 21:03:46 UTC 2023 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-2742.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1703192603.tar.gzsize: 0 MiB (111853 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/intel/skylake_avx512/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/intel/skylake_avx512/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/intel/skylake_avx512
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 21 21:03:46 UTC 2023 test result (no tests yet)

@ocaisa
Copy link
Member

ocaisa commented Dec 21, 2023

@boegel So the failure is probably because there is no easystack change in the PR so EasyBuild never gets called, we should add some kind of or clause when no yaml is touched that just prints "No missing installations"

…. Relevant for PRs that e.g. only update things in .../scripts
@casparvl
Copy link
Collaborator

bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/skylake_avx512

Copy link

eessi-bot bot commented Dec 22, 2023

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-skylake_avx512 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2791

date job status comment
Dec 22 10:40:15 UTC 2023 submitted job id 2791 awaits release by job manager
Dec 22 10:40:52 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:45:53 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2791.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1703241926.tar.gzsize: 0 MiB (111854 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/intel/skylake_avx512/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/intel/skylake_avx512/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/intel/skylake_avx512
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:45:53 UTC 2023 test result (no tests yet)

ocaisa
ocaisa previously approved these changes Dec 22, 2023
@ocaisa
Copy link
Member

ocaisa commented Dec 22, 2023

bot: build repo:eessi.io-2023.06-software arch:x86_64/generic
bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/haswell
bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/skylake_avx512
bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2
bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3
bot: build repo:eessi.io-2023.06-software arch:aarch64/generic
bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_n1
bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1

Copy link

eessi-bot bot commented Dec 22, 2023

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2792

date job status comment
Dec 22 10:50:45 UTC 2023 submitted job id 2792 awaits release by job manager
Dec 22 10:50:56 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:56:13 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2792.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-generic-1703242529.tar.gzsize: 0 MiB (111501 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/generic/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/generic
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:56:13 UTC 2023 test result (no tests yet)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-haswell for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2793

date job status comment
Dec 22 10:50:49 UTC 2023 submitted job id 2793 awaits release by job manager
Dec 22 10:50:58 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:56:15 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2793.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-haswell-1703242557.tar.gzsize: 0 MiB (111680 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/intel/haswell/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/intel/haswell/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/intel/haswell
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:56:15 UTC 2023 test result (no tests yet)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-skylake_avx512 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2794

date job status comment
Dec 22 10:50:53 UTC 2023 submitted job id 2794 awaits release by job manager
Dec 22 10:51:00 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:52:11 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2794.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1703242267.tar.gzsize: 0 MiB (111854 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/intel/skylake_avx512/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/intel/skylake_avx512/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/intel/skylake_avx512
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:52:11 UTC 2023 test result (no tests yet)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2795

date job status comment
Dec 22 10:50:57 UTC 2023 submitted job id 2795 awaits release by job manager
Dec 22 10:52:08 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:59:24 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2795.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1703242738.tar.gzsize: 0 MiB (111517 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:59:24 UTC 2023 test result (no tests yet)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2796

date job status comment
Dec 22 10:51:01 UTC 2023 submitted job id 2796 awaits release by job manager
Dec 22 10:52:10 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:59:26 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2796.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1703242737.tar.gzsize: 0 MiB (111519 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen3/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen3
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:59:26 UTC 2023 test result (no tests yet)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2797

date job status comment
Dec 22 10:51:05 UTC 2023 submitted job id 2797 awaits release by job manager
Dec 22 10:52:02 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:57:18 UTC 2023 running job 2797 is running
Dec 22 10:58:19 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2797.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1703242649.tar.gzsize: 0 MiB (111080 bytes)
entries: 7
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/generic
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:58:19 UTC 2023 test result (no tests yet)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_n1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2798

date job status comment
Dec 22 10:51:09 UTC 2023 submitted job id 2798 awaits release by job manager
Dec 22 10:52:04 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:58:21 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2798.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_n1-1703242679.tar.gzsize: 0 MiB (111239 bytes)
entries: 7
modules under 2023.06/software/linux/aarch64/neoverse_n1/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/neoverse_n1/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/neoverse_n1
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:58:21 UTC 2023 test result (no tests yet)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2799

date job status comment
Dec 22 10:51:13 UTC 2023 submitted job id 2799 awaits release by job manager
Dec 22 10:52:06 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 10:58:22 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2799.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_v1-1703242678.tar.gzsize: 0 MiB (111241 bytes)
entries: 7
modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/neoverse_v1/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/neoverse_v1
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 10:58:22 UTC 2023 test result (no tests yet)

@ocaisa ocaisa added the bot:deploy Ask bot to deploy missing software installations to EESSI label Dec 22, 2023
EESSI-install-software.sh Outdated Show resolved Hide resolved
@casparvl
Copy link
Collaborator

bot: build repo:eessi.io-2023.06-software arch:x86_64/generic
bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/haswell
bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/skylake_avx512
bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2
bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3
bot: build repo:eessi.io-2023.06-software arch:aarch64/generic
bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_n1
bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1

Copy link

eessi-bot bot commented Dec 22, 2023

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2800

date job status comment
Dec 22 13:55:18 UTC 2023 submitted job id 2800 awaits release by job manager
Dec 22 13:55:48 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:01:03 UTC 2023 running job 2800 is running
Dec 22 14:02:09 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2800.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-generic-1703253661.tar.gzsize: 0 MiB (111504 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/generic/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/generic
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:02:09 UTC 2023 test result (no tests yet)
Dec 22 14:35:04 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-generic-1703253661.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-haswell for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2801

date job status comment
Dec 22 13:55:22 UTC 2023 submitted job id 2801 awaits release by job manager
Dec 22 13:55:50 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:02:11 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2801.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-haswell-1703253718.tar.gzsize: 0 MiB (111681 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/intel/haswell/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/intel/haswell/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/intel/haswell
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:02:11 UTC 2023 test result (no tests yet)
Dec 22 14:35:22 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-haswell-1703253718.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-skylake_avx512 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2802

date job status comment
Dec 22 13:55:26 UTC 2023 submitted job id 2802 awaits release by job manager
Dec 22 13:55:52 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:03:12 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2802.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1703253776.tar.gzsize: 0 MiB (111853 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/intel/skylake_avx512/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/intel/skylake_avx512/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/intel/skylake_avx512
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:03:12 UTC 2023 test result (no tests yet)
Dec 22 14:35:40 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1703253776.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2803

date job status comment
Dec 22 13:55:30 UTC 2023 submitted job id 2803 awaits release by job manager
Dec 22 13:55:44 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:00:59 UTC 2023 running job 2803 is running
Dec 22 14:02:06 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2803.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1703253660.tar.gzsize: 0 MiB (111522 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:02:06 UTC 2023 test result (no tests yet)
Dec 22 14:35:59 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-1703253660.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2804

date job status comment
Dec 22 13:55:33 UTC 2023 submitted job id 2804 awaits release by job manager
Dec 22 13:55:46 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:01:01 UTC 2023 running job 2804 is running
Dec 22 14:02:07 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2804.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1703253659.tar.gzsize: 0 MiB (111521 bytes)
entries: 7
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen3/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen3
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:02:07 UTC 2023 test result (no tests yet)
Dec 22 14:36:18 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen3-1703253659.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2805

date job status comment
Dec 22 13:55:38 UTC 2023 submitted job id 2805 awaits release by job manager
Dec 22 13:55:42 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:01:04 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2805.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1703253632.tar.gzsize: 0 MiB (111082 bytes)
entries: 7
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/generic
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:01:04 UTC 2023 test result (no tests yet)
Dec 22 14:36:36 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-aarch64-generic-1703253632.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_n1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2806

date job status comment
Dec 22 13:55:41 UTC 2023 submitted job id 2806 awaits release by job manager
Dec 22 13:56:54 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:05:15 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2806.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_n1-1703253902.tar.gzsize: 0 MiB (111240 bytes)
entries: 7
modules under 2023.06/software/linux/aarch64/neoverse_n1/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/neoverse_n1/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/neoverse_n1
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:05:15 UTC 2023 test result (no tests yet)
Dec 22 14:36:54 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-aarch64-neoverse_n1-1703253902.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Dec 22, 2023

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2023.12/pr_437/2807

date job status comment
Dec 22 13:55:45 UTC 2023 submitted job id 2807 awaits release by job manager
Dec 22 13:56:56 UTC 2023 released job awaits launch by Slurm scheduler
Dec 22 14:05:17 UTC 2023 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-2807.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_v1-1703253900.tar.gzsize: 0 MiB (111242 bytes)
entries: 7
modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/neoverse_v1/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/neoverse_v1
2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Dec 22 14:05:17 UTC 2023 test result (no tests yet)
Dec 22 14:37:13 UTC 2023 uploaded transfer of eessi-2023.06-software-linux-aarch64-neoverse_v1-1703253900.tar.gz to S3 bucket succeeded

@casparvl casparvl added bot:deploy Ask bot to deploy missing software installations to EESSI and removed bot:deploy Ask bot to deploy missing software installations to EESSI labels Dec 22, 2023
@boegel
Copy link
Contributor Author

boegel commented Dec 22, 2023

@ocaisa @casparvl Ready to merge, since it's deployed:

$ date
Fri Dec 22 18:36:35 CET 2023

$ grep host_injections_target /cvmfs/software.eessi.io/versions/2023.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
host_injections_target=$(realpath -m ${EESSI_CVMFS_REPO}/host_injections)
if [ ! -d ${host_injections_target} ]; then
    create_directory_structure ${host_injections_target}

@casparvl casparvl merged commit 44b563c into EESSI:2023.06-software.eessi.io Dec 22, 2023
33 checks passed
@boegel boegel deleted the 2023.06-software.eessi.io_link_nvidia_host_libraries_fix branch December 22, 2023 20:48
TopRichard added a commit to TopRichard/bot-software-layer1 that referenced this pull request Jul 26, 2024
….1-gompi/2023a

{2023.06}[gompi/2023a] amdahl V0.3.1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2023.06-software.eessi.io 2023.06 version of software.eessi.io accel:nvidia bot:deploy Ask bot to deploy missing software installations to EESSI
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants