Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix bot config: add required job_result_unknown_fmt + use 16 cores for jobs #268

Merged
merged 2 commits into from
Jun 20, 2023

Conversation

boegel
Copy link
Contributor

@boegel boegel commented Jun 20, 2023

Without this, the job manager crashes - required because of changes in EESSI/eessi-bot-software-layer#174

@eessi-bot
Copy link

eessi-bot bot commented Jun 20, 2023

Instance eessi-bot-citc-aws is configured to build:

  • arch x86_64/generic for repo eessi-2021.12
  • arch x86_64/generic for repo eessi-2023.06-compat
  • arch x86_64/generic for repo eessi-2023.06-software
  • arch x86_64/intel/haswell for repo eessi-2021.12
  • arch x86_64/intel/haswell for repo eessi-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi-2021.12
  • arch x86_64/intel/skylake_avx512 for repo eessi-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi-2021.12
  • arch x86_64/amd/zen2 for repo eessi-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi-2021.12
  • arch x86_64/amd/zen3 for repo eessi-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi-2023.06-software
  • arch aarch64/generic for repo eessi-2021.12
  • arch aarch64/generic for repo eessi-2023.06-compat
  • arch aarch64/generic for repo eessi-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi-2021.12
  • arch aarch64/neoverse_n1 for repo eessi-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi-2021.12
  • arch aarch64/neoverse_v1 for repo eessi-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi-2023.06-software

@boegel boegel changed the title add required job_result_unknown_fmt bot config setting fix bot config: add required job_result_unknown_fmt + use 16 cores for jobs Jun 20, 2023
Copy link
Collaborator

@trz42 trz42 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good. Maybe you want to add more instances or someone wonders why a job is not starting. There are fewer nodes for the larger instances configured.

If we want to support multiple jobs per node, we probably need to add --mem= and also make local_tmp = /tmp/$USER/EESSI job-specific. The latter could be achieved by adding /$SLURM_JOB_ID.

@trz42 trz42 merged commit eef18e9 into EESSI:2023.06 Jun 20, 2023
@boegel boegel deleted the 2023.06_bot_cfg branch June 20, 2023 19:42
@boegel
Copy link
Contributor Author

boegel commented Jun 20, 2023

We can easily scale up extra *.4xlarge nodes on the fly in the CitC cluster. I'll make sure there's 2 of each *.4xlarge type

trz42 pushed a commit to trz42/software-layer that referenced this pull request Feb 3, 2024
…lotlib-2023b

{2023.06}[foss/2023b] SciPy-bundle/2023.11, netCDF/4.9.2 and matplotlib/3.8.2
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants