Unable to configure a SLURM cluster on a shared vpc using the google cloud hpc toolkit #1279
Closed
irishwithag91 started this conversation in General
-
We're looking at this issue. Could you please provide the version of the toolkit you were using (
-
Hello @irishwithag91, I wasn't able to reproduce this problem. Did you set proper
-
I attempted to create a SLURM cluster on a shared VPC using the following template:
https://github.com/GoogleCloudPlatform/hpc-toolkit/blob/main/community/examples/hpc-cluster-small-sharedvpc.yaml
I followed the prerequisites required for Filestore creation.
The login and control nodes were created successfully.
After SSHing into the login node and running the sinfo command, I get the following error:
[slurm-hpc-small-shared-vpc-login0 ~]$ sinfo
sinfo: error: resolve_ctls_from_dns_srv: res_nsearch error: Unknown host
sinfo: error: fetch_config: DNS SRV lookup failed
sinfo: error: _establish_config_source: failed to fetch config
sinfo: fatal: Could not establish a configuration source
Can someone please help me debug this further?
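One check that may help narrow this down, offered as a sketch rather than a diagnosis: as far as I understand, Slurm client commands such as sinfo only fall back to a DNS SRV lookup when they cannot find a local slurm.conf. The errors above could therefore mean that the login node has no configuration file in place, for example because its setup did not finish. The short Python script below looks for a config file. The two candidate paths are assumptions (the real location depends on how Slurm was installed on the image), and find_slurm_conf is a hypothetical helper name. SLURM_CONF is the standard environment variable for overriding the config file location.

```python
import os
from pathlib import Path

# Assumed candidate locations; adjust to wherever Slurm is installed on the image.
CANDIDATE_PATHS = ("/etc/slurm/slurm.conf", "/usr/local/etc/slurm/slurm.conf")


def find_slurm_conf(candidates=CANDIDATE_PATHS, environ=None):
    """Return the first existing slurm.conf path, or None if none is found.

    A path given in the SLURM_CONF environment variable is checked first.
    """
    environ = os.environ if environ is None else environ
    override = environ.get("SLURM_CONF")
    paths = ([override] if override else []) + list(candidates)
    for path in paths:
        if Path(path).is_file():
            return path
    return None


if __name__ == "__main__":
    found = find_slurm_conf()
    if found:
        print(f"Found config file: {found}")
    else:
        print("No slurm.conf found; sinfo would have to rely on a DNS SRV lookup.")
```

If no file is found on the login node, the next step would probably be to check whether the node's startup script completed and whether it can reach the controller over the shared VPC.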