-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Provide troubleshooting guidance, when "TASK [core/cluster : cluster | Create new cluster]" hanging #581
Comments
Thanks @troppens . agree. we have a pre-check in the cli toolkit though mmnetverify. we were discussing if we can validate ansible host inventory through mmnetverify but mmnetveify is internal tools that we can't use here as its not open source tool. We can add this info in the README , so that user can run manual mmnetverify before starting the scale cluster creation. I will also check if we can add some ssh ansible module to validate host inventory. Thanks |
Regarding ssh, I believe that this is required from admin node to all other nodes, but not from each node to all other nodes. From an ease-of-use perspective it might be desired to have ssh from any node to any other node. This would be good for new users just get started with Spectrum Scale, e.g. for evaluation or demo. From an production perspective it might be desired to restrict ssh to improve security. |
A few thoughts on this one:
@rajan-mis: Wouldn't it make sense to change the default of these to |
I missed the prep roles too, although I know that ssh must be configured. I was curious to see what breaks when I work with the OS and the role defaults ;-) I would not change ssh settings per default because they impact security. I believe it is sufficient to do a quick check on the node which executes |
Describe the bug
I provisioned three VMs on virtual infrastructure and tried to create a three-node Spectrum Scale cluster.
The following step was hanging for an hour or so:
I added a debug message to the core/cluster.yml:
In the next run of the playbook it gave me a hint:
So I tried this command without Ansible:
Ah. SSH is not set up properly. For a new user this is not easy to determine, although this is mentioned in the README.
To improve usability, it would be good to have an additional check in the role for ssh connectivity to make the troubleshooting easier for new users.
I have also considered to have a section Troubleshooting in the README, though a check in the role would be preferred.
To Reproduce
Steps to reproduce the behavior:
ansible-playbook -i hosts playbook.yml
Expected behavior
Described above.
Environment
Please run the following an paste your output here:
The text was updated successfully, but these errors were encountered: