Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create new cluster for each job execution #1500

Closed
wants to merge 1 commit into from

Conversation

psschwei
Copy link
Collaborator

@psschwei psschwei commented Sep 24, 2024

Summary

Create a new RayCluster each time we execute a job.

Details and comments

This will be better for resource management, as each job will get exactly the resources requested. Previously, if a user submitted two jobs they would run in the same cluster and sharing resources, so the resource requests wouldn't necessarily be honored exactly.

Plus, this will also be the default behavior after migrating to RayJobs (#1337), so its not something we weren't planning on doing eventually anyway.

@psschwei
Copy link
Collaborator Author

We don't necessarily need this now, so will leave it up to reviewers on whether to merge this or close it.

@psschwei
Copy link
Collaborator Author

Actually, the docker test failure here seems legit, and since I think this falls into the "nice to have" bucket right now, I don't think it makes sense to spend the time to iron that issue out right now.

@psschwei psschwei closed this Sep 24, 2024
@Tansito
Copy link
Member

Tansito commented Sep 24, 2024

The good thing at least is that right now in our cluster we are not allowing that:
https://github.com/Qiskit/qiskit-serverless/blob/main/charts/qiskit-serverless/charts/gateway/values.yaml#L45

We are using one here so good. Anyway I will open an issue around this, thanks Paul! 🙏

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants