-
Notifications
You must be signed in to change notification settings - Fork 628
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add timeout on scheduling wait #3736
Conversation
modules/nextflow/src/main/groovy/nextflow/script/ProcessConfig.groovy
Outdated
Show resolved
Hide resolved
modules/nextflow/src/main/groovy/nextflow/processor/TaskRun.groovy
Outdated
Show resolved
Hide resolved
0d59b4c
to
b93634e
Compare
81f7cb7
to
8a43489
Compare
Signed-off-by: Paolo Di Tommaso <[email protected]>
Signed-off-by: Paolo Di Tommaso <[email protected]>
✅ Deploy Preview for nextflow-docs-staging ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
Think this is ready to go. @bentsherman please give a try to it |
As I understand it, submit timeout errors are limited by |
Yes, exactly |
It seems like an odd choice. I would expect either two separate limits for submit retries and regular retries (e.g. Also, maybe we should call it |
Signed-off-by: Paolo Di Tommaso <[email protected]>
f5ea4b0
to
03836d7
Compare
I see your point, at the same time the submit retry requires setting It could make sense to use a separate setting, but in that case, it should be independent of the Let's move forward as "experimental", collect some user feedback and refine it later.
Agreed |
This commit adds a new directive named `maxSubmitAwait` that allows defining how long a task can be in a pending status waiting to be scheduled for execution by the underlying executor. When the timeout is reached, an execution error is returned. The usual `errorStrategy` directive can be used to control how handle the error condition and, if required, attempt a new job submission. The `task.submitAttempt` attribute can be used to determine which submitted attempt failed. Signed-off-by: Paolo Di Tommaso <[email protected]>
This PR adds a new directive
maxAwait
that allows controlling how long a task can be in submission (pending execution) state, elapsed that time a failure is reported.This, along with the use of dynamic directives evaluation and the
task.submitAttempt
attribute make it possible to attempt the execution of the task to a different queue or set of resources requirement.