-
Notifications
You must be signed in to change notification settings - Fork 120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Running from cron no longer possible on Derecho #998
Comments
@mkavulich |
@SarahLu-NOAA It looks like the cron jobs have not yet been disabled (though this change is "imminent"), so this is likely unrelated to the issues you saw. |
@mkavulich In the chapters for running SRW, it explains how to run with or without cron. Do you think we should just add a note that on Derecho, people should use the methods to run without cron? Can't think what else would be needed, but feel free to suggest something! |
@mkavulich and @gspetro-NOAA |
Thanks for the input @SarahLu-NOAA ! |
We have recently gotten email notices that the old method of running with Cron on Derecho will be discontinued on February 11th. So this issue needs to be addressed soon. Possibly worth including logic so that |
Expected behavior
Previously the
USE_CRON_TO_RELAUNCH=true
option worked on all Tier-1 platforms (as far as I know).Current behavior
Due to a policy change with the new machine, there are now special procedures for setting up cron tables on Derecho. These procedures are not compatible with automatic modification due to needing to log in to a separate machine, so it is not feasible to support this mode of running the workflow automatically (
USE_CRON_TO_RELAUNCH=true
) on Derecho.Machines affected
Derecho
Steps To Reproduce
1. Generate an experiment with theUSE_CRON_TO_RELAUNCH=true
option on Derecho2. Observe that the workflow is not run.
So far, it looks like cron jobs do still work on Derecho. But I have been told by CISL that we need to migrate away from this system ASAP.
Detailed Description of Fix
Users guide run instructions will need to be updated with Derecho-specific instructions, as the crontab functionality is currently the recommended way to run the workflow.
Possible Implementation
One way to get around this would be to leverage the WE2E functionality currently present for running and monitoring experiments for general use. This would require some tweaking of the current setup to be more user-friendly outside of the WE2E context.
Output (optional)
Currently, the error message that appears in
log.launch_FV3LAM_wflow
is:But that may change in the future, as I was informed by CISL that the crontab functionality will stop working all together at some point.
The text was updated successfully, but these errors were encountered: