You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
"Zombie Jobs" are jobs that have workspaces in MinIO but are not owned by any eda instances. They are rare occurrences resulting from exceptions thrown by MinIO itself. These jobs 'appear' to be running but are dead.
Presently, these cannot be removed or expired, and must be deleted manually.
Proposal
To enable us to detect and clear these jobs we will add 2 new endpoints.
An admin endpoint that lists jobs that appear to currently be in progress.
An admin endpoint that purges a job workspace from MinIO.
With these endpoints in combination with the existing internal-jobs endpoint we can:
list jobs that appear to be running (actual running jobs will be in this list as well)
use the internal jobs endpoint on both campuses to check for job ownership on each 'running' job
If neither campus knows about the job, it is a zombie and we can safely make a request to the delete endpoint to wipe the workspace
The text was updated successfully, but these errors were encountered:
Problem
"Zombie Jobs" are jobs that have workspaces in MinIO but are not owned by any eda instances. They are rare occurrences resulting from exceptions thrown by MinIO itself. These jobs 'appear' to be running but are dead.
Presently, these cannot be removed or expired, and must be deleted manually.
Proposal
To enable us to detect and clear these jobs we will add 2 new endpoints.
With these endpoints in combination with the existing internal-jobs endpoint we can:
The text was updated successfully, but these errors were encountered: