-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kill messages from cron #265
Comments
Hi @oddnoc, thanks for raising this issue! Can you confirm whether this has been occurring for a while, in previous versions of SilverStripe CMS / Queued Jobs, or if it has just started after an update? |
I'm not sure when this started, sorry. But I did manage to capture a snapshot of what's happening. I have a vagrant box with zfs, so I can roll the database back at will. Using this platform, I captured the output of ps as fast as possible whilst running the queue. The command I ran, with its output:
I was capturing
Additional version info:
In this environment, the message from kill happens only when a very specific set of QueuedJobDescriptor records exists in the database. Deleting any one of these records prevents the message from appearing. On this box with this set of records, the issue is 100% reproducible. I suspect this is a race condition, because it's happening intermittently in production with quite a different set of records.
|
We've started occasionally receiving more verbose output from kill: 25326: No such process |
As a first thing, I recommend upgrading to I couldn't reproduce the issue after applying these two fixes:
@oddnoc Could you please try reproduce the issue after applying those two fixes? |
@mfendeksilverstripe When I try to merge both fixes, I get conflicts. Could you resolve those into an integration branch? |
That's because one goes to |
I had to do more than just cherry-pick. See the pu branch on my fork. I have deployed that branch and will know whether it works in a few days, to allow enough time for some jobs to be run from cron. |
We got a result sooner than expected: [2020-06-16 22:40:01] error-log.DEBUG: [2020-06-16 22:40:01] - Queued Jobs - Failed to acquire job lock Failed to read job lock 854 {"file":"/var/silverstripe/live/releases/6/vendor/symbiote/silverstripe-queuedjobs/src/Services/QueuedJobService.php","line":762} [] |
This isn't necessarily an issue - see comments near the line where the exception gets throw:
This is why this message is debug level. Apart from this, have you noticed any issues? |
Since that one message, cron has been silent for days and queued jobs have been running normally. |
The ProcessJobQueueTask (run through cron) is occasionally but persistently sending emails saying
kill: 47765: No such process
(different process ID each time). This seems to happen only for the immediate queue. Sorry for the lack of details; I don't see in the code where the task even tries to invokekill
.Version info
The text was updated successfully, but these errors were encountered: