-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Metricbeat windows service metrics stops sending documents when a single service fails #40765
Comments
Pinging @elastic/elastic-agent-data-plane (Team:Elastic-Agent-Data-Plane) |
@VihasMakwana I think I saw you had root caused the source of the |
@cmacknz yes, that's correct. On my personal desktop, the metricbeat wasn't able to access following processes, running as root:
This was for @TheRiffRafi do you see any warning related to SeDebugPrivilege at the beginning of logs? |
Hello @VihasMakwana! Unfortunately I can't help with logs, all the instances I have of the failure have the logs with the problem already started, there is no instance of this where we've caught it in a state where the issue is not occurring and then suddenly starts happening (the systems are going weeks without reporting the service). Also, I have to make a correction on the original description, we have only seen this on 8.10.4, we haven't tested on a more recent version as the entire stack for the user is still on 8.10.4, it was a misunderstanding that we had seen this problem on a later version. |
No clear steps to reproduce, more info on this later.
Multiple instances of elastic-agent installations are failing to send the windows.service metric set for the windows integration. The system integration continues to send data without issues. The problem happens at random and it is resolved by restarting the elastic agent.
The issue happens in different versions of 8.x for elastic-agent and it hasn't confirmed as occurring on the latest version (as the user who has experienced this has not upgraded to latest version yet).The issue so far has only been seen on 8.10.4The error reported by metricbeat is the following:
So far the error indicates a problem only with one particular windows service, however, all other services being monitored by metricbeat can't continue to be monitored because this particular service getting in an unexpected state causes the entire metricbeat windows service metricset to stop reporting for any service.
Because this happens at random we are unable to setup debug logging to catch the failure and the logger for this function is not providing any more info.
We need to address 2 items with this issue:
The text was updated successfully, but these errors were encountered: