-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Failed test][cmd/opampsupervisor] TestSupervisorStopsAgentProcessWithEmptyConfigMap constantly failed #36764
Comments
Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
I think I have found the reason for that - My recent PR (#35892) introduced a change to the opamp agent to listen for status updates for the individual components, and it seems like in the shutdown function of the opamp agent there is a channel that is not properly closed. I'm already working on a fix and should have a PR ready today |
…pen-telemetry#36754) <!--Ex. Fixing a bug - Describe the bug and how this fixes the issue. Ex. Adding a feature - Explain what this achieves.--> #### Description The changes introduced in open-telemetry#35892 seemed to have introduced some flakyness in the opampsupervisor e2e tests, as the shutdown of the opamp agent waits for the component health loop to end. Due to an unclosed channel within the opamp agent however, the agent does not properly shut down, and the supervisor runs into a timeout before ultimately sending a SIGKILL to the agent process. Closing the channel in the Shutdown method of the opamp extension fixes that and the agent gets shut down properly upon the reception of the SIGINT signal #### Link to tracking Issue: Fixes open-telemetry#36764 #### Testing This fixes the failing test mentioned in the issue (open-telemetry#36764) --------- Signed-off-by: Florian Bacher <[email protected]>
…pen-telemetry#36754) <!--Ex. Fixing a bug - Describe the bug and how this fixes the issue. Ex. Adding a feature - Explain what this achieves.--> #### Description The changes introduced in open-telemetry#35892 seemed to have introduced some flakyness in the opampsupervisor e2e tests, as the shutdown of the opamp agent waits for the component health loop to end. Due to an unclosed channel within the opamp agent however, the agent does not properly shut down, and the supervisor runs into a timeout before ultimately sending a SIGKILL to the agent process. Closing the channel in the Shutdown method of the opamp extension fixes that and the agent gets shut down properly upon the reception of the SIGINT signal #### Link to tracking Issue: Fixes open-telemetry#36764 #### Testing This fixes the failing test mentioned in the issue (open-telemetry#36764) --------- Signed-off-by: Florian Bacher <[email protected]>
Component(s)
cmd/opampsupervisor
Describe the issue you're reporting
E.g. https://github.com/open-telemetry/opentelemetry-collector-contrib/actions/runs/12257185021/job/34195668020
The text was updated successfully, but these errors were encountered: