-
Notifications
You must be signed in to change notification settings - Fork 912
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
High CPU Usage when using Falco v0.28.0 and higher #2996
Comments
ei @Junia18 thank you for reporting! I see that in Falco 0.36.2 you have the k8s metadata enrichment enabled. This unfortunately could cause huge drops... not sure which deployment method you are using, but the k8s enrichment could be disabled by removing the helm install falco falcosecurity/falco --set collectors.kubernetes.enabled=false Could you disable it and retry please? |
@Andreagit97 I have removed the
Is there any other solution I can try? |
Hi @Junia18, could you try the new For more info please see the following issue: #2973 Let me know if you need any help! |
@alacuku I was wondering if there's any information available about the anticipated release date for Falco v0.37.0. I appreciate your time and assistance. |
The expected release date for Falco 0.37 is the end of January 2024. |
Issues go stale after 90d of inactivity. Mark the issue as fresh with Stale issues rot after an additional 30d of inactivity and eventually close. If this issue is safe to close now please do so with Provide feedback via https://github.com/falcosecurity/community. /lifecycle stale |
/remove-lifecycle stale |
@Andreagit97 Despite upgrading to Falco v0.37.1, we continue to experience high CPU usage, accompanied by frequent syscall event drops in the logs. CPU usage after upgrading Falco to v0.37.1
Logs
Running the Falco binary using the following command: New Environment:
|
@Andreagit97 Just a gentle reminder that I am waiting for your input on this issue for over two weeks now. Any updates or insights you can provide would be greatly appreciated. Thanks! |
Hi 👋 @Junia18 happy to also help since Andrea is constantly working on many areas. Between v0.28 and now a lot changed. Recommending to follow our Troubleshooting guides https://falco.org/docs/troubleshooting/dropping/ and explore the In your original comment I see that you are showing metrics from different clusters #2996 (comment) -- unfortunately it's hard to make truly fair comparisons and looking at the CPU usages it does not appear to be a huge delta. Perhaps also checkout this paragraph https://falco.org/docs/metrics/performance/#server-load-and-falco-event-drops stating that Falco's usage is never constant. In case you are interested in a more fair comparison: would you consider running the different Falco versions on the exact same nodes plus also export our internal metrics (https://falco.org/docs/metrics/falco-metrics/) just so we can check if event rates etc were similar. Please let us know if the guides are helpful. |
^ small correction: #2996 (comment) the internal metrics didn't yet exist in v0.28 so we can't make the more detailed comparisons. |
ei @Junia18 as suggested by @incertum i would use the Looking at your actual situation, you are tracing 70 syscalls
With
It's enough to provide Falco with |
Issues go stale after 90d of inactivity. Mark the issue as fresh with Stale issues rot after an additional 30d of inactivity and eventually close. If this issue is safe to close now please do so with Provide feedback via https://github.com/falcosecurity/community. /lifecycle stale |
Stale issues rot after 30d of inactivity. Mark the issue as fresh with Rotten issues close after an additional 30d of inactivity. If this issue is safe to close now please do so with Provide feedback via https://github.com/falcosecurity/community. /lifecycle rotten |
We experience the same issue on a number of Ubuntu 22.04 VMs using falco 0.37.1. |
Rotten issues close after 30d of inactivity. Reopen the issue with Mark the issue as fresh with Provide feedback via https://github.com/falcosecurity/community. |
@poiana: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
I am encountering a critical issue while running Falco v0.36.2 as a daemonset on an Openshift cluster (OCP v4.12) with three worker nodes. The problem manifests as a substantial drop rate of events and a notable doubling of CPU usage across all three worker nodes.
This behavior is particularly concerning as it impacts the effective functioning of Falco and the overall performance of the cluster. The kernel probe is being utilized in this configuration.
Steps to Reproduce:
CPU usage before (when using Falco versions upto v0.26.2)
CPU usage after (when using Falco versions > v0.27.0)
Logs
Environment
Falco version: 0.36.2
System info:
"machine": "x86_64",
"release": "4.18.0-372.73.1.el8_6.x86_64",
"sysname": "Linux",
"version": "Digwatch compiler #1 SMP Fri Sep 8 13:16:27 EDT 2023"
Cloud provider or hardware configuration:
OS:
NAME="Red Hat Enterprise Linux"
VERSION="8.8 (Ootpa)"
ID="rhel"
ID_LIKE="fedora"
VERSION_ID="8.8"
PLATFORM_ID="platform:el8"
PRETTY_NAME="Red Hat Enterprise Linux 8.8 (Ootpa)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:redhat:enterprise_linux:8::baseos"
Kernel:
Linux 4.18.0-372.73.1.el8_6.x86_64 Digwatch compiler #1 SMP Fri Sep 8 13:16:27 EDT 2023 x86_64 x86_64 x86_64 GNU/Linux
Installation method: from source
The text was updated successfully, but these errors were encountered: