Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] Operator controller manager Pod OOMKilled and crash looping #716

Open
andrewazores opened this issue Jan 26, 2024 · 3 comments
Open
Labels
bug Something isn't working needs-triage Needs thorough attention from code reviewers

Comments

@andrewazores
Copy link
Member

Current Behavior

  containers:
    - resources:
        limits:
          cpu: '1'
          memory: 256Mi
        requests:
          cpu: 100m
          memory: 64Mi
....
lastState:
        terminated:
          exitCode: 137
          reason: OOMKilled
          startedAt: '2024-01-26T08:36:59Z'
          finishedAt: '2024-01-26T08:37:24Z'

operator-pod.log

Are the manager's resource limits hardcoded? Should this be increased by default and is there a way to make this configurable for the user?

Expected Behavior

No response

Steps To Reproduce

No response

Environment

No response

Anything else?

No response

@andrewazores andrewazores added bug Something isn't working needs-triage Needs thorough attention from code reviewers labels Jan 26, 2024
@andrewazores
Copy link
Member Author

Users can/should do this via the OLM Subscription object as documented above if they need to adjust the values rather than rely on our defaults.

Question still remains whether we should adjust the defaults.

@ebaron
Copy link
Member

ebaron commented Feb 12, 2024

I think the subscription is the only way to do it at the bundle level. It's probably worth increasing the limit. Judging from the log, it didn't get very far. Maybe synchronizing the client cache on a cluster with many objects did it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working needs-triage Needs thorough attention from code reviewers
Projects
No open projects
Status: No status
Development

No branches or pull requests

2 participants