Create metrics for instaslice #53

Open
kannon92 opened this issue Sep 5, 2024 · 5 comments

kannon92 commented Sep 5, 2024

We don't expose any metrics for this operator. We should identify metrics that operations teams would find useful for monitoring the health of this service, GPU placement, etc.

@asm582 asm582 added this to the v0.1 milestone Sep 11, 2024
@kannon92

@asm582 I think we should consider this a must for the release. Can you lead this effort? I don't understand well enough what an admin would want to see from this project for observability.

tardieu commented Nov 25, 2024

  • Track GPU slice allocations: first, which slots are in use vs. free on each GPU; second, which slices are in use (i.e., how slots are fused).
  • Track GPU slice requests from pending pods.
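
A minimal sketch of how these two items could translate into gauges, assuming a slice is a run of contiguous slots on one GPU. The `Slice` type, the metric names, and the default of 7 slots per GPU are all hypothetical and not taken from the operator; the output only imitates the Prometheus text exposition style.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Slice:
    """Hypothetical allocated slice: `size` contiguous slots starting at `start`."""
    gpu: str
    start: int
    size: int


def slot_usage(slices, slots_per_gpu):
    """Return {gpu: [bool, ...]}, where True marks a slot that is in use."""
    usage = {}
    for s in slices:
        slots = usage.setdefault(s.gpu, [False] * slots_per_gpu)
        for i in range(s.start, s.start + s.size):
            if i >= slots_per_gpu:
                raise ValueError(f"slice on {s.gpu} exceeds {slots_per_gpu} slots")
            if slots[i]:
                raise ValueError(f"slot {i} on {s.gpu} is allocated twice")
            slots[i] = True
    return usage


def render_metrics(slices, pending_requests, slots_per_gpu=7):
    """Render used/free slots, fused slice sizes, and pending requests as text."""
    usage = slot_usage(slices, slots_per_gpu)
    lines = []
    for gpu in sorted(usage):
        used = sum(usage[gpu])
        # Hypothetical metric names: slots in use vs. free on each GPU.
        lines.append(f'instaslice_gpu_slots_used{{gpu="{gpu}"}} {used}')
        lines.append(f'instaslice_gpu_slots_free{{gpu="{gpu}"}} {slots_per_gpu - used}')
    for s in sorted(slices, key=lambda s: (s.gpu, s.start)):
        # One sample per slice shows how slots are fused (start index and size).
        lines.append(f'instaslice_gpu_slice_size{{gpu="{s.gpu}",start="{s.start}"}} {s.size}')
    # Number of slice requests coming from pods that are still pending.
    lines.append(f"instaslice_pending_slice_requests {pending_requests}")
    return "\n".join(lines)
```

Calling `render_metrics([Slice("gpu-0", 0, 2), Slice("gpu-0", 4, 1)], pending_requests=3)` would report three used and four free slots on `gpu-0`, plus one sample per slice.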

harche commented Nov 25, 2024

asm582 commented Nov 25, 2024

  • Expose the allocations present in the InstaSlice object to users
  • Expose ConfigMap info to the admin persona
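
As a rough illustration of the first item, the sketch below counts allocations by profile and status from a dictionary shaped like a custom resource. The `spec.allocations` layout, the `profile` and `status` field names, and the metric name are assumptions made for this example and may not match the actual InstaSlice schema.

```python
from collections import Counter


def allocation_counts(instaslice):
    """Count allocations by (profile, status) in an InstaSlice-like dict.

    Assumes a hypothetical layout: spec.allocations maps an identifier to a
    dict with "profile" and "status" keys. Missing keys become "unknown".
    """
    allocations = instaslice.get("spec", {}).get("allocations", {})
    counts = Counter()
    for alloc in allocations.values():
        key = (alloc.get("profile", "unknown"), alloc.get("status", "unknown"))
        counts[key] += 1
    return counts


def render_allocation_metrics(counts):
    """Render the counts as gauge lines in Prometheus text exposition style."""
    return [
        f'instaslice_allocations{{profile="{profile}",status="{status}"}} {n}'
        for (profile, status), n in sorted(counts.items())
    ]
```

Labeling by profile and status keeps the number of series small; per-pod labels would give users more detail at the cost of higher cardinality.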
