Skip to content
This repository has been archived by the owner on Feb 7, 2024. It is now read-only.

Expand Prometheus metrics #426

Open
11 tasks
mgrabovsky opened this issue May 26, 2021 · 0 comments
Open
11 tasks

Expand Prometheus metrics #426

mgrabovsky opened this issue May 26, 2021 · 0 comments
Labels
enhancement feature:metrics Features and bugs related to the metrics and monitoring subsystem.

Comments

@mgrabovsky
Copy link
Contributor

mgrabovsky commented May 26, 2021

As a follow-up to #419, it would be nice to expand the list of exposed metrics in the future. Some useful ones that come to mind are (roughly in order of descending priority):

  • Task duration summaries (min, max, quantiles) over some reasonable window
  • Number of tasks (failed, successful) per operating system release
  • Backtrace rating histogram, i.e. the number of successful tasks that produced a backtrace with rating 0..4. (This would require calling abrt-action-analyze-backtrace on the result.)
  • CPU load (1/5/15 min)
  • Memory usage
  • httpd stats
  • Time until TLS certificate expires
  • Occurrence of common retracing errors
  • Server uptime
  • Retrace Server version
  • Uploaded coredump size summaries

It would also be nice to split some of the metrics by operating system (e.g. tasks failed for Fedora 34, tasks failed for CentOS 8 etc.).

@mgrabovsky mgrabovsky added the feature:metrics Features and bugs related to the metrics and monitoring subsystem. label Jun 23, 2021
@mgrabovsky mgrabovsky added this to the 2.0.0 milestone Jan 17, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement feature:metrics Features and bugs related to the metrics and monitoring subsystem.
Projects
None yet
Development

No branches or pull requests

1 participant