Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Expose additional metrics in prometheus #113

Open
grzesuav opened this issue Mar 19, 2020 · 1 comment
Open

Expose additional metrics in prometheus #113

grzesuav opened this issue Mar 19, 2020 · 1 comment
Labels

Comments

@grzesuav
Copy link
Collaborator

For example:

  • number of resync failures
@jesusvazquez
Copy link

👋 number of resync failures is a very specific and yet useful metric one could have to create an alert on in case there are resync errors.

A more generic approach would be something like the Observability RED method (Requests, Errors, Durations)

  • Requests per second
  • Errors per second
  • Durations per second

I'm still getting started with metacontroller but you can have many different controllers and it would be great to have such generic metrics for general observability and troubleshooting purposes.

I could create an alert on errors per second or percentile95 of durations so that I take a look in case any controller is failing or taking too long for whatever reason. That would be a starting point.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants