Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Set up K8S monitoring for dev cluster #325

Merged
merged 1 commit into from
Apr 5, 2022
Merged

Conversation

masih
Copy link
Member

@masih masih commented Apr 5, 2022

Context

Set up stack to monitor the K8S cluster, worker nodes and extra K8S metrics. Add pod monitor for indexer instance.

Proposed Changes

Install kube-prometheus stack which gives us:

  • prometheus operator
  • grafana
  • node exporter
  • adapter for k8s metrics
  • kube-state-metrics

Set up monitoring for pods belonging to the indexer running in dev cluster.

This PR also creates an AWS managed prometheus along with necessary roles to interact with it. But the remote_write is not hooked up because, the version of operator installed does not support SigV4 authentication. K8S cluster needs to be upgraded for us to install a newer version, and that will be done in future PRs. For now, this PR installs a grafana instance accessible locally via port forwarding that lets us make progress while we figure out how to hook up things to PL grafana.

Unrelated to context: while at it, give Kyle access to interact with encrypted secrets like the rest of the team.

Tests

Deployed manually for testing.

Revert Strategy

git revert then terraform apply

@masih masih requested a review from MarcoPolo April 5, 2022 16:05
deploy/infrastructure/dev/us-east-2/kms.tf Outdated Show resolved Hide resolved
Deploy K8S monitoring stack, including node-exporter,
kube-metrics-server, grafana and prometheus operator.
@masih masih force-pushed the masih/deploy-dev-monitoring branch from 6e75bfa to c44f400 Compare April 5, 2022 16:53
@masih masih merged commit e024c22 into main Apr 5, 2022
@masih masih deleted the masih/deploy-dev-monitoring branch April 5, 2022 19:19
@masih masih mentioned this pull request Apr 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants