Cluster Monitoring Operator

The Cluster Monitoring Operator manages and updates the Prometheus-based monitoring stack deployed on top of OpenShift.

It contains the following components:

Prometheus Operator
Prometheus
Alertmanager cluster for cluster and application level alerting
kube-state-metrics
node_exporter

The deployed Prometheus Operator is meant to be used by users to easily deploy new Prometheus setups for their application monitoring. The Prometheus instance (prometheus-k8s) is responsible for monitoring and alerting on cluster and OpenShift components; it should not be extended to monitor user applications. Alertmanager is a cluster-global component that handles alerts generated by all Prometheus instances deployed in the cluster.

Metrics are collected from the following components:

kube-state-metrics
node_exporter
Kubelets
API server
Prometheus (just prometheus-k8s for now)
Alertmanager

Important: The Prometheus Operator managed by the Cluster Monitoring Operator will by default only look for ServiceMonitor resources in namespaces containing an openshift.io/cluster-monitoring label (with any value).
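As an illustration of the labeling requirement above, a namespace could be labeled as in the following sketch. The namespace name example-component and the label value "true" are assumptions made for this example; only the label key comes from the text above.

```yaml
# Hypothetical namespace used for illustration only.
apiVersion: v1
kind: Namespace
metadata:
  name: example-component
  labels:
    # Per the note above, any value is accepted; "true" is an arbitrary choice.
    openshift.io/cluster-monitoring: "true"
```

ServiceMonitor resources in a namespace without this label would, by default, not be picked up by the managed Prometheus Operator.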

Contributing new component integrations

The Cluster Monitoring Operator has many builtin ServiceMonitor resources which enable discovering the metrics endpoints of a variety of well-known components.

To register a new builtin component, make the following changes:

Add a new ServiceMonitor manifest file to assets/prometheus-k8s following the existing prometheus-k8s-service-monitor-$COMPONENT.yaml naming convention.
Add a constant in pkg/manifests/manifests.go which points to the new manifest file.
Add a new Factory method in pkg/manifests/manifests.go which loads the manifest using the new constant.
Add a step to PrometheusTask in pkg/tasks/prometheus.go which creates the ServiceMonitor using the new Factory method.
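For the first step, a new ServiceMonitor manifest might look roughly like the sketch below. The component name example-component, its labels, the port name metrics, the scrape interval, and both namespaces are hypothetical; compare against the existing manifests in assets/prometheus-k8s for the conventions actually used.

```yaml
# Would live at assets/prometheus-k8s/prometheus-k8s-service-monitor-example-component.yaml
# All names, labels, and namespaces below are illustrative assumptions.
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: example-component
  namespace: openshift-monitoring   # assumed namespace of the monitoring stack
  labels:
    k8s-app: example-component
spec:
  # Select the Service that exposes the component's metrics endpoint.
  selector:
    matchLabels:
      k8s-app: example-component
  # Look for that Service in the component's own namespace.
  namespaceSelector:
    matchNames:
    - example-component
  endpoints:
  - port: metrics      # name of the Service port serving metrics
    interval: 30s
```

The remaining steps wire this file into the Go code so that the operator loads and creates it.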

To add a new builtin alerting rule:

Add a new Prometheus rules file to rules/k8s.
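A rules file could look like the following sketch, assuming the rules in rules/k8s use the Prometheus 2.x YAML rule group format; check the existing files in that directory before copying it. The group name, alert name, job label, duration, and annotation text are all hypothetical.

```yaml
# Hypothetical alerting rule; every name and threshold here is illustrative.
groups:
- name: example-component.rules
  rules:
  - alert: ExampleComponentDown
    # Fires when no target of the (assumed) job reports as up.
    expr: absent(up{job="example-component"} == 1)
    for: 10m
    labels:
      severity: critical
    annotations:
      message: example-component has disappeared from Prometheus target discovery.
```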

Run make generate after you modify the files and make sure to add the modified files to the commit.

Roadmap

Monitor etcd
Adapt Tectonic inherited alerts with OpenShift operational knowledge

Testing

End-to-end tests

Run the end-to-end tests with make e2e-test. Clean up afterwards with make e2e-clean.
