← All dashboards

Kubernetes cluster monitoring (via Prometheus)

Monitors Kubernetes cluster using Prometheus. Shows overall cluster CPU / Memory / Filesystem usage as well as individual pod, containers, systemd services statistics. Uses cAdvisor metrics only.

Full K8 solution with dashboards, alerts and KPIs available in Grafana Cloud free tier (with 10K free Prometheus series metrics and more): https://grafana.com/signup/cloud/connect-account

release license k8s prometheus grafana link

Initial idea was taken from this dashboard and improved to exclude node-exporter dependency and to give more information about cluster state.


You only need to have running Kubernetes cluster with deployed Prometheus. Prometheus will use metrics provided by cAdvisor via kubelet service (runs on each node of Kubernetes cluster by default) and via kube-apiserver service only.

Your Prometheus configuration has to contain following scrape_configs:

  - job_name: kubernetes-nodes-cadvisor
    scrape_interval: 10s
    scrape_timeout: 10s
    scheme: https  # remove if you want to scrape metrics on insecure port
      ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
    bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
      - role: node
      - action: labelmap
        regex: __meta_kubernetes_node_label_(.+)
      # Only for Kubernetes ^1.7.3.
      # See: https://github.com/prometheus/prometheus/issues/2916
      - target_label: __address__
        replacement: kubernetes.default.svc:443
      - source_labels: [__meta_kubernetes_node_name]
        regex: (.+)
        target_label: __metrics_path__
        replacement: /api/v1/nodes/${1}/proxy/metrics/cadvisor
      - action: replace
        source_labels: [id]
        regex: '^/machine\.slice/machine-rkt\\x2d([^\\]+)\\.+/([^/]+)\.service$'
        target_label: rkt_container_name
        replacement: '${2}-${1}'
      - action: replace
        source_labels: [id]
        regex: '^/system\.slice/(.+)\.service$'
        target_label: systemd_service_name
        replacement: '${1}'


  • Total and used cluster resources: CPU, memory, filesystem.
    And total cluster network I/O pressure.
    Total and used cluster resources
  • Kubernetes pods usage: CPU, memory, network I/O.
    Pods usage
  • Containers usage: CPU, memory, network I/O.
    Docker and rkt containers which runs on cluster nodes but outside Kubernetes are also monitored.
    Containers usage
  • systemd system services usage: CPU, memory.
    systemd usage
  • Showing all above metrics both for all cluster and each node separately.
    Filtering metrics by nodes


If filesystem usage panels display N/A, you should correct device=~"^/dev/[vs]da9$" filter parameter in metrics query with devices your system actually has.

Dashboard revisions



Login or Sign up to write a review

Reviews from the community

Get this dashboard

Data source:


Import the dashboard template:


Download JSON

Docs: Importing dashboards

Downloads: 5,261,242

Related Kubernetes content

solutions Kubernetes monitoring