Monitors a Kubernetes cluster using Prometheus. Shows overall cluster CPU, memory, and filesystem usage, as well as statistics for individual pods, containers, and systemd services. Uses cAdvisor metrics only.
cAdvisor collects GPU usage information. If you also want to collect GPU temperature or power information, call the NVIDIA NVML library with node-exporter in addition.
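As a rough illustration of the kind of cluster-wide figures described above, the sketch below builds instant-query URLs for the Prometheus HTTP API using standard cAdvisor metric names. The server address, the `QUERIES` names, and the exact PromQL expressions are assumptions for illustration; the dashboard's own queries may differ.

```python
from urllib.parse import urlencode

# Assumed Prometheus address; replace with your own server.
PROMETHEUS_URL = "http://localhost:9090"

# Hypothetical PromQL expressions over standard cAdvisor metrics.
# id="/" selects the root cgroup, i.e. whole-node totals.
QUERIES = {
    "cluster_cpu_cores": 'sum(rate(container_cpu_usage_seconds_total{id="/"}[5m]))',
    "cluster_memory_bytes": 'sum(container_memory_working_set_bytes{id="/"})',
    "cluster_fs_bytes": 'sum(container_fs_usage_bytes{id="/"})',
}


def instant_query_url(expr, base_url=PROMETHEUS_URL):
    """Build a URL for Prometheus's instant-query endpoint (/api/v1/query)."""
    return f"{base_url}/api/v1/query?{urlencode({'query': expr})}"


if __name__ == "__main__":
    # Print one URL per query; fetch them with any HTTP client.
    for name, expr in QUERIES.items():
        print(name, instant_query_url(expr))
```

Each printed URL can be opened with any HTTP client; the response is JSON with the current value of the expression.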