GPU monitoring


Monitors Kubernetes cluster using Prometheus. Shows overall cluster CPU / Memory / Filesystem usage as well as individual pod, containers, systemd services statistics. Uses cAdvisor metrics only.
Last updated: 3 years ago

Start with Grafana Cloud and the new FREE tier. Includes 10K series Prometheus or Graphite Metrics and 50gb Loki Logs

Downloads: 957

Reviews: 0

    cadvisor collects the usage information of GPU If you want to collect the GPU temperature or power information, please call the nvidia nvml libraray with node-exporter additionally

    Get this dashboard: