NVIDIA DCGM Dashboard for Kubernetes (MIG & Non-MIG GPUs)

This Grafana dashboard uses the NVIDIA DCGM exporter to monitor GPU performance in Kubernetes clusters. Supporting both MIG and non-MIG GPUs, it provides real-time metrics on utilization, memory, temperature, power, and more, enabling efficient management of NVIDIA GPU resources.

NVIDIA DCGM Dashboard for Kubernetes (MIG & Non-MIG GPUs) screenshot 1
NVIDIA DCGM Dashboard for Kubernetes (MIG & Non-MIG GPUs) screenshot 2
NVIDIA DCGM Dashboard for Kubernetes (MIG & Non-MIG GPUs) screenshot 3
The NVIDIA DCGM Dashboard for Kubernetes (MIG & Non-MIG GPUs) dashboard uses the prometheus data source to create a Grafana dashboard with the bargauge, gauge, piechart, stat, table and timeseries panels.
Revisions
RevisionDescriptionCreated
Kubernetes

Kubernetes

by Grafana Labs
Grafana Labs solution

Monitor your Kubernetes deployment with prebuilt visualizations that allow you to drill down from a high-level cluster overview to pod-specific details in minutes.

Learn more

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies