EasyHPC GPU Nodes

EasyHPC GPU node dashboard.

A Grafana dashboard for monitoring GPU nodes in an EasyHPC environment using Prometheus metrics collected by the NVIDIA DCGM Exporter.

It provides a unified view of host performance, GPU utilization, power consumption, temperature, memory usage, and high-speed interconnect activity such as NVLINK, PCIe, and InfiniBand.

Revisions
RevisionDescriptionCreated

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies