EasyHPC GPU Nodes
EasyHPC GPU node dashboard.
A Grafana dashboard for monitoring GPU nodes in an EasyHPC environment using Prometheus metrics collected by the NVIDIA DCGM Exporter.
It provides a unified view of host performance, GPU utilization, power consumption, temperature, memory usage, and high-speed interconnect activity such as NVLINK, PCIe, and InfiniBand.
Data source config
Collector config:
Upload an updated version of an exported dashboard.json file from Grafana
| Revision | Description | Created | |
|---|---|---|---|
| Download |
