HPC / Combined Node, IB, GPU (profiling included) & NVLink Metrics Dashboard

This dashboard monitors HPC nodes with extended GPU profiling metrics including SM activity, tensor core usage, FP16/32/64 pipeline utilization, plus InfiniBand and NVLink performance data.

HPC / Combined Node, IB, GPU (profiling included) & NVLink Metrics Dashboard screenshot 1
HPC / Combined Node, IB, GPU (profiling included) & NVLink Metrics Dashboard screenshot 2
HPC / Combined Node, IB, GPU (profiling included) & NVLink Metrics Dashboard screenshot 3
The HPC / Combined Node, IB, GPU (profiling included) & NVLink Metrics Dashboard dashboard uses the prometheus data source to create a Grafana dashboard with the gauge, stat, state-timeline, table, text and timeseries panels.
Revisions
RevisionDescriptionCreated

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies