EasyHPC Slurm Dashboard

Slurm cluster monitoring dashboard for EasyHPC slurm-exporter. Includes cluster overview (CPU/mem/GPU/node states), GPU usage by user and job, partition allocation, account quota saturation, pending reasons, backfill scheduler stats, queue fairness, and exporter health. Tested with Slurm 25.05, dashboard schema v39.

Slurm cluster monitoring via superdom/slurm-exporter.

Panels: cluster overview, partitions, nodes, GPU (cluster/user/job), accounts & users with quota saturation, live job inventory, pending reasons, scheduler diagnostics, queue fairness.

Requirements

  • Grafana 10.0+, Prometheus datasource
  • slurm-exporter v0.1.0+ on the Slurm controller
  • Slurm 23.x–25.x

Quick start

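Install and start the exporter on the Slurm controller:

  pip install git+https://github.com/superdom/slurm-exporter.git@v0.1.0
  slurm-exporter --port=9410

Then add a scrape job under scrape_configs in prometheus.yml:

  - job_name: slurm-exporter
    static_configs: [{ targets: ["slurmctld:9410"] }]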
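
Before importing the dashboard, it can help to confirm that the exporter is actually serving metrics. The following is a minimal sketch, not part of slurm-exporter: the helper names `sample_names` and `fetch_metrics` are hypothetical, and the URL assumes the `slurmctld:9410` target from the scrape job above together with Prometheus's default `/metrics` path.

```python
import urllib.request


def sample_names(exposition_text):
    """Return the distinct metric names in Prometheus text-format output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        # A sample line is either `name{labels} value` or `name value`.
        name = line.split("{", 1)[0].split(None, 1)[0]
        names.add(name)
    return names


def fetch_metrics(url="http://slurmctld:9410/metrics", timeout=5):
    """Fetch raw exposition text from the exporter (URL is an assumption)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Running `sample_names(fetch_metrics())` from a host that can reach the controller should return a non-empty set if the exporter is up. An empty set or a connection error points to the exporter or the port rather than to Grafana.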