EasyHPC Slurm Dashboard
Slurm cluster monitoring dashboard for EasyHPC slurm-exporter. Includes cluster overview (CPU/mem/GPU/node states), GPU usage by user and job, partition allocation, account quota saturation, pending reasons, backfill scheduler stats, queue fairness, and exporter health. Tested with Slurm 25.05, dashboard schema v39.
EasyHPC Slurm Dashboard
Slurm cluster monitoring via superdom/slurm-exporter.
Panels: cluster overview, partitions, nodes, GPU (cluster/user/job), accounts & users with quota saturation, live job inventory, pending reasons, scheduler diagnostics, queue fairness.
Requirements
- Grafana 10.0+, Prometheus datasource
- slurm-exporter v0.1.0+ on the Slurm controller
- Slurm 23.x–25.x
Quick start
pip install git+https://github.com/superdom/slurm-exporter.git@v0.1.0
slurm-exporter --port=9410
- job_name: slurm-exporter
static_configs: [{ targets: ["slurmctld:9410"] }]
Data source config
Collector config:
Upload an updated version of an exported dashboard.json file from Grafana
| Revision | Description | Created | |
|---|---|---|---|
| Download |