VAST 4.7 Cluster Statistics

Dashboard for visualizing VAST 4.7 cluster metrics

Introduction

The VAST Data Platform unifies storage, database, and containerized compute into a single, scalable software platform to power AI & deep learning in modern data centers and clouds.

In addition to the in-depth analytics provided in the product UI, VAST versions 4.5.0 and later have built-in Prometheus exporters for easy integration into existing monitoring infrastructure.

This dashboard provides a heads-up view for visualizing:

Cluster health and statistics
NFSv3 and S3 protocol statistics
View statistics
User statistics
Hardware health and statistics

Compatibility

For VAST 5.0 and later, see the new VAST Cluster Statistics dashboard.

This dashboard supports VAST 4.7 and later with the built-in Prometheus exporter. For a 4.6 dashboard, or one that sources from the external exporter, please contact VAST support.

Prometheus Configuration

VAST clusters running 4.6.0-sp11 and later provide separate metrics endpoints to give more control on which metrics to fetch and how often to fetch them. This allows users to customize the trade off between metric freshness and the cost of fetching the data.

Here's an example prometheus.yml configuration for the 4 endpoints used by this dashboard with some guidelines on scraping intervals.

scrape_configs:
  # Base metrics contain key cluster and protocol stats
  # Recommended scrape interval is >= 30s
  - job_name: 'vast_base'
    metrics_path: '/api/prometheusmetrics/'
    scrape_interval: 30s
    scrape_timeout: 20s
scheme: https
static_configs:
  - targets: [&#39;10.71.10.202:443&#39;]
tls_config:
    insecure_skip_verify: true

basic_auth:
   username: &#39;admin&#39;
   password: &#39;xxxxxx&#39;
Device metrics can be data intensive for larger clusters
Recommended scrape interval is >= 60s
job_name: 'vast_devices'
metrics_path: '/api/prometheusmetrics/devices'
scrape_interval: 60s
scrape_timeout: 45s
scheme: https
static_configs:
targets: ['10.71.10.202:443']
tls_config:
insecure_skip_verify: true
basic_auth:
username: 'admin'
password: 'xxxxxx'
View metrics can be data intensive for clusters with many views
Recommended scrape interval is >= 60s
job_name: 'vast_views'
metrics_path: '/api/prometheusmetrics/views'
scrape_interval: 60s
scrape_timeout: 45s
scheme: https
static_configs:
targets: ['10.71.10.202:443']
tls_config:
insecure_skip_verify: true
basic_auth:
username: 'admin'
password: 'xxxxxx'
User metrics can be data intensive for clusters with many users
Recommended scrape interval is >= 60sjob_name: 'vast_users'
metrics_path: '/api/prometheusmetrics/users'
scrape_interval: 60s
scrape_timeout: 45s
scheme: https
static_configs:
targets: ['10.71.10.202:443']
tls_config:
insecure_skip_verify: truebasic_auth:
username: 'admin'
password: 'xxxxxx'

Revisions

Revision	Description	Created
			Download

Get this dashboard

Import the dashboard template

Download JSON

Datasource

Dependencies

Resources

Docs: Importing dashboards Webinar: Getting started with Grafana dashboard design Webinar: Building advanced Grafana dashboards

VAST 4.7 Cluster Statistics

Introduction

Compatibility

Prometheus Configuration

Device metrics can be data intensive for larger clusters

Recommended scrape interval is >= 60s

View metrics can be data intensive for clusters with many views

Recommended scrape interval is >= 60s

User metrics can be data intensive for clusters with many users

Recommended scrape interval is >= 60s

Data source config

Collector config:

Get this dashboard

Still have questions?

Get every update