cassandra-dashboard

Monitoring for Cassandra clusters

cassandra-dashboard screenshot 1

Revision 3: https://grafana.com/api/dashboards/10849/revisions/3/download

This dashboard shows a set of graphs to monitor the operations of Apache Cassandra clusters. It should be compatible with most versions, except perhaps the “Repair Ratio” graph that only works in recent ones. Among others:

  • Global cluster operations [per type]
  • Operations per instance [per type]
  • Latencies
  • Active/blocked/pending thread details
  • Timeouts, unavailables and other errors
  • Cache info
  • Compaction information
  • Load per instance
  • Large partitions
  • Tombstones
  • Repair status and progress

The backend database is Prometheus, feel free to reuse the Ansible roles I have created:

https://github.com/mimarpe/ansible/tree/master/roles/prometheus-deploy

cassandra-exporter is used as external agent on the Cassandra nodes to get the metrics. It is based on jmx-exporter. You can use the role or template configuration here to deploy it:

https://github.com/mimarpe/ansible/blob/master/roles/cassandra-exporter-setup/templates/config_cassandra_exporter.yml.j2

Cassandra-Reaper is used for repair scheduling and for the metrics used in the dashboard

Revisions
RevisionDescriptionCreated

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies