Tarantool Cartridge dashboard

Dashboard for Tarantool Cartridge and Tarantool 1.10—2.x application and database server monitoring, based on grafonnet library.

Tarantool Cartridge dashboard screenshot 1
Tarantool Cartridge dashboard screenshot 2
Tarantool Cartridge dashboard screenshot 3

About project

This is the dashboard for Tarantool Cartridge and Tarantool 1.10—2.x applications.

This dashboard is based on grafonnet library. Sources are available in project’s GitHub page.

Configuration

The dashboard is designed to work with Tarantool + Prometheus + Grafana cluster.

Tarantool configuration

The dashboard is designed to work with Tarantool metrics module (v0.15.1 or newer is needed for complete experience). It is recommended to use metrics role (refer to cartridge.roles.metrics role page for instructions). Be sure to set alias for every Tarantool instance (e.g. by environment variable TARANTOOL_ALIAS) so metrics will be splitted and displayed correctly.

Cartridge must be configured to have a prometheus metrics endpoint for Prometheus (refer to cartridge.roles.metrics role page for instructions). For HTTP panels, HTTP routing middleware with summary collector must be enabled. (See more on HTTP middleware in metrics documentation and Cartridge role example.)

To support CRUD statistics, install CRUD 0.11.1 or newer. Call crud.cfg{stats = true, stats_driver='metrics', stats_quantiles=true} on router to enable CRUD statistics collect with latency quantiles.

To support expirationd statistics, install expirationd 1.2.0 or newer. Call expirationd.cfg{metrics = true} on instance to enable statistics export.

Dashboard

Metrics used in dashboard are based on Tarantool metrics module. Time series are grouped by alias parameter, so you can use this dashboard to monitor a cluster of Tarantool instances.

Dashboard contains the following sections:

  • Cluster overview
  • Replication overview
  • Tarantool HTTP statistics
  • Tarantool network activity
  • Tarantool memtx allocation overview
  • Tarantool MVCC overview
  • Tarantool vinyl statistics
  • Tarantool CPU statistics
  • Tarantool runtime overview
  • Tarantool LuaJit statistics
  • Tarantool operations statistics
  • CRUD module statistics
  • expirationd module statistics

Cluster overview

Contains several cluster overview panels, providing some aggregated info on cluster instances health, overall load and memory stats. Also contains panels on Cartridge issues count, read only status and switchover triggers, election info.

Replication overview

Contains info about replication lag and status, clock delta and synchronous queue statistics.

Tarantool HTTP statistics

Overview of processed HTTP requests on Tarantool’s side, grouped by method, path and response code. There are two lines of panels: one on rps and the other on 99th percentile of latency.

Tarantool network activity

Overview of instances network activity. Displays input and output data load, as well as network connections, detailed requests stats and memory consumption.

Tarantool memtx allocation overview

Overview of Tarantool memtx memory allocation (tuples and indexes). Contains short instruction on memory allocation monitoring. You can read more about slab here.

Tarantool MVCC overview

Contains statistics of Tarantool memtx engine transactions manager.

Tarantool vinyl statistics

Overview of Tarantool vinyl memory and disk allocation, vinyl scheduler process and transaction statistics. You can read more about vinyl engine here.

Tarantool CPU statistics

Contains CPU process time statistics for each instance.

Tarantool runtime overview

Metrics related to runtime. Contains Lua and transaction memory panels, fiber statistics panels and event loop panel.

Tarantool LuaJit statistics

Detailed Lua runtime overview. Contains panels about objects allocation and garbage collect, memory consumption and LuaJit traces info.

Tarantool operations statistics

Overview of operations on Tarantool spaces (select, update, etc.) aggregated over all Tarantool spaces, as well as other operations activity (eval, call, auth, errors and SQL calls).

CRUD module statistics

Overview of CRUD module operations (crud.insert, crud.select, etc) separated by operation, cluster space and execution status. Contains RPS graphs, latency graphs and graphs with SELECT/PAIRS operations detailed info.

expirationd module statistics

Overview of expirationd module jobs: tuples statistics, restarts count and working time.

Contributing

Project is open-sourced, so you can freely file an issue with suggestion or bug report to our GitHub page.

Revisions
RevisionDescriptionCreated

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies