1 Node Exporter for Prometheus Dashboard EN v20201010

Dashboard

【English version】Update 2020.10.10, add the overall resource overview! Support Grafana6&7,Support Node Exporter v0.16 and above.Optimize the main metrics display. Includes: CPU, memory, disk IO, network, temperature and other monitoring metrics。https://github.com/starsliao/Prometheus
Last updated: a month ago

Downloads: 62353

Reviews: 31

  • ensa.png
    ensa.png
  • ens1.png
    ens1.png

Grafana v6.7.4/v7.2.0 + node_exporter 1.0.1 test pass

If you are using grafana 6.x, please download and edit the json file, replace table-old with table, and then import it into grafana.

【English version】Update 2020.10.10, add the overall resource overview! Support Grafana6&7,Support Node Exporter v0.16 and above.Optimize the main metrics display. Includes: CPU, memory, disk IO, network, temperature and other monitoring metrics。https://github.com/starsliao/Prometheus


Important update:

  • Newly added data source variable origin_prometheus, taken from the external system label of Prometheus: external_labels can be used to support multiple scenarios where Prometheus accesses to VictoriaMetrics or Thanos and other third-party storage to use the remote_write method. (No value by default, the absence of this label in the indicator does not affect the use) VictoriaMetrics please use v1.42.0 and above versions, which fixed the problem of displaying the grafana table.

  • Added the time interval variable interval, all graphs are associated with this variable, and the granularity of the graph can be adjusted by selecting the time interval as needed. Note that the scrape_interval prometheus.yml. If there are less than 2 values in the rate time interval, the graph cannot be displayed. When it is equal to 2 values, rate is irate.(The default time interval is set to 2 minutes. If your Prometheus scrape_interval is greater than 1 minute, the graph will not be displayed. Just choose a larger time interval.)


Compatibility Release Notes:

If you import the dashboard, you get an error:

Failed create dashboard model
Cannot read property 'Symbol(Symbol.iterator)' of undefined

Note that your grafana version is too low, does not support the new BAR GAUGE chart, it is recommended to upgrade to the latest version of grafana, or re-import version without BAR GAUGE:

https://grafana.com/grafana/dashboards/11173


Blog:StarsL.cn

GitHub:https://github.com/starsliao/Prometheus


Screenshots:

Resource Overview Resource Overview Resource Details Resource Details

Notes:


After import the dashboard, please click Dashboardsettings-- Variables to set the variable in the upper right corner of the dashboard according to the actual situation。

The three variables:$job, $hostname and $node will be set and associated by default.

  • $node takes the instance of node_exporter, ip:port format. Most queries are associated with this variable, please make sure it is valid.

  • $maxmount is used to check the maximum partition of the current host. Normally can only obtain partitions of type ext.* and xfs by default.


【update】:

2020/10/10

  1. The origin_prometheus variable removed ALL to solve the problem of no data after clicking the link in the summary table.

2020/10/03

  1. Added 5-minute load, TCP connection number, timewait number, and total number of hosts to the resource overview table, and removed paging.
  2. Fixed the problem of links to resource details in the resource overview table.
  3. Replace all irate to be rate, increased the time interval variable, and can self-control the graph granularity. For more details, please refer to the description of Important Update as above.
  4. Added data source variables to support multiple Prometheus. For more details, please refer to the description of Important Update as above.
  5. Modified the description of the resource overview chart. Move the mouse to i in the upper left corner of the table to view it.

2020/06/27

  1. Added a 5-minute load value to the table on the resource overview page.
  2. Adjusted the resource details link of the resource overview page, you can switch the details of each host in the current window.
  3. Adjusted the options of the variable menu, Instance multi-selection can view multiple hosts at the same time on the detail page.
  4. The node_exporter 1.0.1 test is used normally.

2020/05/30

  1. Added table details of overall host resources conditions and statistical graphs of overall host resources. The default contraction, can be grouped by Job, can be directly linked to the specified host details. (Pls refer to screenshot)
  2. Cancelled the graph of the host temperature, and increased the chart of the per hour flow rate.
  3. Corrected the calculation of the disk usage rate to be consistent with the algorithm of the df command.
  4. Optimized the calculation time value of some charts, speeded up the loading speed, and adjusted the display effect and position of some chart curves.

2019/11/2

  1. Adjusted the display metrics and descriptions of the Network Sockstat to make it more practical.
  2. Modified the display and description of the node_disk_io_time_seconds_total metrics.
  3. Add the reference value to the chart for each I/O read/write time-consuming.
  4. Optimized the display effect of part graph, fixed the color of some lines.

2019/10/30

  1. The pie chart that needs to be manually installed was removed, and the pie chart of the original disk information is integrated into the disk table information.
  2. Add a Bar Gauge to timely display the information such as cpu,memory , etc.
  3. Add a graph to turn on context switching and opening files.
  4. To separate the Time Spent Doing I/Os from the cpu usage graph.
  5. Most of the charts in the entire dashboard have been adjusted and optimized to enhance the practicality and compatibility.
  6. Fixed the issue about report error of displaying multiple server partial charts at the same time.

2019/7/1

  1. Add usage graph of disk partitions.
  2. Optimized data display effect.

2019/5/20

  1. Add server list multi-select support, graphs can display data of multiple servers.
  2. Optimized the display effect of variables.
  3. Optimize the description of some monitoring metrics, click the "i" in the upper left corner of the chart to view.

2019/1/9

  1. Fixed a bug that showed inaccurate memory usage.
  2. Add a link to update node_exporter and dashboard.

11/16

  1. Add description of the variable.
  2. Optimized the display speed after the new installation of the dashboard

11/15

  1. Add an environment to group servers.
  2. Add the pie chart and total disk space.
  3. Add the descriptor about current opened file.
  4. Add the description of some monitoring metrics.
  5. Optimized the display results of some metrics.

11/13

  1. Add the ratio of a graph of disk's I/O operation consuming time per second.
Dependencies: