This is documentation for the next version of Grafana. For the latest stable release, go to the latest version.
Overview of Grafana 8 alerts
Alerts allow you to know about problems in your systems moments after they occur. Robust and actionable alerts help you identify and resolve issues quickly, minimizing disruption to your services.
Alerts have four main components:
- Alerting rule - One or more query and/or expression, a condition, the frequency of evaluation, and the (optional) duration that a condition must be met before creating an alert.
- Contact point - A channel for sending notifications when the conditions of an alerting rule are met.
- Notification policy - A set of matching and grouping criteria used to determine where, and how frequently, to send notifications.
- Silences - Date and matching criteria used to silence notifications.
You can perform the following tasks for alerts:
- Create a Grafana managed alert rule
- Create a Cortex or Loki managed alert rule
- View existing alert rules and their current state
- Test alert rules and troubleshoot
- Add or edit an alert contact point
- Add or edit notification policies
- Create and edit silences
The current alerting system doesn’t support high availability. Alert notifications are not deduplicated and load balancing is not supported between instances e.g. silences from one instance will not appear in the other. The Grafana team aims to have this feature by Grafana version 8.1+.
Grafana managed alerts are evaluated by the Grafana backend. Rule evaluations are scheduled, according to the alert rule configuration, and queries are evaluated by an engine that is part of core Grafana.
Alerting rules can only query backend data sources with alerting enabled:
- builtin or developed and maintained by grafana:
Google Cloud Monitoring,
Azure Data Explorer
- any community backend data sources with alerting enabled (
alertingproperties are set in the plugin.json)
Metrics from the alerting engine
The alerting engine publishes some internal metrics about itself. You can read more about how Grafana publishes internal metrics.
||gauge||How many alerts by state|
||histogram||Histogram of requests to the Alerting API|
||gauge||The number of active, non default Alertmanager configurations for grafana managed alerts|
||counter||The total number of rule evaluations|
||counter||The total number of rule evaluation failures|
||summary||The duration for a rule to execute|
||gauge||The number of rules|
Related Grafana resources
Unify your data with Grafana plugins: Splunk, MongoDB, Datadog, and more
Show how Grafana can be used to take data from multiple different sources and unify it, without disrupting the investments that are working today.
Demo: Getting started with Grafana Enterprise and observability
Join the Grafana Labs team for a 30-minute demo of how to get started with the Grafana Stack, so you can go from zero to observability in just a few minutes.