Introduction to Grafana Incident

Grafana Incident is a comprehensive incident management solution designed to help teams respond quickly and effectively to incidents, outages, and other critical events.

During an incident, Grafana Incident automates tasks, assigns roles, and passively collects context to document learnings, so you can focus on what’s important. By integrating Grafana Incident into your current incident management process, you can streamline your workflow, minimize the impact of incidents, and learn from them to prevent similar incidents from happening in the future.

Overview

With Grafana Incident, teams can streamline their incident management processes, reduce response times, and improve overall incident response.

Centralized incident management: Grafana Incident provides a centralized platform for managing incidents, enabling teams to collaborate more effectively and respond faster to incidents.

Automated incident response steps: Meet your team where they are and start collaborating on incidents faster. Grafana Incident automatically creates all the virtual spaces and resources, such as Slack channels, meeting rooms, and post-incident review documents. When an incident is declared, there’s no need to spend time on administrative tasks.

Inform stakeholders faster: Keep your stakeholders up to date without interrupting your response team. Grafana Incident passively tracks the timeline of an incident, so stakeholders can see what happened and who's involved while the incident is in progress. Once you've resolved the incident, gain valuable insights about your incident response process, what happened, and how.

Reduce MTTR: Grafana Incident enables you to resolve incidents faster by making it easier to coordinate and collaborate during an incident. Save valuable investigation time with automation and integrations with your existing tools.

Key features

Grafana Incident offers a range of key features that enable teams to manage incidents more effectively:

Built-in incident timeline and task management

Manage and assign tasks to keep the incident response team aware and informed. The built-in timeline and task manager make it easier for responders to join the incident, review activity, and identify what needs to be done.

Integrations and automation

Integrate with your existing tools and automate routine incident response tasks to start collaborating on incidents faster. Automatically create virtual spaces and resources, such as Slack channels, meeting rooms, and post-incident review documents.

Interactive Incident Chatbot

Create and manage incidents directly from Slack with the Chatbot command-line interface. Assign roles, add tasks, update the incident severity, and more.

Incident roles

Identify who’s involved and what they’re doing with Incident roles. The Investigator is responsible for diagnosing and resolving the incident, while the Incident Commander oversees the incident by managing communication, tasks, and necessary updates.

AI-powered features

Grafana Incident uses machine learning to reduce toil and solve real-world problems for admins and developers. New AI-driven features in Grafana Incident include:

Sift: A diagnostic assistant in Grafana Cloud designed to automatically discover contributing causes to incidents across metrics, logs, and tracing data.

Auto-summary: A tool that automatically summarizes key details from your incident timelines with a single click.

To learn more about Grafana Machine Learning, visit the Machine Learning documentation.

Next steps