Deployment Performance & Health


Watch performance, errors, latency, and infrastructure metrics for your workload
Last updated: 3 months ago


Downloads: 83847

Reviews: 0


A parameterized dashboard for common workload types (Deployment, DaemonSet, StatefulSet). Its charts pull Prometheus metrics from Kubernetes, Istio, and node-exporter and visualize them in several categories (by panel):

  • At a Glance - A quick view of the health of your Kubernetes-based app (assumes it's a web service, so it's mostly Istio metrics such as success rate and latency)
  • RED (Requests, Errors, Duration) - SRE "Golden Signals" that come from Istio
  • USE (Utilization, Saturation, Errors) - SRE "Golden Signals" that come from Kubernetes
  • Infra Resources - Pod distribution by host and AZ, HPA metrics, image tag, OOM kills, CPU throttling, total CPU and memory allocated to the deployment, and more
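
To make the RED panel concrete, here is a minimal Python sketch that builds PromQL strings of the kind such a panel might run against Istio's standard metrics (istio_requests_total and istio_request_duration_milliseconds_bucket). The function name red_queries, the 5m window, the 5xx error definition, and the 95th-percentile choice are assumptions for illustration, not the dashboard's actual queries.

```python
def red_queries(namespace: str, workload: str, window: str = "5m") -> dict[str, str]:
    """Build illustrative PromQL strings for the RED signals of one workload."""
    # Label selector shared by all three queries (Istio standard metric labels).
    selector = (
        f'reporter="destination",'
        f'destination_workload_namespace="{namespace}",'
        f'destination_workload="{workload}"'
    )
    # Requests: per-second request rate over the window.
    requests = f"sum(rate(istio_requests_total{{{selector}}}[{window}]))"
    # Errors: share of requests answered with a 5xx code (assumed error definition).
    errors = (
        f'sum(rate(istio_requests_total{{{selector},response_code=~"5.."}}[{window}]))'
        f" / {requests}"
    )
    # Duration: 95th-percentile latency in milliseconds (assumed percentile).
    duration = (
        "histogram_quantile(0.95, sum by (le) ("
        f"rate(istio_request_duration_milliseconds_bucket{{{selector}}}[{window}])))"
    )
    return {"requests": requests, "errors": errors, "duration": duration}


if __name__ == "__main__":
    # Hypothetical namespace and workload names.
    for name, query in red_queries("shop", "cart").items():
        print(f"{name}: {query}")
```

Each returned string can be pasted into a Prometheus query box to check it against your own mesh before comparing it with what the panels show.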

Select your account, cluster, namespace, and then your workload name, and all charts will render.
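
The selection step works like template substitution: each chart's query contains placeholders that are filled in from the values you pick. The sketch below imitates that with Python's string.Template, whose $name syntax happens to resemble Grafana's variable syntax. The variable names ($cluster, $namespace, $workload), the cluster label, and the pod-name pattern are assumptions for illustration and may not match the dashboard's real variables.

```python
from string import Template

# Hypothetical query template for CPU throttling. The "$cluster", "$namespace",
# and "$workload" placeholders mimic dashboard variables; the pod regex assumes
# pods are named "<workload>-<suffix>".
CPU_THROTTLE = Template(
    "sum(rate(container_cpu_cfs_throttled_periods_total"
    '{cluster="$cluster", namespace="$namespace", pod=~"$workload-.*"}[5m]))'
)


def render(template: Template, **variables: str) -> str:
    """Fill every placeholder; raises KeyError if a variable is missing."""
    return template.substitute(**variables)


if __name__ == "__main__":
    # Hypothetical selections.
    print(render(CPU_THROTTLE, cluster="prod-east", namespace="shop", workload="cart"))
```

Using substitute rather than safe_substitute means a missing selection fails loudly, which mirrors why the charts stay empty until every variable has a value.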
