Slide 2 of 2

Distributed tracing and continuous profiling at scale

Grafana Cloud Traces

Often, a high-traffic system produces far more traces than you can affordably store and query on your own.

Grafana Cloud Traces runs that backend and storage for you, so you can afford to keep enough trace history to investigate.

  • No backend to operate: Storage, scaling, sharding, and capacity planning are handled for you.
  • Adaptive Traces: Healthy requests produce far more traces than you need, but tail sampling keeps what matters and drops the rest. Open source can tail-sample too, but you run the pipeline yourself and redeploy it to change policies. Adaptive Traces is the managed version, where policy changes take effect within about 10 seconds, no redeploy.

Grafana Cloud Profiles

Profiling works the same in open source and Grafana Cloud, but the difference is operating it.

  • Always-on profiling generates continuous data from every instance in your fleet.
  • Storing and querying that volume affordably is hard to run yourself.

Grafana Cloud Profiles stores and queries that volume for you, so you keep profiling on everywhere.

Real results from real teams

  • Uber

    Moved from manual, reactive profiling to always-on continuous profiling with Grafana Cloud Profiles.

  • IG Group

    Chose Grafana Cloud specifically for Tempo as their traces store at scale, alongside Mimir for metrics. Built an “observable by default” platform using OpenTelemetry across cloud and on-prem.

Script

Tracing and profiling are available in open source Grafana. The challenge is cost: storing traces gets expensive fast, and always-on profiling generates a steady stream of data that’s expensive to store and query yourself. Grafana Cloud runs those backends for you, so you keep profiling on everywhere without operating the storage.