This is documentation for the next version of Grafana Tempo documentation. For the latest stable release, go to the latest version.
Monitor query I/O and span timestamp distance
You can use these metrics to monitor query I/O and span timestamp quality:
query_frontend_bytes_inspected_total
measures how many bytes the frontend reads per request type and tenant. This value shows the total number of bytes read from disk and object storage.spans_distance_in_future_seconds
andspans_distance_in_past_seconds
measure how far a span end time is from the ingestion time. This capability lets you find customers that send spans too far in the future or past, which may not be found using the Search API.
Use these metrics together to correlate query cost with data quality and pipeline health.
Reference
The query frontend emits query_frontend_bytes_inspected_total
when a request finishes, aggregating bytes inspected by queriers.
The distributor emits spans_distance_in_future_seconds
and spans_distance_in_past_seconds
by comparing span end time with ingestion time.
Names | Type | Labels | Buckets | Emitted | Notes |
---|---|---|---|---|---|
query_frontend_bytes_inspected_total | Counter | tenant , op | - | On request completion at the query frontend; aggregates bytes from queriers; excludes cached querier responses. | |
spans_distance_in_future_seconds , spans_distance_in_past_seconds | Histogram | tenant | 300s, 1800s, 3600s (5m, 30m, 1h) | In the distributor on ingest; observes seconds between span end time and ingestion time. | Spans in the future are accepted but invalid and might not be searchable. |
PromQL examples
To see how frequently future-dated spans arrive by tenant, use the histogram count rate:
sum by (tenant) (
rate(tempo_spans_distance_in_future_seconds_count[5m])
)
Inspect query read throughput (bytes/s
) by tenant and operation:
sum by (tenant, op) (
rate(tempo_query_frontend_bytes_inspected_total[5m])
)
Top five tenants by inspected GiB over the last hour:
topk(
5,
sum by (tenant) (increase(tempo_query_frontend_bytes_inspected_total[1h])) / 1024 / 1024 / 1024
)
To quantify ingestion delay using the past-distance histogram, chart the P90 over time:
histogram_quantile(
0.9,
sum by (tenant, le) (
rate(tempo_spans_distance_in_past_seconds_bucket[15m])
)
)