About Grafana Mimir versioning
This topic describes our guarantees for this Grafana Mimir major release.
Flags, configuration, and minor version upgrades
Upgrading Grafana Mimir from one minor version to the next minor version should work, but we don’t want to bump the major version every time we remove a configuration parameter.
We will keep deprecated features in place for two minor releases.
You can use the deprecated_flags_inuse_total
metric to generate an alert that helps you determine if you’re using a deprecated flag.
These guarantees don’t apply to experimental features.
Reading old data
The Grafana Mimir maintainers commit to ensuring that future versions can read data written by versions within the last two years. In practice, we expect to be able to read data written more than two years ago, but a minimum of two years is our guarantee.
API Compatibility
Grafana Mimir strives to be 100% compatible with the Prometheus HTTP API which is by default served by endpoints with the /prometheus HTTP path prefix /prometheus/*
.
We consider any deviation from this 100% API compatibility to be a bug, except for the following scenarios:
- Additional API endpoints for creating, removing, modifying alerts, and recording rules.
- Additional APIs that push metrics (under
/prometheus/api/push
). - Additional API endpoints for management of Grafana Mimir, such as the ring. These APIs are not included in any compatibility guarantees.
- Delete series API.
Experimental features
Grafana Mimir is an actively developed project and we encourage the introduction of new features and capabilities. Not everything in each release of Grafana Mimir is considered production-ready. We mark as “Experimental” all features and flags that we don’t consider production-ready.
We do not guarantee backwards compatibility for experimental features and flags. Experimental configuration and flags are subject to change.
The following features are currently experimental:
- Alertmanager
- Enable a set of experimental API endpoints to help support the migration of the Grafana Alertmanager to the Mimir Alertmanager.
-alertmanager.grafana-alertmanager-compatibility-enabled
- Enable support for any UTF-8 character as part of Alertmanager configuration/API matchers and labels.
-alertmanager.utf8-strict-mode-enabled
- Enable a set of experimental API endpoints to help support the migration of the Grafana Alertmanager to the Mimir Alertmanager.
- Compactor
- Enable cleanup of remaining files in the tenant bucket when there are no blocks remaining in the bucket index.
-compactor.no-blocks-file-cleanup-enabled
- In-memory cache for parsed meta.json files:
-compactor.in-memory-tenant-meta-cache-size
- Enable cleanup of remaining files in the tenant bucket when there are no blocks remaining in the bucket index.
- Ruler
- Aligning of evaluation timestamp on interval (
align_evaluation_time_on_interval
) - Allow defining limits on the maximum number of rules allowed in a rule group by namespace and the maximum number of rule groups by namespace. If set, this supersedes the
-ruler.max-rules-per-rule-group
and-ruler.max-rule-groups-per-tenant
limits. -ruler.max-rules-per-rule-group-by-namespace
-ruler.max-rule-groups-per-tenant-by-namespace
- Allow protecting rule groups from modification by namespace. Rule groups can always be read, and you can use the
X-Mimir-Ruler-Override-Namespace-Protection
header with namespace names as values to override protection from modification. -ruler.protected-namespaces
- Allow control over independent rules to be evaluated concurrently as long as they exceed a certain threshold on their rule group last duration runtime against their interval. We have both a limit on the number of rules that can be executed per ruler and per tenant:
-ruler.max-independent-rule-evaluation-concurrency
-ruler.max-independent-rule-evaluation-concurrency-per-tenant
-ruler.independent-rule-evaluation-concurrency-min-duration-percentage
-ruler.rule-evaluation-write-enabled
- Allow control over rule sync intervals.
ruler.outbound-sync-queue-poll-interval
ruler.inbound-sync-queue-poll-interval
- Cache rule group contents.
-ruler-storage.cache.rule-group-enabled
- Aligning of evaluation timestamp on interval (
- Distributor
- Metrics relabeling
-distributor.metric-relabeling-enabled
- Using status code 529 instead of 429 upon rate limit exhaustion.
-distributor.service-overload-status-code-on-rate-limit-enabled
- Limit exemplars per series per request
-distributor.max-exemplars-per-series-per-request
- Limit OTLP write request byte size
-distributor.max-otlp-request-size
- Enforce a maximum pool buffer size for write requests
-distributor.max-request-pool-buffer-size
- Enable conversion of OTel start timestamps to Prometheus zero samples to mark series start
-distributor.otel-created-timestamp-zero-ingestion-enabled
- Promote a certain set of OTel resource attributes to labels
-distributor.promote-otel-resource-attributes
- Metrics relabeling
- Hash ring
- Disabling ring heartbeat timeouts
-distributor.ring.heartbeat-timeout=0
-ingester.ring.heartbeat-timeout=0
-ruler.ring.heartbeat-timeout=0
-alertmanager.sharding-ring.heartbeat-timeout=0
-compactor.ring.heartbeat-timeout=0
-store-gateway.sharding-ring.heartbeat-timeout=0
-overrides-exporter.ring.heartbeat-timeout=0
- Disabling ring heartbeats
-distributor.ring.heartbeat-period=0
-ingester.ring.heartbeat-period=0
-ruler.ring.heartbeat-period=0
-alertmanager.sharding-ring.heartbeat-period=0
-compactor.ring.heartbeat-period=0
-store-gateway.sharding-ring.heartbeat-period=0
-overrides-exporter.ring.heartbeat-period=0
- Disabling ring heartbeat timeouts
- Ingester
- Add variance to chunks end time to spread writing across time (
-blocks-storage.tsdb.head-chunks-end-time-variance
) - Snapshotting of in-memory TSDB data on disk when shutting down (
-blocks-storage.tsdb.memory-snapshot-on-shutdown
) - Out-of-order samples ingestion (
-ingester.ooo-native-histograms-ingestion-enabled
) - Out-of-order native histogram samples ingestion (
-ingester.out-of-order-time-window
) - Shipper labeling out-of-order blocks before upload to cloud storage (
-ingester.out-of-order-blocks-external-label-enabled
) - Postings for matchers cache configuration:
-blocks-storage.tsdb.head-postings-for-matchers-cache-ttl
-blocks-storage.tsdb.head-postings-for-matchers-cache-size
(deprecated)-blocks-storage.tsdb.head-postings-for-matchers-cache-max-bytes
-blocks-storage.tsdb.head-postings-for-matchers-cache-force
-blocks-storage.tsdb.block-postings-for-matchers-cache-ttl
-blocks-storage.tsdb.block-postings-for-matchers-cache-size
(deprecated)-blocks-storage.tsdb.block-postings-for-matchers-cache-max-bytes
-blocks-storage.tsdb.block-postings-for-matchers-cache-force
- CPU/memory utilization based read request limiting:
-ingester.read-path-cpu-utilization-limit
-ingester.read-path-memory-utilization-limit"
- Early TSDB Head compaction to reduce in-memory series:
-blocks-storage.tsdb.early-head-compaction-min-in-memory-series
-blocks-storage.tsdb.early-head-compaction-min-estimated-series-reduction-percentage
- Timely head compaction (
-blocks-storage.tsdb.timely-head-compaction-enabled
) - Count owned series and use them to enforce series limits:
-ingester.track-ingester-owned-series
-ingester.use-ingester-owned-series-for-limits
-ingester.owned-series-update-interval
- Per-ingester circuit breaking based on requests timing out or hitting per-instance limits
-ingester.push-circuit-breaker.circuit-breaker.enabled
-ingester.push-circuit-breaker.failure-threshold-percentage
-ingester.push-circuit-breaker.failure-execution-threshold
-ingester.push-circuit-breaker.thresholding-period
-ingester.push-circuit-breaker.cooldown-period
-ingester.push-circuit-breaker.initial-delay
-ingester.push-circuit-breaker.request-timeout
-ingester.read-circuit-breaker.circuit-breaker.enabled
-ingester.read-circuit-breaker.failure-threshold-percentage
-ingester.read-circuit-breaker.failure-execution-threshold
-ingester.read-circuit-breaker.thresholding-period
-ingester.read-circuit-breaker.cooldown-period
-ingester.read-circuit-breaker.initial-delay
-ingester.read-circuit-breaker.request-timeout
- Add variance to chunks end time to spread writing across time (
- Querier
- Limiting queries based on the estimated number of chunks that will be used (
-querier.max-estimated-fetched-chunks-per-query-multiplier
) - Max concurrency for tenant federated queries (
-tenant-federation.max-concurrent
) - Maximum response size for active series queries (
-querier.active-series-results-max-size-bytes
) - Enable PromQL experimental functions (
-querier.promql-experimental-functions-enabled
) - Allow streaming of
/active_series
responses to the frontend (-querier.response-streaming-enabled
) - Mimir query engine (
-querier.query-engine=mimir
and-querier.enable-query-engine-fallback
, and all flags beginning with-querier.mimir-query-engine
) - Maximum estimated memory consumption per query limit (
-querier.max-estimated-memory-consumption-per-query
) - Ignore deletion marks while querying delay (
-blocks-storage.bucket-store.ignore-deletion-marks-while-querying-delay
)
- Limiting queries based on the estimated number of chunks that will be used (
- Query-frontend
-query-frontend.querier-forget-delay
- Instant query splitting (
-query-frontend.split-instant-queries-by-interval
) - Lower TTL for cache entries overlapping the out-of-order samples ingestion window (re-using
-ingester.out-of-order-window
from ingesters) - Query blocking on a per-tenant basis (configured with the limit
blocked_queries
) - Sharding of active series queries (
-query-frontend.shard-active-series-queries
) - Server-side write timeout for responses to active series requests (
-query-frontend.active-series-write-timeout
) - Caching of non-transient error responses (
-query-frontend.cache-errors
,-query-frontend.results-cache-ttl-for-errors
)
- Query-scheduler
-query-scheduler.querier-forget-delay
- Store-gateway
- Eagerly loading some blocks on startup even when lazy loading is enabled
-blocks-storage.bucket-store.index-header.eager-loading-startup-enabled
- Eagerly loading some blocks on startup even when lazy loading is enabled
- Read-write deployment mode
- API endpoints:
/api/v1/user_limits
/api/v1/cardinality/active_series
- Metric separation by an additionally configured group label
-validation.separate-metrics-group-label
-max-separate-metrics-groups-per-user
- Vault
- Fetching TLS secrets from Vault for various clients (
-vault.enabled
) - Vault client authentication token lifetime watcher. Ensures the client token is always valid by renewing the token lease or re-authenticating. Includes the metrics:
cortex_vault_token_lease_renewal_active
cortex_vault_token_lease_renewal_success_total
cortex_vault_auth_success_total
- Fetching TLS secrets from Vault for various clients (
- Logger
- Rate limited logger support
log.rate-limit-enabled
log.rate-limit-logs-per-second
log.rate-limit-logs-burst-size
- Rate limited logger support
- Memcached client
- Customise write and read buffer size
-<prefix>.memcached.write-buffer-size-bytes
-<prefix>.memcached.read-buffer-size-bytes
- Customise write and read buffer size
- Timeseries Unmarshal caching optimization in distributor (
-timeseries-unmarshal-caching-optimization-enabled
) - Reusing buffers for marshalling write requests in distributors (
-distributor.write-requests-buffer-pooling-enabled
) - Logging of requests that did not send any HTTP request:
-server.http-log-closed-connections-without-response-enabled
. - Ingester: track “owned series” and use owned series instead of in-memory series for tenant limits.
-ingester.use-ingester-owned-series-for-limits
-ingester.track-ingester-owned-series
-ingester.owned-series-update-interval
- Server
- PROXY protocol support
-server.proxy-protocol-enabled
- PROXY protocol support
- Kafka-based ingest storage
-ingest-storage.*
-ingester.partition-ring.*
Deprecated features
Deprecated features are usable up until the release that indicates their removal. For details about what deprecated means, see Parameter lifecycle.
The following features or configuration parameters are currently deprecated and will be removed in a future release (to be announced):
- Rule group configuration file
evaluation_delay
field: usequery_offset
instead
- Support for Redis-based caching