Important: This documentation is about an older version. It's relevant only to the release noted, many of the features and functions have been updated or replaced. Please view the current version.
Grafana Enterprise Metrics downloads
Releases
v2.6.1 – April 21st 2023
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.6.1(digest:- sha256:f53020d39b991143cd88b1ef0474ad40b589672eb04f0f379e4e579a388957f5)
- License: Grafana Labs license 
Changelog
- [ENHANCEMENT] Update all base images from alpine:3.17.1toalpine:3.17.3.
- [BUGFIX] Updated Go to version 1.19.8 to fix CVE-2023-24538.
Upstream Grafana Mimir details
- Version: 2.6.0
- Hash: 27698f399fc9e13c6fe0a8c79f882993814fda4a
- Changelog: CHANGELOG.md
v2.6.0 – February 22nd 2023
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.6.0(digest:- sha256:51c6c3f9decc4a4422b0ac34e88a278a6f8a992032747159220935c828f3d372)
- License: Grafana Labs license 
Changelog
- [CHANGE] Graphite querier: add experimental support for optional rate limitting at the subqueries level, using the .max-concurrent-sub-queries-per-requestflag.
- [CHANGE] Graphite querier: the storage aggregation method set in storage-aggregation.confcan no longer be overridden during runtime usingconsolidateBywhen metrictank is used as a render engine for remote queries.
- [CHANGE] Graphite querier: /tags/autoComplete/valuesnow takes input time range into account. Previously only tag values in the last hour were returned.
- [CHANGE] Graphite querier: The to/untilparameter for/findand/tagsendpoints are now respected by the querier, though ifto/untilis greater than the current time, it’s adjusted to the current time. Previously this value was always overwritten by the current time.
- [FEATURE] Graphite querier: Add support for /metrics/expandendpoint.
- [ENHANCEMENT] Refactor caching logic in the versioned bucket client to reduce the number of requests to object storage.
- [ENHANCEMENT] Update all base images from alpine:3.16.2toalpine:3.17.1.
- [BUGFIX] Graphite querier: flush metric name cache for remote queries.
- [BUGFIX] Graphite querier: fix panic when running certain combinations of functions.
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance.
Upstream Grafana Mimir details
- Version: 2.6.0
- Hash: 27698f399fc9e13c6fe0a8c79f882993814fda4a
- Changelog: CHANGELOG.md
v2.5.2 – February 17th 2023
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.5.2(digest:- sha256:760a027454e44cf3c817359eb7c27482b8355d7254dea15b4714c8316a43c011)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance
Upstream Grafana Mimir details
- Version: 2.5.0
- Hash: 25533fdfcf5d8e26ee8f49bcaf13e30bd678d4b3
- Changelog: CHANGELOG.md
v2.5.1 – January 6th 2023
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.5.1(digest:- sha256:da2a349151c1fe42c7e952aef1835ce33ab88fbd21f201e0aa059aa8d8e8a4bb)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Fix empty buildinfo in GEM binary
Upstream Grafana Mimir details
- Version: 2.5.0
- Hash: 25533fdfcf5d8e26ee8f49bcaf13e30bd678d4b3
- Changelog: CHANGELOG.md
v2.5.0 – December 15th 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.5.0(digest:- sha256:ffe4873a520c981a0c3ea2ef0531846cad38cf2a96dc62d6a3432bb0a609c95e)
- License: Grafana Labs license 
Changelog
- [CHANGE] Flag -*.azure.msi-resource is now ignored, and will be removed in Mimir 2.7. This setting is now made automatically by Azure.
- [CHANGE] Graphite querier: the storage aggregation method set in storage-aggregation.conf can no longer be overridden during runtime using consolidateBy when metrictank is used as a render engine. This matches Graphite’s behavior.
- [CHANGE] Graphite querier: caches default TTL is now lowered to 10 minutes. This is done to keep consistency in the event that out-of-order ingestion is enabled in mimir so that graphite queries answer with latest available data instead of caching responses for days.
- [ENHANCEMENT] Added .tls-min-version and .tls-cipher-suites flags to configure cipher suites and min TLS version supported by servers. 
- [ENHANCEMENT] All: Add clustername label to cpu usage metrics (cortex_quota_cpu_count, cortex_quota_gomaxprocs, cortex_quota_cgroup_cpu_max, cortex_quota_cgroup_cpu_period). The value is the cluster name in the GEM license.
- [ENHANCEMENT] Add recording rules to fulfill requirements for all Mimir mixin dashboards. The following recording rules have been added to GEM Self Monitoring to better align with the Mimir mixin: target:cortex_ingester_queried_exemplars:99quantile target:cortex_ingester_queried_exemplars:50quantile target:cortex_ingester_queried_exemplars:avg target:cortex_ingester_queried_exemplars_bucket:sum_rate target:cortex_ingester_queried_exemplars_sum:sum_rate target:cortex_ingester_queried_exemplars_count:sum_rate target_instance:cortex_alertmanager_alerts:sum target_instance:cortex_alertmanager_silences:sum target:cortex_alertmanager_state_replication_total:rate5m target:cortex_alertmanager_state_replication_failed_total:rate5m cortex_alertmanager_alerts_invalid_total:rate5m target:cortex_alertmanager_alerts_received_total:rate5m target:cortex_alertmanager_partial_state_merges_total:rate5m target:cortex_alertmanager_partial_state_merges_failed_total:rate5m target_integration:cortex_alertmanager_notifications_total:rate5m target_integration:cortex_alertmanager_notifications_failed_total:rate5m
- [ENHANCEMENT] Optimise the latest version lookup mechanism for versioned bucket client to reduce count of requests to object storage. Now it uses binary search instead of using sequential search.
- [BUGFIX] Fixed a bug in the Graphite querier where render requests that failed to be processed by the native engine were not being proxied to Graphite web.
Upstream Grafana Mimir details
- Version: 2.5.0
- Hash: 25533fdfcf5d8e26ee8f49bcaf13e30bd678d4b3
- Changelog: CHANGELOG.md
v2.4.1 – February 17th 2023
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.4.1(digest:- sha256:86bc899a450e1f052e2dd4d7fd55435711068cfefa6d5da541a1fa59feaa69e6)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance
Upstream Grafana Mimir details
- Version: 2.4.0
- Hash: 32137ee2c4c41fa649abfb9582e1f33a9e13363b
- Changelog: CHANGELOG.md
v2.4.0 – November 14th 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.4.0(digest:- sha256:1f56acfb6c9ddbb5d6e961401ba55963ae51752889a1b5536b840837df8f44be)
- License: Grafana Labs license 
Changelog
- [CHANGE] CarbonAPI is now being used instead of MetricTank as the default native query engine for the Graphite querier.
- [CHANGE] Enterprise metrics docker image no longer requests the CAP_NET_BIND_SERVICE capability as the default HTTP port was changed from 80 to 8080.- If you set -server.http-listen-portor-server.grpc-listen-portto a value lower than 1024, then you need to modify your configuration- When using Docker provide the flag - --cap-add net_bind_service.
- When using the - mimir-distributedHelm chart, make sure that all the GEM components have the following additional securityContext setting in their respective values file sections:- securityContext: sysctls: - name: net.ipv4.ip_unprivileged_port_start value: "0" # might be set to the lowest listen port number as well
 
 
- If you set 
- [FEATURE] Added a new flag -graphite.querier.cache-ttlto the Graphite querier to configure the TTL of cached metric names and aggregation configs.
- [FEATURE] Added optional rate limiting capabilities to the Graphite querier.- This can be configured using the following flags:- -graphite.querier.rate-limit-enabled
- -graphite.querier.rate-limit-qps
- -graphite.querier.tenant-rate-limit-qps
- -graphite.querier.heavy-rate-limit-qps
 
 
- This can be configured using the following flags:
- [ENHANCEMENT] Ruler: Add <prometheus-http-prefix>/api/v1/status/buildinfoendpoint.
- [ENHANCEMENT] Update all build images to use Go 1.19.2.
- [BUGFIX] Fix CVE-2022-44643
Upstream Grafana Mimir details
- Version: 2.4.0
- Hash: 32137ee2c4c41fa649abfb9582e1f33a9e13363b
- Changelog: CHANGELOG.md
v2.3.2 – February 17th 2023
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.3.2(digest:- sha256:dfba678a8b13647634dc9fa021ff1e8d9f23741d2e8590552c5ad22bedd59c81)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance
Upstream Grafana Mimir details
- Version: v2.3.1
- Hash: e18bcff2f8648c4edb8446bff0256a070717684c
- Changelog: CHANGELOG.md
v2.3.1 – November 14th 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.3.1(digest:- sha256:d697519012b4f8307ea3f39774235e99d0e5f9c498c7e93685551597179340b3)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Fix CVE-2022-44643
Upstream Grafana Mimir details
- Version: v2.3.1
- Hash: e18bcff2f8648c4edb8446bff0256a070717684c
- Changelog: CHANGELOG.md
v2.3.0 – September 28th 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.3.0(digest:- sha256:0cb46f23551037c8f9df40572d5a09876a04cb59536ff7a06eb558c2e1bf558e)
- License: Grafana Labs license 
Changelog
- [CHANGE] Gateway: Dial timeout now defaults to 5s instead of 30s.
- [CHANGE] Gateway: Dialing gRPC proxy backends during startup now blocks until the connection is established.
- [FEATURE] Gateway: Add support for TSDB block upload routes.
- [FEATURE] Admin client: commonconfig block introduced in Mimir now configures Admin Client in GEM too.
- [ENHANCEMENT] Gateway: the CLI flag -gateway.request.limithas been added for configuring request limiter middleware.
- [ENHANCEMENT] Update all build images to use Go 1.18.6.
- [ENHANCEMENT] Update all images to use Alpine 3.16.2.
- [ENHANCEMENT] Gateway: Dial timeout is now configurable via -gateway.proxy.*.dial-timeout.
- [BUGFIX] Gateway: Expose /distributor/ring endpoint on the distributors.
- [BUGFIX] LBAC: some query limits would not be applied for requests that use LBAC.
Upstream Grafana Mimir details
- Version: v2.3.1
- Hash: e18bcff2f8648c4edb8446bff0256a070717684c
- Changelog: CHANGELOG.md
v2.2.0 – July 21st 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.2.0(digest:- sha256:5165f84eeb399c1701757efc5a3f9219422bc43935bf995ea7e3d31417b2d6cb)
- License: Grafana Labs license 
Changelog
- [CHANGE] Ruler: /api/v1/rules*and/prometheus/rules*configuration endpoints are removed in favour of/prometheus/config/v1/rules*. Requests through the gateway are unaffected.
- [CHANGE] The remote subquerier for the Graphite query proxy is no longer optional- The following CLI flags (and their respective YAML config options) have been removed:- -graphite.querier.enable-remote-subquerier
- -graphite.querier.use-remote-results
 
 
- The following CLI flags (and their respective YAML config options) have been removed:
- [CHANGE] The YAML config options for the datadog.apihave been broken out intodatadog.read_apianddatadog.write_api
- [ENHANCEMENT] Admin-client: added experimental support for refreshing authentication cache entries before they expire. When enabled, a cache entry is refreshed and its time to live is extended if it is retrieved and has less than or equal to -auth.cache.refresh.refresh-ttltime left to live in the cache.- The following CLI flags (and their respective YAML config options) have been added:- -auth.cache.refresh.buffer
- -auth.cache.refresh.concurrency
- -auth.cache.refresh.enabled
- -auth.cache.refresh.refresh-ttl
- -auth.cache.refresh.retry-interval
 
 
- The following CLI flags (and their respective YAML config options) have been added:
- [ENHANCEMENT] Gateway: Rewrite requests to deleted ruler configuration endpoints to use supported endpoints.
- [BUGFIX] Docs: Make config category labels consistent across command-line help and generated documentation.
Upstream Grafana Mimir details
- Version: v2.2.0
- Hash: 65344e2ed2cf305b50de805824026f5c5a6fadcf
- Changelog: CHANGELOG.md
v2.1.0 – June 2nd 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.1.0(digest:- sha256:d02650b34c77cb5130b23790c958f658de8a5634f4d66f24dcb631ae7ba34b99)
- License: Grafana Labs license 
Changelog
- [FEATURE] Ruler: Added support for expression remote evaluation.- The following CLI flags (and their respective YAML config options) have been added:- -ruler.query-frontend.address
- -ruler.query-frontend.auth-token
- -ruler.query-frontend.tls-enabled
- -ruler.query-frontend.tls-ca-path
- -ruler.query-frontend.tls-cert-path
- -ruler.query-frontend.tls-key-path
- -ruler.query-frontend.tls-server-name
- -ruler.query-frontend.tls-insecure-skip-verify
 
 
- The following CLI flags (and their respective YAML config options) have been added:
- [ENHANCEMENT] Self-monitoring: Emit OOM kill and page fault metrics as part of self-monitoring.
- [BUGFIX] Ruler API: Ruler Limits are now enforced during rule group creation.
- [BUGFIX] Authentication: Expose internal errors during authentication only in logs, not to clients.
Upstream Grafana Mimir details
- Version: v2.1.0
- Hash: 3cff860d16e08d14e8aaa10649053a9c0f0f15a7
- Changelog: CHANGELOG.md
v2.0.1 – April 14th 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.0.1(digest:- sha256:30c80aa0612aed4e0bab24f9e5c817a112f0bbdfa7b51404a069474d706ceaee)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Authentication: Only include active tenants when resolving the wildcard tenant (*).
Upstream Grafana Mimir details
No changes since GEM v2.0.0:
- Version: v2.0.0
- Hash: 9fd2da5d3dc764fc00e4396a5c0ddd12ccebb00d
- Changelog: CHANGELOG.md
v2.0.0 – April 13th 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v2.0.0(ID:- sha256:43ed80839bd0cb1d799087d5591a8873cfaead182683055bbb8aa207efcf8a5f, Repo digest:- sha256:338bbcf64ea051cc3911908b977ae3b7bb8ed65342e7a2f8df3f781aa0f5e61a)
- License: Grafana Labs license 
Changelog
- [CHANGE] Admin-API: enable leader election by default
- [CHANGE] Change default value of instrumentation.enabledtotrue
- [CHANGE] Graphite Querier: The GRPC server is now registered to enable subquerier requests. This requires using the flag EnableRemoteSubquerier.
- [CHANGE] Graphite Querier: The remote read query is now the default behavior. Also, the previous implementation has been removed.
- [CHANGE] Admin-API: Change auth.typedefault fromtrusttoenterprise
- [CHANGE] Limits: The max_series_per_querylimit has been removed from the Admin API and runtime configuration and is no longer enforced by GEM during queries.
- [CHANGE] Graphite: The Graphite Querier and Graphite Write Proxy have been removed from single binary mode (the alltarget). They can still be run using thegraphite-querierandgraphite-write-proxytargets, respectively.
- [CHANGE] Query-frontend and Graphite Querier: migrated memcached backend client to the same one used in other components (memcached config and metrics are now consistent across all services).- The following CLI flags (and their respective YAML config options) have been added:- -graphite.querier.metric-name-cache.backend(set it to- memcached)
- -graphite.querier.aggregation-cache.backend(set it to- memcached)
 
- The following CLI flags (and their respective YAML config options) have been changed:- -graphite.querier.metric-name-cache.memcached.hostnameand- -graphite.querier.metric-name-cache.memcached.service: use- -graphite.querier.metric-name-cache.memcached.addressesinstead
- -graphite.querier.aggregation-cache.memcached.hostnameand- -graphite.querier.aggregation-cache.memcached.service: use- -graphite.querier.aggregation-cache.memcached.addressesinstead
 
- The following CLI flags (and their respective YAML config options) have been renamed:- -graphite.querier.metric-name-cache.background.write-back-concurrencyrenamed to- -graphite.querier.metric-name-cache.memcached.max-async-concurrency
- -graphite.querier.metric-name-cache.background.write-back-bufferrenamed to- -graphite.querier.metric-name-cache.memcached.max-async-buffer-size
- -graphite.querier.metric-name-cache.memcached.batchsizerenamed to- -graphite.querier.metric-name-cache.memcached.max-get-multi-batch-size
- -graphite.querier.metric-name-cache.memcached.parallelismrenamed to- -graphite.querier.metric-name-cache.memcached.max-get-multi-concurrency
- -graphite.querier.metric-name-cache.memcached.timeoutrenamed to- -graphite.querier.metric-name-cache.memcached.timeout
- -graphite.querier.metric-name-cache.memcached.max-item-sizerenamed to- -graphite.querier.metric-name-cache.memcached.max-item-size
- -graphite.querier.metric-name-cache.memcached.max-idle-connsrenamed to- -graphite.querier.metric-name-cache.memcached.max-idle-connections
- -graphite.querier.aggregation-cache.background.write-back-concurrencyrenamed to- -graphite.querier.aggregation-cache.memcached.max-async-concurrency
- -graphite.querier.aggregation-cache.background.write-back-bufferrenamed to- -graphite.querier.aggregation-cache.memcached.max-async-buffer-size
- -graphite.querier.aggregation-cache.memcached.batchsizerenamed to- -graphite.querier.aggregation-cache.memcached.max-get-multi-batch-size
- -graphite.querier.aggregation-cache.memcached.parallelismrenamed to- -graphite.querier.aggregation-cache.memcached.max-get-multi-concurrency
- -graphite.querier.aggregation-cache.memcached.timeoutrenamed to- -graphite.querier.aggregation-cache.memcached.timeout
- -graphite.querier.aggregation-cache.memcached.max-item-sizerenamed to- -graphite.querier.aggregation-cache.memcached.max-item-size
- -graphite.querier.aggregation-cache.memcached.max-idle-connsrenamed to- -graphite.querier.aggregation-cache.memcached.max-idle-connections
 
- The following CLI flags (and their respective YAML config options) have been removed:- -graphite.querier.aggregation-cache.default-validity: new setting is hardcoded to 7 days
- -graphite.querier.aggregation-cache.memcached.circuit-breaker-consecutive-failures: feature removed
- -graphite.querier.aggregation-cache.memcached.circuit-breaker-interval: feature removed
- -graphite.querier.aggregation-cache.memcached.circuit-breaker-timeout: feature removed
- -graphite.querier.aggregation-cache.memcached.consistent-hash: new setting is always enabled
- -graphite.querier.aggregation-cache.memcached.update-interval: new setting is hardcoded to 30s
- -graphite.querier.metric-name-cache.default-validityand- -frontend.memcached.expiration: new setting is hardcoded to 7 days
- -graphite.querier.metric-name-cache.memcached.circuit-breaker-consecutive-failures: feature removed
- -graphite.querier.metric-name-cache.memcached.circuit-breaker-interval: feature removed
- -graphite.querier.metric-name-cache.memcached.circuit-breaker-timeout: feature removed
- -graphite.querier.metric-name-cache.memcached.consistent-hash: new setting is always enabled
- -graphite.querier.metric-name-cache.memcached.update-interval: new setting is hardcoded to 30s
 
- The following metrics have been changed:- cortex_cache_dropped_background_writes_total{name}changed to- thanos_memcached_operation_skipped_total{name, operation, reason}
- cortex_cache_value_size_bytes{name, method}changed to- thanos_memcached_operation_data_size_bytes{name}
- cortex_cache_request_duration_seconds{name, method, status_code}changed to- thanos_memcached_operation_duration_seconds{name, operation}
- cortex_cache_fetched_keys{name}changed to- thanos_cache_memcached_requests_total{name}
- cortex_cache_hits{name}changed to- thanos_cache_memcached_hits_total{name}
- cortex_memcache_request_duration_seconds{name, method, status_code}changed to- thanos_memcached_operation_duration_seconds{name, operation}
- cortex_memcache_client_servers{name}changed to- thanos_memcached_dns_provider_results{name, addr}
- cortex_memcache_client_set_skip_total{name}changed to- thanos_memcached_operation_skipped_total{name, operation, reason}
- cortex_dns_lookups_totalchanged to- thanos_memcached_dns_lookups_total
- For all metrics the value of the “name” label has changed from frontend.memcachedtofrontend-cache.
- Above mentioned metrics are now also available with name=metric-nameand name=aggregationsfor caches used by Graphite Querier.
 
- The following metrics have been removed:- cortex_cache_background_queue_length{name}
 
 
- The following CLI flags (and their respective YAML config options) have been added:
- [CHANGE] Compactor: -compactor.compaction-strategyoption removed. The only compactor that can be now used is “split and merge” compactor.
- [CHANGE] Graphite: Enabled distributed subqueries by default and renamed remote_writeYAML flags.- -graphite.querier.use-remote-resultsand- -graphite.querier.enable-remote-subqueriernow default to- true. This means by default subqueries will be distributed across queriers.
- remote_writeYAML flags have been renamed:- keepalivehas been renamed to- keep_alive
- maxidleconnshas been renamed to- max_idle_conns
- maxconnshas been renamed to- max_conns
- skiplabelvalidationhas been renamed to- skip_label_validation.
 
 
- [FEATURE] Admin-API Deletion Markers:- Update statusfield in tenants
- Add statusfield to access policies and tokens
- Add new Admin API v3 endpoints with soft-deletion of entities- /admin/api/v3/accesspolicies
- /admin/api/v3/clusters
- /admin/api/v3/features
- /admin/api/v3/licenses
- /admin/api/v3/tenants
- /admin/api/v3/tokens
 
- List endpoints only return entities in active status
- Update HTTP authentication layer to only authorize requests of active entities
- Update storage cache logic to only store the object’s latest version
- Add v3 endpoints to gateway routes
 
- Update 
- [FEATURE] Graphite Write Proxy: Added -graphite.remote-write-proxy.enabled,-graphite.remote-write-proxy.write-endpointand-graphite.write-proxy.skip-label-validationto enhance the internal series write performance of the graphite writer. It’s recommended to enable this flag on every installation as soon as possible because it will become a default configuration in future releases.
- [FEATURE] Ruler: Added federated rule groups support.- Exposed cortex_ruler_sync_unauthorized_groupsmetric to track the number of skipped rule groups during storage synchronizations.
 
- Exposed 
- [FEATURE] Divide configuration parameters into categories “basic”, “advanced”, and “experimental”. Only flags in the basic category are shown when invoking -help, whereas-help-allwill include flags in all categories (basic, advanced, experimental).
- [FEATURE] Datadog: Added experimental support for ingesting and querying Datadog metrics by adding a Datadog translation layer on top of GEM.
- [FEATURE] Gateway: Forward requests to deprecated and removed endpoints in Mimir 2.0 (grafana/mimir#763) to their non-legacy equivalents.
- [ENHANCEMENT] Update all build images to use Go 1.17.8.
- [ENHANCEMENT] Admin-API: Allow the max_global_exemplars_per_userlimit to be set via the Admin API.
- [ENHANCEMENT] Admin-API: Enable compactor_blocks_retention_periodto be set on a per-tenant basis via the Admin API.
- [ENHANCEMENT] Querier: Apply Label Based Access Policy (LBAC) rules to exemplar endpoints.
- [ENHANCEMENT] Federation frontend: Add bearer_tokenconfiguration for proxy targets.
- [ENHANCEMENT] Self-monitoring: Add support for emitting exemplars as part of self-monitoring metrics.
- [ENHANCEMENT] Federation frontend: Return richer error when downstream data source is failed.
- [BUGFIX] Graphite: no need to configure Mimir’s queryable when starting only -target=graphite-querier.
- [BUGFIX] Graphite: When configured with enterprise authentication, requests sent to cortex remote read api now forward authorization headers if present.
- [BUGFIX] LBAC: Filter label values using LBAC policies correctly.
- [BUGFIX] Authentication: HTTP 500 errors are now returned for transient errors while attempting to authenticate user requests.
- [BUGFIX] Authentication: Do not cache transient errors while attempting to authenticate user requests.
- [BUGFIX] Config: Enterprise configuration extensions now appear in the /configendpoint
- [BUGFIX] Admin: Validate the access policy name used for token generation.
- [BUGFIX] Admin: Fixed a cosmetic issue that could report an incorrect license expiration timestamp in the metric grafana_labs_license_expiry_timestampif multiple valid licenses exist in local storage and object storage.
- [BUGFIX] Gateway: All Alertmanager endpoints are correctly proxied to the alertmanager backend proxy.
Previously, only the /alertmanagerendpoint was proxied. Users were able to authenticate but not access the alerts UI page at/alertmanager/#/alerts.
Upstream Grafana Mimir details
- Version: v2.0.0
- Hash: 9fd2da5d3dc764fc00e4396a5c0ddd12ccebb00d
- Changelog: CHANGELOG.md
v1.7.1 – November 14th 2022
Links
- Binary (Linux AMD64) 
- Docker image: run - docker pull grafana/enterprise-metrics:v1.7.1(digest:- sha256:84576bd0bab9beb98f6c93e6b9d91dc4efc3e5434747c43ca2bd84863219c8c6)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Fix CVE-2022-44643
v1.7.0 – January 6th 2022
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.7.0(digest:- sha256:286ce03b3dcd50c7924ee6860d58b2bd7986c9548cc6fe6207d23b0212883c33)
- License: Grafana Labs license 
Changelog
- [FEATURE] Admin-API: Added support for Azure Storage
- [ENHANCEMENT] Federation Frontend: Propagate requests’ bearer token when it is present.
- [ENHANCEMENT] Federation Frontend: Support TLS configuration for targets.
v1.6.2 – January 6th 2022
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.6.2(digest:- sha256:48fef5ef7a339d766274a37448e1c3745fde53ec0e2f4eab1a8a093a786d41d2)
- License: Grafana Labs license 
Changelog
- [BUGFIX] GEM update from v1.5.0 (or older) to v.1.6+ will not invalidate tenant limits set via API anymore.
v1.6.1 – November 18th 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.6.1(digest:- sha256:66f9eb4cee53df7b95860b1d094cae1dca88e1724de3695fec0449f92fe1db90)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Admin-API: Make sure that read-path limits inherit defaults from global limits.
v1.6.0 – November 15th 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.6.0(digest:- sha256:1e01fe4d792b53b9a4d37c38a612c2027582d6d7248f567ed31e2ed6102c035d)
- License: Grafana Labs license 
Changelog
- [CHANGE] Admin-client: Rename the “default” auth method to “trust”.
- [CHANGE] License: Deprecated flag -bootstrap.license.pathhas been removed. The new flag to use for specifying a license is-license.path.
- [CHANGE] Ruler: endpoints for listing rules (/api/v1/rules,/api/v1/rules/{namespace}) now return HTTP status code 200 and an empty map when there are no rules instead of an HTTP 404 and plain text error message.
- [CHANGE] Query-frontend: added shardedlabel tocortex_query_seconds_totalmetric.
- [CHANGE] Query-frontend: changed the flag name for controlling query sharding total shards from -querier.total-shardsto-frontend.query-sharding-total-shards.
- [CHANGE] Flag -querier.parallelise-shardable-querieshas been renamed to-query-frontend.parallelize-shardable-queries
- [CHANGE] Querier/ruler: Option -querier.ingester-streaminghas been removed. Querier/ruler now always use streaming method to query ingesters.
- [CHANGE] Limits: Option -ingester.max-samples-per-queryis now deprecated. YAML fieldmax_samples_per_queryis no longer supported. It required-querier.ingester-streamingoption to be set to false, but since-querier.ingester-streamingis removed (always defaulting to true), the limit using it was removed as well.
- [CHANGE] Limits: Set the default max number of inflight ingester push requests (-ingester.instance-limits.max-inflight-push-requests) to 30000 in order to prevent clusters from being overwhelmed by request volume or temporary slow-downs.
- [CHANGE] Update Go version to 1.16.9.
- [CHANGE] Admin-API: Require that tenant updates include the statusfield.
- [FEATURE] Querier: Added label names cardinality endpoint <prefix>/api/v1/cardinality/label_namesthat is disabled by default. Can be enabled/disabled via the CLI flag-querier.cardinality-analysis-enabledor its respective YAML config option. Configurable on a per-tenant basis.
- [FEATURE] Querier: Added label values cardinality endpoint <prefix>/api/v1/cardinality/label_valuesthat is disabled by default. Can be enabled/disabled via the CLI flag-querier.cardinality-analysis-enabledor its respective YAML config option. Configurable on a per-tenant basis.
- [FEATURE] Compactor: added support for a new compaction strategy -compactor.compaction-strategy=split-and-merge. When thesplit-and-mergecompactor is used, source blocks for a given tenant are grouped into-compactor.split-groupsnumber of groups. Each group of blocks is then compacted separately, and is split into-compactor.split-and-merge-shardsshards (configurable on a per-tenant basis). Compaction of each tenant shards can be horizontally scaled. Number of compactors that work on jobs for single tenant can be limited by using-compactor.compactor-tenant-shard-sizeparameter, or per-tenantcompactor_tenant_shard_sizeoverride.
- [FEATURE] Query Frontend: Updated experimental querysharding for the blocks storage. You can now enabled querysharding for blocks storage (-store.engine=blocks) by setting-query-frontend.parallelize-shardable-queriestotrue. The following additional config and exported metrics have been added.- New config options:- -frontend.query-sharding-total-shards: The amount of shards to use when doing parallelisation via query sharding.
- -frontend.query-sharding-max-sharded-queries: The max number of sharded queries that can be run for a given received query. 0 to disable limit.
- -blocks-storage.bucket-store.series-hash-cache-max-size-bytes: Max size - in bytes - of the in-memory series hash cache in the store-gateway.
- -blocks-storage.tsdb.series-hash-cache-max-size-bytes: Max size - in bytes - of the in-memory series hash cache in the ingester.
 
- New exported metrics:- cortex_bucket_store_series_hash_cache_requests_total
- cortex_bucket_store_series_hash_cache_hits_total
- cortex_frontend_query_sharding_rewrites_succeeded_total
- cortex_frontend_sharded_queries_per_query
 
- Renamed metrics:- cortex_frontend_mapped_asts_totalto- cortex_frontend_query_sharding_rewrites_attempted_total
 
- Modified metrics:- added shardedlabel tocortex_query_seconds_total
 
- added 
- When query sharding is enabled, the following querier config must be set on query-frontend too:- -querier.max-concurrent
- -querier.timeout
- -querier.max-samples
- -querier.at-modifier-enabled
- -querier.default-evaluation-interval
- -querier.active-query-tracker-dir
- -querier.lookback-delta
 
- Sharding can be dynamically controlled per request using the Sharding-Control: 64header. (0 to disable)
- Sharding can be dynamically controlled per tenant using the limit query_sharding_total_shards. (0 to disable)
- Added sharded_queriescount to the “query stats” log.
- Number of shards is adjusted to be compatible with number of compactor shards used by split-and-merge compactor. Querier can use this to avoid querying blocks that cannot have series in given query shard. This only works when using split-and-merge compactor.
 
- New config options:
- [FEATURE] Graphite: Added -graphite.querier.remote-read-enabledand-graphite.querier.query-addressto enhance the internal query performance of the graphite querier. It’s recommended to enable this flag on every installation as soon as possible because it will become a default configuration in future releases.
- [FEATURE] Ingester: Enable snapshotting of in-memory TSDB on disk during shutdown via -blocks-storage.tsdb.memory-snapshot-on-shutdown.
- [FEATURE] Query-Frontend: Added -query-frontend.cache-unaligned-requestsoption to cache responses for requests that do not have step-aligned start and end times. This can improve speed of repeated queries, but can also pollute cache with results that are never reused.
- [ENHANCEMENT] Admin-client: Make the cluster_name configuration optional.
- [ENHANCEMENT] Admin-API: Add new Admin API v2 endpoints that replace the term ‘instance’ used in version v1 with the term ’tenant’- /admin/api/v2/accesspolicies
- /admin/api/v2/clusters
- /admin/api/v2/features
- /admin/api/v2/licenses
- /admin/api/v2/tenants
- /admin/api/v2/tokens
 
- [ENHANCEMENT] LBAC: Optimize filtering when using single selector in LBAC policy by passing matchers to downstream querier.
- [ENHANCEMENT] Distributor: reduce latency when HA-Tracking by doing KVStore updates in the background.
- [ENHANCEMENT] Compactor: when sharding is enabled, skip already planned compaction jobs if the tenant doesn’t belong to the compactor instance anymore.
- [ENHANCEMENT] Compactor: Blocks cleaner will ignore users that it no longer “owns” when sharding is enabled, and user ownership has changed since last scan.
- [ENHANCEMENT] Query federation: improve performance in MergeQueryable by memoizing labels.
- [ENHANCEMENT] Querier / store-gateway: optimized regex matchers.
- [ENHANCEMENT] Query-frontend: added cortex_query_frontend_non_step_aligned_queries_totalto track the total number of range queries with start/end not aligned to step.
- [ENHANCEMENT] Compactor: added -compactor.compaction-jobs-ordersupport to configure which compaction jobs should run first for a given tenant (in case there are multiple ones). Supported values are:smallest-range-oldest-blocks-first(default),newest-blocks-first(not supported bydefaultcompaction strategy).
- [ENHANCEMENT] Add option (-querier.label-values-max-cardinality-label-names-per-request) to configure the maximum number of label names allowed to be queried in a single<prefix>/api/v1/cardinality/label_valuesAPI call.
- [ENHANCEMENT] Make distributor inflight push requests count include background calls to ingester.
- [ENHANCEMENT] Store-gateway: added an in-memory LRU cache for chunks attributes. Can be enabled setting -blocks-storage.bucket-store.chunks-cache.attributes-in-memory-max-items=XwhereXis the max number of items to keep in the in-memory cache. The following new metrics are exposed:- cortex_cache_memory_requests_total
- cortex_cache_memory_hits_total
- cortex_cache_memory_items_count
 
- [ENHANCEMENT] Store-gateway: log index cache requests to tracing spans.
- [ENHANCEMENT] Ingester: reduce CPU and memory utilization if remote write requests contains a large amount of “out of bounds” samples.
- [ENHANCEMENT] Ingester: reduce CPU and memory utilization when querying chunks from ingesters.
- [ENHANCEMENT] Querier: when fetching data for specific query-shard, we can ignore some blocks based on compactor-shard ID, since sharding of series by query sharding and compactor is the same. Added metrics:- cortex_querier_blocks_found_total
- cortex_querier_blocks_queried_total
- cortex_querier_blocks_with_compactor_shard_but_incompatible_query_shard_total
 
- [ENHANCEMENT] Querier&Ruler: reduce cpu usage, latency and peak memory consumption.
- [ENHANCEMENT] Overrides Exporter: Add max_fetched_chunks_per_querylimit to the default and per-tenant limits exported as metrics.
- [BUGFIX] License: Fixed initialization of AWS subscription manager so it creates a cluster object if not present when running GEM as AWS Marketplace product.
- [BUGFIX] Admin-API: Change the way per-instance limits are stored to avoid breaking changes between versions.
- [BUGFIX] Self-monitoring: Ensure system rules adhere to the sharding configuration of the rulers.
- [BUGFIX] Graphite: fixed invalid labelerror when querying metrics with dashes in the tags.
- [BUGFIX] Authentication: Fix caching behavior to ensure tokens are eventually removed from the cache.
- [BUGFIX] Authentication: Enforce that instances must exist even when using wildcard access policies.
- [BUGFIX] Admin-API: Expose metrics cortex_admin_api_clientsandcortex_admin_client_is_leaderfor leader election correctly.
- [BUGFIX] Limits: Fix the way cortex_limits_admin_store_last_update_timestamp_secondsis set to emit a correct UNIX timestamp.
- [BUGFIX] Alertmanager: don’t replace user configurations with blank fallback configurations (when enabled), particularly during scaling up/down instances when sharding is enabled.
- [BUGFIX] Query-frontend: Ensure query_range requests handled by the query-frontend return JSON formatted errors.
- [BUGFIX] Query-frontend: don’t reuse cached results for queries that are not step-aligned.
- [BUGFIX] Querier: fixed UserStats endpoint. When zone-aware replication is enabled, MaxUnavailableZonesparam is used instead ofMaxErrors, so settingMaxErrors = 0doesn’t make the Querier wait for all Ingesters responses.
v1.5.1 – September 21st 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.5.1(digest:- sha256:079ed9d61a7ab0953afbfa76de8ab2d38d44ac17e630446bab4084b4aba0c2e4)
- License: Grafana Labs license 
Changelog
- [ENHANCEMENT] Add ADFS compatibility to our OIDC auth.
- [BUGFIX] Ruler: Use predictable names for Ruler WALs ensuring they are used after crashes and cleaned up.
v1.5.0 – August 24th 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.5.0(digest:- sha256:b0d98ffe49df461a524743a49dca26952a59c9c007231035e52f0a06e5003fff)
- License: Grafana Labs license 
Changelog
- [CHANGE] Alertmanager: allowed to configure the experimental receivers firewall on a per-tenant basis. The following CLI flags (and their respective YAML config options) have been changed and moved to the limits config section:- -alertmanager.receivers-firewall.block.cidr-networksrenamed to- -alertmanager.receivers-firewall-block-cidr-networks
- -alertmanager.receivers-firewall.block.private-addressesrenamed to- -alertmanager.receivers-firewall-block-private-addresses
 
- [CHANGE] Memberlist: Expose default configuration values to the command line options. Note that setting these explicitly to zero will no longer cause the default to be used. If the default is desired, then do set the option. The following are affected:- -memberlist.stream-timeout
- -memberlist.retransmit-factor
- -memberlist.pull-push-interval
- -memberlist.gossip-interval
- -memberlist.gossip-nodes
- -memberlist.gossip-to-dead-nodes-time
- -memberlist.dead-node-reclaim-time
 
- [CHANGE] Authentication: Access Policy names passed via a JWT token in the OIDC auth flow will be downcased before being matched against Access Policies in GEM. This improves interoperability between GEM and other systems since GEM only allows lowercase characters in Access Policy names
- [CHANGE] Change default value of -server.grpc.keepalive.min-time-between-pingsfrom5mto10sand-server.grpc.keepalive.ping-without-stream-allowedtotrue.
- [CHANGE] Changed -alertmanager.storage.typedefault value fromconfigdbtolocal.
- [CHANGE] Changed -ruler.storage.typedefault value fromconfigdbtolocal.
- [CHANGE] Cortex chunks storage has been deprecated and it’s now in maintenance mode: all Cortex users are encouraged to migrate to the blocks storage. No new features will be added to the chunks storage. The default Cortex configuration still runs the chunks engine; please check out the blocks storage doc on how to configure Cortex to run with the blocks storage.
- [CHANGE] Dependency: update go-redis from v8.2.3 to v8.9.0.
- [CHANGE] Deprecated the bootstraptarget in favor of thetokengentarget.
- [CHANGE] Enable strict JSON unmarshal for pkg/util/validation.Limitsstruct. The customUnmarshalJSON()will now fail if the input has unknown fields.
- [CHANGE] Graphite: proxy no longer generates generic metrics metadata. This helps to reduce ingestion rate as counted by Cortex and used for limits.
- [CHANGE] Ingester: Change default value of -ingester.active-series-metrics-enabledtotrue. This incurs a small increase in memory usage, between 1.2% and 1.6% as measured on ingesters with 1.3M active series.
- [CHANGE] License: Flag -bootstrap.license.pathhas been deprecated in favor of-license.path.
- [CHANGE] Memberlist: the memberlist_kv_store_value_byteshas been removed due to values no longer being stored in-memory as encoded bytes.
- [CHANGE] Querier / ruler: Change -querier.max-fetched-chunks-per-queryconfiguration to limit to maximum number of chunks that can be fetched in a single query. The number of chunks fetched by ingesters AND long-term storare combined should not exceed the value configured on-querier.max-fetched-chunks-per-query.
- [CHANGE] Querier / ruler: deprecated -store.query-chunk-limitCLI flag (and its respective YAML config optionmax_chunks_per_query) in favour of-querier.max-fetched-chunks-per-query(and its respective YAML config optionmax_fetched_chunks_per_query). The new limit specifies the maximum number of chunks that can be fetched in a single query from ingesters and long-term storage: the total number of actual fetched chunks could be 2x the limit, being independently applied when querying ingesters and long-term storage.
- [CHANGE] Query-frontend: Enable query stats by default, they can still be disabled with -frontend.query-stats-enabled=false.
- [CHANGE] Removed configdbsupport from Ruler and Alertmanager backend storages.
- [CHANGE] Removed log_messages_totalmetric.
- [CHANGE] Removed query sharding for the chunks storage. Query sharding is now only supported for blocks storage.
- [CHANGE] Renamed metric deprecated_flags_inuse_totalasdeprecated_flags_used_total.
- [CHANGE] Renamed metric experimental_features_in_use_totalasexperimental_features_used_total.
- [CHANGE] Some files and directories on local disk now have stricter permissions, and are only readable by owner, but not group or others.
- [CHANGE] The example Kubernetes manifests (stored at k8s/) have been removed due to a lack of proper support and maintenance.
- [CHANGE] Update Go version to 1.16.6.
- [FEATURE] Added flag -debug.block-profile-rateto enable goroutine blocking events profiling.
- [FEATURE] Alertmanager: Added -alertmanager.max-config-size-byteslimit to control size of configuration files that Cortex users can upload to Alertmanager via API. This limit is configurable per-tenant.
- [FEATURE] Alertmanager: Added -alertmanager.max-templates-countand-alertmanager.max-template-size-bytesoptions to control number and size of templates uploaded to Alertmanager via API. These limits are configurable per-tenant.
- [FEATURE] Alertmanager: Added rate-limits to notifiers. Rate limits used by all integrations can be configured using -alertmanager.notification-rate-limit, while per-integration rate limits can be specified via-alertmanager.notification-rate-limit-per-integrationparameter. Both shared and per-integration limits can be overwritten using overrides mechanism. These limits are applied on individual (per-tenant) alertmanagers. Rate-limited notifications are failed notifications. It is possible to monitor rate-limited notifications via newcortex_alertmanager_notification_rate_limited_totalmetric.
- [FEATURE] Alertmanager: support negative matchers, time-based muting - upstream release notes.
- [FEATURE] Allow for reporting CPU time usage to AWS Marketplace metering service in case GEM is running as AWS Marketplace container product.
- [FEATURE] Collect and store CPU time usage reports in Admin store, which can later be used to submit to metering services, such as the AWS Marketplace API
- [FEATURE] Querier/Ruler: Added new -querier.max-fetched-chunk-bytes-per-queryflag. When Cortex is running with blocks storage, the max chunk bytes limit is enforced in the querier and ruler and limits the size of all aggregated chunks returned from ingesters and storage as bytes for a query.
- [FEATURE] Querier: Added new -querier.max-fetched-series-per-queryflag. When Cortex is running with blocks storage, the max series per query limit is enforced in the querier and applies to unique series received from ingesters and store-gateway (long-term storage).
- [FEATURE] Query Frontend: Add cortex_query_fetched_chunks_totalper-user counter to expose the number of chunks fetched as part of queries. This metric can be enabled with the-frontend.query-stats-enabledflag (or its respective YAML config optionquery_stats_enabled).
- [FEATURE] Query Frontend: Add cortex_query_fetched_series_totalandcortex_query_fetched_chunks_bytes_totalper-user counters to expose the number of series and bytes fetched as part of queries. These metrics can be enabled with the-frontend.query-stats-enabledflag (or its respective YAML config optionquery_stats_enabled).
- [FEATURE] Query Frontend: Add experimental querysharding for the block storage. You can now enabled querysharding for block storage (-store.engine) by setting-querier.parallelise-shardable-queriestotrue.
- [FEATURE] Ruler Storage: S3 header extensions were added to the new ruler storage S3 config block.
- [FEATURE] Ruler: Add new -ruler.query-stats-enabledwhich when enabled will report thecortex_ruler_query_seconds_totalas a per-user metric that tracks the sum of the wall time of executing queries in the ruler in seconds.
- [FEATURE] When running GEM as AWS Marketplace container product then the Go runtime variable GOMAXPROCSis automatically set to match the container CPU quota, in case Kubernetes CPU resource limits are set.
- [FEATURE] Alertmanager: The experimental sharding feature is now considered complete. Detailed information about the configuration options can be found here for alertmanager and here for the alertmanager storage. To use the feature:- Ensure that a remote storage backend is configured for Alertmanager to store state using -alertmanager-storage.backend, and flags related to the backend. Note that thelocalandconfigdbstorage backends are not supported.
- Ensure that a ring store is configured using -alertmanager.sharding-ring.store, and set the flags relevant to the chosen store type.
- Enable the feature using -alertmanager.sharding-enabled.
- Note the prior addition of a new configuration option -alertmanager.persist-interval. This sets the interval between persisting the current alertmanager state (notification log and silences) to object storage. See the configuration file reference for more information.
 
- Ensure that a remote storage backend is configured for Alertmanager to store state using 
- [ENHANCEMENT] Add Cassandra support.
- [ENHANCEMENT] Add timeout for waiting on compactor to become ACTIVE in the ring.
- [ENHANCEMENT] Added tenant_idstag to tracing spans
- [ENHANCEMENT] Added option -distributor.excluded-zonesto exclude ingesters running in specific zones both on write and read path.
- [ENHANCEMENT] Added zone-awareness support to alertmanager for use when sharding is enabled. When zone-awareness is enabled, alerts will be replicated across availability zones.
- [ENHANCEMENT] Admin-API: Add a new endpoint for returning product and feature information at /admin/api/v1/features
- [ENHANCEMENT] Admin-API: Allow admin-api to operate for read-only request when no license is present.
- [ENHANCEMENT] Alertmanager: Added -alertmanager.max-alerts-countand-alertmanager.max-alerts-size-bytesto control max number of alerts and total size of alerts that a single user can have in Alertmanager’s memory. Adding more alerts will fail with a log message and incrementingcortex_alertmanager_alerts_insert_limited_totalmetric (per-user). These limits can be overrided by using per-tenant overrides. Current values are tracked incortex_alertmanager_alerts_limiter_current_alertsandcortex_alertmanager_alerts_limiter_current_alerts_size_bytesmetrics.
- [ENHANCEMENT] Alertmanager: Added -alertmanager.max-dispatcher-aggregation-groupsoption to control max number of active dispatcher groups in Alertmanager (per tenant, also overrideable). When the limit is reached, Dispatcher produces log message and increasescortex_alertmanager_dispatcher_aggregation_group_limit_reached_totalmetric.
- [ENHANCEMENT] Alertmanager: Cleanup persisted state objects from remote storage when a tenant configuration is deleted.
- [ENHANCEMENT] Authentiation: OIDC integration now supports a JWT with multiple roles. When present, these roles will be rolled up into a “virtual” access policy that provides metrics read access to the union of instances contained in those roles.
- [ENHANCEMENT] Blocks storage: support ingesting exemplars and querying of exemplars. Enabled by setting new CLI flag -blocks-storage.tsdb.max-exemplars=<n>or config optionblocks_storage.tsdb.max_exemplarsto positive value.
- [ENHANCEMENT] Distributor: Added distributors ring status section in the admin page.
- [ENHANCEMENT] Etcd: Added username and password to etcd config.
- [ENHANCEMENT] Expose CPU quota information (number of cores, cgroup quota) as Prometheus metrics.
- [ENHANCEMENT] Expose error counters and timestamps of CPU usage reporting as Prometheus metrics when AWS Marketplace meterting is enabled.
- [ENHANCEMENT] Expose value of GOMAXPROCS as Prometheus metrics.
- [ENHANCEMENT] Facilitate running GEM Docker image as a non-root user. Usage is documented in the Kubernetes deployment documentation.
- [ENHANCEMENT] Ingester: Added option -ingester.ignore-series-limit-for-metric-nameswith comma-separated list of metric names that will be ignored in max series per metric limit.
- [ENHANCEMENT] Ingester: added option -ingester.readiness-check-ring-healthto disable the ring health check in the readiness endpoint.
- [ENHANCEMENT] License: Added flag -license.typethat is used to specify that the APP is running through AWS Marketplace.
- [ENHANCEMENT] License: Implemented /licensesendpoint that responds with static list of licenses that replaces default implementation if the APP is running through AWS Marketplace.
- [ENHANCEMENT] License: Implemented logic to check if AWS Marketplace subscription is active instead of checking license file if the APP is running through AWS Marketplace.
- [ENHANCEMENT] Memberlist: expose configuration of memberlist packet compression via -memberlist.compression=enabled.
- [ENHANCEMENT] Memberlist: optimized receive path for processing ring state updates, to help reduce CPU utilization in large clusters.
- [ENHANCEMENT] Node-API: Added TSDB block metadata to the exportable debug archive.
- [ENHANCEMENT] Node-API: Register a new endpoint for fetching a compressed debug file containing config and version information at /node/api/v1/debug-export.
- [ENHANCEMENT] Node-API: Register a new endpoint for fetching version information about the nodes at /node/api/v1/version.
- [ENHANCEMENT] Querier now can use the LabelNamescall with matchers, if matchers are provided in the/labelsAPI call, instead of using the more expensiveMetricsForLabelMatcherscall as before. This can be enabled by enabling the-querier.query-label-names-with-matchers-enabledflag once the ingesters are updated to this version. In the future this is expected to become the default behavior.
- [ENHANCEMENT] Reduce memory used by streaming queries, particularly in ruler.
- [ENHANCEMENT] Ring, query-frontend: Avoid using automatic private IPs (APIPA) when discovering IP address from the interface during the registration of the instance in the ring, or by query-frontend when used with query-scheduler. APIPA still used as last resort with logging indicating usage.
- [ENHANCEMENT] Ruler: added rule_grouplabel to metricscortex_prometheus_rule_group_iterations_totalandcortex_prometheus_rule_group_iterations_missed_total.
- [ENHANCEMENT] Scanner: add support for DynamoDB (v9 schema only).
- [ENHANCEMENT] Scanner: retry failed uploads.
- [ENHANCEMENT] Storage: Added the ability to disable Open Census within GCS client (e.g -gcs.enable-opencensus=false).
- [ENHANCEMENT] Store-gateway: added -store-gateway.sharding-ring.wait-stability-min-durationand-store-gateway.sharding-ring.wait-stability-max-durationsupport to store-gateway, to wait for ring stability at startup.
- [ENHANCEMENT] Wildcard Datasource: Wildcard “*” datasources are now supported in datasource urls for GEM. This allows an action to have access to all instances in all access policies associated with the provided token. If that set of instances includes a wildcard “*”, then access is expanded to all instances in the cluster.
- [ENHANCEMENT] Added instrumentation to Redis client, with the following metrics:- cortex_rediscache_request_duration_seconds
 
- [ENHANCEMENT] Include additional limits in the per-tenant override exporter. The following limits have been added to the cortex_overridesmetric:- max_fetched_series_per_query
- max_fetched_chunk_bytes_per_query
- ruler_max_rules_per_rule_group
- ruler_max_rule_groups_per_tenant
 
- [ENHANCEMENT] License Manager: Added functionality to regularly check the local license file and sync it to the license storage backend.- Added metrics grafana_labs_license_syncs_totalandgrafana_labs_license_sync_failures_total.
 
- Added metrics 
- [ENHANCEMENT] Ring: allow experimental configuration of disabling of heartbeat timeouts by setting the relevant configuration value to zero. Applies to the following:- -distributor.ring.heartbeat-timeout
- -ring.heartbeat-timeout
- -ruler.ring.heartbeat-timeout
- -alertmanager.sharding-ring.heartbeat-timeout
- -compactor.ring.heartbeat-timeout
- -store-gateway.sharding-ring.heartbeat-timeout
 
- [ENHANCEMENT] Ring: allow heartbeats to be explicitly disabled by setting the interval to zero. This is considered experimental. This applies to the following configuration options:- -distributor.ring.heartbeat-period
- -ingester.heartbeat-period
- -ruler.ring.heartbeat-period
- -alertmanager.sharding-ring.heartbeat-period
- -compactor.ring.heartbeat-period
- -store-gateway.sharding-ring.heartbeat-period
 
- [ENHANCEMENT] Alertmanager: introduced new metrics to monitor operation when using -alertmanager.sharding-enabled:- cortex_alertmanager_state_fetch_replica_state_total
- cortex_alertmanager_state_fetch_replica_state_failed_total
- cortex_alertmanager_state_initial_sync_total
- cortex_alertmanager_state_initial_sync_completed_total
- cortex_alertmanager_state_initial_sync_duration_seconds
- cortex_alertmanager_state_persist_total
- cortex_alertmanager_state_persist_failed_total
 
- [ENHANCEMENT] Memberlist: introduced new metrics to aid troubleshooting tombstone convergence:- memberlist_client_kv_store_value_tombstones
- memberlist_client_kv_store_value_tombstones_removed_total
- memberlist_client_messages_to_broadcast_dropped_total
 
- [ENHANCEMENT] Ruler: added new metrics for tracking total number of queries and push requests sent to ingester, as well as failed queries and push requests. Failures are only counted for internal errors, but not user-errors like limits or invalid query. This is in contrast to existing cortex_prometheus_rule_evaluation_failures_total, which is incremented also when query or samples appending fails due to user-errors.- cortex_ruler_write_requests_total
- cortex_ruler_write_requests_failed_total
- cortex_ruler_queries_total
- cortex_ruler_queries_failed_total
 
- [BUGFIX] Graphite: Fix handling of consolidateBy and make aggregation method part of aggregation cache key.
- [BUGFIX] Alertmanager: fix Alertmanager status page if clustering via gossip is disabled or sharding is enabled.
- [BUGFIX] Authentication: fix handling of missing instances, or when instance has no matching access policy, by properly returning a 401 instead of crashing.
- [BUGFIX] Compactor: fixed panic while collecting Prometheus metrics.
- [BUGFIX] Graphite: Apply the max-points-per-req-hard limit correctly.
- [BUGFIX] Graphite: Fix race in index.json API endpoint which lead to incomplete results.
- [BUGFIX] HA Tracker: when cleaning up obsolete elected replicas from KV store, tracker didn’t update number of cluster per user correctly.
- [BUGFIX] Ingester: fix issue where runtime limits erroneously override default limits.
- [BUGFIX] Ingester: fixed infrequent panic caused by a race condition between TSDB mmap-ed head chunks truncation and queries.
- [BUGFIX] Ingester: fixed ingester stuck on start up (LEAVING ring state) when -ingester.heartbeat-period=0and-ingester.unregister-on-shutdown=false.
- [BUGFIX] Invalidate cached authentication tokens when they are deleted from object storage.
- [BUGFIX] Make multiple Get requests instead of MGet on Redis Cluster.
- [BUGFIX] Memberlist: fix to setting the default configuration value for -memberlist.retransmit-factorwhen not provided. This should improve propagation delay of the ring state (including, but not limited to, tombstones). Note that if the configuration is already explicitly given, this fix has no effect.
- [BUGFIX] Purger: fix Invalid null value in condition for column rangecaused bynilvalue in range for WriteBatch query.
- [BUGFIX] Querier: Fix issue where samples in a chunk might get skipped by batch iterator.
- [BUGFIX] Querier: fix queries failing with “at least 1 healthy replica required, could only find 0” error right after scaling up store-gateways until they’re ACTIVE in the ring.
- [BUGFIX] Query-frontend: Fix 401s during query_rangerequests when enterprise authentication is used. The workaround involving disabling enterprise authentication on the querier can now be removed.
- [BUGFIX] Ruler: Fix bug in rule forwarding with remote write which could cause filling up the disk because it was not truncated.- New flags called -ruler.remote-write.wal-truncate-frequency,-ruler.remote-write.min-wal-timeand-ruler.remote-write.max-wal-timehave been added.
 
- New flags called 
- [BUGFIX] Ruler: Honor the evaluation delay for the ALERTSandALERTS_FOR_STATEseries.
- [BUGFIX] Ruler: fix /ruler/rule_groupsendpoint doesn’t work when used with object store.
- [BUGFIX] Ruler: fix startup in single-binary mode when the new ruler_storageis used.
- [BUGFIX] Ruler: fixed counting of PromQL evaluation errors as user-errors when updating cortex_ruler_queries_failed_total.
- [BUGFIX] Store-gateway: when blocks sharding is enabled, do not load all blocks in each store-gateway in case of a cold startup, but load only blocks owned by the store-gateway replica.
- [BUGFIX] Upgrade Prometheus. TSDB now waits for pending readers before truncating Head block, fixing the chunk not founderror and preventing wrong query results.
v1.4.2 – July 21st 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.4.2(digest:- sha256:385b563669a5ba4a459f833a2c356884b757de719e43369ead0c5dc59cb11d94)
- License: Grafana Labs license 
Changelog
- [SECURITY] Prevent path traversal attack from users able to control the HTTP header X-Scope-OrgID. (CVE-2021-36157)- Users only have control of the HTTP header when GEM is configured with
flags -auth.type=defaultand-tenant-federation.enabled=false
 
- Users only have control of the HTTP header when GEM is configured with
flags 
- [SECURITY] Update build image to use Go 1.16.6. (CVE-2021-34558) #1874
- [BUGFIX] Ruler: Register remote write metrics correctly. #1814
Upstream Cortex details
- Cortex Hash: 2210ebb7052a9efb99d0e4dc53043a3f5d806d00
v1.4.1 – June 29th 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.4.1(digest:- sha256:d1d17bfe2ec984b093b9da1ab8cdea1f764f24f16b38557d719254c4e64c9f9a)
- License: Grafana Labs license 
Changelog
- [BUGFIX] Update the GEM build image to use Alpine 3.14, python 3.9 and gsutil 4.52.
Upstream Cortex details
- Cortex Hash: 98dd0c4d69576fdfaf2b9bfd7aa475e835e11429
v1.4.0 – June 28th 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.4.0(digest:- sha256:ff38e0544d805bfd1450a1f033ed79585252a4444d247e0e4c649625619215ab)
- License: Grafana Labs license 
Changelog
- [CHANGE] Breaking: Verify token issuer when using OIDC authentication. Includes a breaking change for users of OIDC authentication. #1571- Before this change the configuration of OIDC authentication required the OIDC provider’s jwks_urito be set in the configuration flagauth.admin.oidc.url. This flag has been deprecated.
- A new flag named auth.admin.oidc.issuer-urlhas been added, and it must be set to the URL of the OIDC provider. For example:-auth.admin.oidc.issuer-url=https://accounts.google.comNote: This is not simply a rename of the old flag; you also need to update the value. The defined issuer is required to provide the OIDC discovery endpoint (/.well-known/openid-configuration)
 
- Before this change the configuration of OIDC authentication required the OIDC provider’s 
- [CHANGE] Breaking: The GEM/GEL Ruler can now be accessed by access policies with rules read/write permissions, which are no longer metrics/logs specific #1366 & #1403- Before this change, there were metric rule specific permissions metrics:rules:readandmetrics:rules:write.
- The data representation for this change in object storage is backwards compatible, so no change is needed for existing access policies using the new rules.
- The JSON representation for these rules is not backwards compatible, and so any JSON interactions with the API that specified the strings
metrics:rules:readormetrics:rules:writemust be updated to the stringsrules:readandrules:writerespectively.
- This breaking change applies to the GEM Plugin as well, so please update to version v3.0.X.
 
- Before this change, there were metric rule specific permissions 
- [CHANGE] Remove enterprise_featuresconfig block entirely. #1453
- [CHANGE] Alertmanager: deprecated -alertmanager.storage.*CLI flags (and their respective YAML config options) in favour of-alertmanager-storage.*. This change doesn’t apply toalertmanager.storage.pathandalertmanager.storage.retention.
- [CHANGE] Blocks storage: removed the config option -blocks-storage.bucket-store.index-cache.postings-compression-enabled, which was deprecated. Postings compression is always enabled.
- [CHANGE] GEM now fails fast on startup if it is unable to connect to the ring backend.
- [CHANGE] Querier / ruler: deprecated -store.query-chunk-limitCLI flag (and its respective YAML config optionmax_chunks_per_query) in favor of-querier.max-fetched-chunks-per-query(and its respective YAML configuration optionmax_fetched_chunks_per_query). The new limit specifies the maximum number of chunks that can be fetched in a single query from ingesters and long-term storage: the total number of chunks that are actually fetched, in the worst case, can be twice the limit because the limit is applied to ingesters as well as long-term storage.
- [CHANGE] Query frontend: removed the configuration option -querier.compress-http-responses, which was deprecated. Instead, use-api.response-compression-enabled.
- [CHANGE] Runtime-config / overrides: removed the config options -limits.per-user-override-config(use-runtime-config.file) and-limits.per-user-override-period(use-runtime-config.reload-period), both deprecated.
- [FEATURE] Add embedded recording rules to the Enterprise Ruler to support building dashboards and
alerts from internal metrics written directly to GEM itself via a distributor. #1459- To enable or disable the feature, use the -instrumentation.enabledflag or associatedenabledsetting on theinstrumentationconfiguration block. The feature is disabled by default.
 
- To enable or disable the feature, use the 
- [FEATURE] Add the ability to write internal metrics directly to GEM itself via a distributor. #1281- To configure, or enabled or disabled the feature, user the -instrumentation.enabledflag and associated other flags or theinstrumentationconfiguration block:The feature is disabled by default.instrumentation: enabled: false flush_period: 15s write_timeout: 10s distributor_client: address: dns:///:9095 connect_timeout: 5s tls_enabled: false tls_cert_path: tls_key_path: tls_ca_path: tls_server_name: tls_insecure_skip_verify:
 
- To configure, or enabled or disabled the feature, user the 
- [FEATURE] Self-monitoring: expose filesystem usage metrics to source the disk utilization panel in the self-monitoring resource dashboards #1618
- [FEATURE] Add an experimental GEM component federation-frontend, which can be used to federate queries between multiple GEM clusters. #1274
- [FEATURE] Querier: Added new -querier.max-fetched-series-per-queryflag. When GEM is running with blocks storage, the max series per query limit is enforced in the querier and applies to unique series received from ingesters and store-gateway (long-term storage).
- [FEATURE] Querier/Ruler: Added new -querier.max-fetched-chunk-bytes-per-queryflag. When GEM is running with blocks storage, the max chunk bytes limit is enforced in the querier and ruler and limits the size of all aggregated chunks returned from ingesters and storage as bytes for a query.
- [ENHANCEMENT] Introduce configuration parameter to limit how many points we process per query. #1292
- [ENHANCEMENT] Adding API endpoints via which a user can post / get their storage schemas / aggregations. #1389
- [ENHANCEMENT] Admin-API: Listing mutable resources now includes a comma separated list of versions for those resources in the ETagheader #1419
- [ENHANCEMENT] Admin-API: Updating a mutable resources now allows a wildcard value ("*") to be passed as theIf-Matchheader, which allows the updating of any current version #1449
- [ENHANCEMENT] The /configHTTP endpoint now also returns GEM specific options alongside regular Cortex configuration. #1380
- [BUGFIX] Fix LBAC regular expression matchers #1305
- [BUGFIX] Validate all fields of JWT tokens used for auth, except the issuer. #1500
- [BUGFIX] Ruler: ensure the S3 rule storage flags properly maps to the upstream flags. #1460
- [BUGFIX] Admin-API: rejecting update requests when access policies have empty scopes or realms. #1447
- [BUGFIX] Updated licenses are now persisted to object storage, fixing the responses from the license API which would show old license information. #1568
- [BUGFIX] Validate all fields of JWT tokens used for auth, except the issuer. #1500
- [BUGFIX] OAuth: Don’t use default access policy when an invalid JWT claim is provided. #1635
- [BUGFIX] Authentiation: Invalidate cached authentication tokens when they are deleted from object storage. #1703
Upstream Cortex details
- Cortex Hash: 98dd0c4d69576fdfaf2b9bfd7aa475e835e11429
- Cortex Commits
v1.3.1 – Jul 21st 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.3.1(digest:- sha256:e03a7ae061d5f617490812a6f45c6362fdc9ef79010555a207ebee2174ef9b23)
- License: Grafana Labs license 
Changelog
- [SECURITY] Prevent path traversal attack from users able to control the HTTP header X-Scope-OrgID. (CVE-2021-36157)- Users only have control of the HTTP header when GEM is configured with
flags -auth.type=defaultand-tenant-federation.enabled=false
 
- Users only have control of the HTTP header when GEM is configured with
flags 
- [SECURITY] Update build image to use Go 1.16.6. (CVE-2021-34558) #1874
- [BUGFIX] Update the GEM build image to use Alpine 3.14, python 3.9 and gsutil 4.52. #1781
- [BUGFIX] Ruler: Register remote write metrics correctly. #1814
Upstream Cortex details
- Cortex Hash: 64592254fe91c86e903882947a58d572a316884d
v1.3.0 – April 26th 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.3.0
- License: Grafana Labs license 
Changelog
- [SECURITY] Alertmanager: Fix a local file disclosure vulnerability when -experimental.alertmanager.enable-apiis used (CVE-2021-31231):- The HTTP Basic auth password_file can be used as an attack vector to send any file content via a webhook.
- The Alertmanager templates can be used as an attack vector to send any file content because the Alertmanager can load any text file specified in the templates list.
 
- [CHANGE] Admin API: Concurrent requests to the same resource are no longer allowed. If two requests are issued to create, update, or delete the same resource, then the first one to achieve a lock executes and the second one returns a conflict error. This is handled per process. To enforce this behavior on multiple processes, use leader election. #1186
- [CHANGE] Admin API: all errors encountered during the processing of HTTP requests are converted to GRPC errors in order to determine the correct HTTP status to return. This enforces consistency for leader election, because some requests are handled internally, and others are forwarded to other instances. #1217
- [CHANGE] Admin API: all mutation operations (PUT/DELETE) now require anIf-Matchheader to be set (an integer between""such as"27") to verify that the correct version of the resource is being modified and prevent against race conditions. You can find the current version of a resource in theETagheader that is returned when that resource is read (viaGET) or updated (viaPUT).
- [FEATURE] Admin API: you can set per-instance resource limits via the Admin API. This is enabled by default. #1173- You can enable or disable this feature by using the -admin-api.limits.enabledor-admin-api.limits.refresh-periodflags. Also, you can configure this feature by using theadmin_apiconfiguration block:admin_api: limits: enabled: true refresh_period: 1m
 
- You can enable or disable this feature by using the 
- [ENHANCEMENT] Upgrade build image to use Go 1.16.3. #1294
- [ENHANCEMENT] Admin client: Add cortex_admin_client_is_leadergauge metric to determine when the client considers itself the leader. #1175
- [ENHANCEMENT] Admin API: update an access policy via the Admin API using a PUTrequest. #1139
- [ENHANCEMENT] Admin API: Update an instance via the Admin API using a PUTrequest. #1180
- [ENHANCEMENT] Gateway: Forward /multitenant_alertmanager/ringand/ruler/ringroutes to thealertmanagerandrulerproxy backends. #1144
- [BUGFIX] Graphite: Fix aggregation cache to generate cache keys using correct input data. #963
- [BUGFIX] Authentication: Fix issue where all requests would trigger a panic if authentication is enabled but no admin client is configured. A error is now printed instead. #1106
Upstream Cortex details
- Cortex Hash: 2d8477c4a325ce5071676e906efcee4adb687513
- Cortex Commits
v1.2.1 – April 27 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.2.1
- License: Grafana Labs license 
Changelog
- [SECURITY] Alertmanager: Fix a local file disclosure vulnerability when -experimental.alertmanager.enable-apiis used (CVE-2021-31231):- The HTTP Basic auth password_file can be used as an attack vector to send any file content via a webhook.
- The Alertmanager templates can be used as an attack vector to send any file content because the Alertmanager can load any text file specified in the templates list.
 
v1.2.0 – March 10 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.2.0
- License: Grafana Labs license 
Changelog
- [CHANGE] Gateway: Remove purger proxy configuration, which is not a supported target for blocks clusters.
- [CHANGE] Auth: Override authentication flags have been renamed:- The auth.override-admin-tokenflag has been changed toauth.override.token.
- The auth.override-admin-token-fileflag has been changed toauth.override.token-file.
 
- The 
- [FEATURE] Gateway: Improve the gatewaytarget to support unique TLS configurations and write timeouts for each backend.- New fields have been added to allow for configuration:gateway: proxy: default: tls: tls_cert_path: <string> tls_key_path: <string> tls_ca_path: <string> tls_insecure_skip_verify: <bool> distributor: read_timeout: <duration> write_timeout: <duration> tls: ...
 
- New fields have been added to allow for configuration:
- [FEATURE] Compactor: Introduced time-shardingcompaction strategy.
- [ENHANCEMENT] Distributor: Wrap remote writes in distributor to sample and log them as business intelligence events.
- [ENHANCEMENT] Metrics emitted for TLS certificate expiration now reflect certificates being reloaded.
- [ENHANCEMENT] Remove the Graphite Auto Complete Index and use Cortex index instead.
- [ENHANCEMENT] Add Graphite API endpoint /metrics/index.json.
- [ENHANCEMENT] Distributor: Wrap remote writes in distributor to sample and log them as business intelligence events.
- [ENHANCEMENT] Call Cortex Distributor over gRPC from Graphite Write Proxy (formerly Graphite Distributor)
- [ENHANCEMENT] Admin API: Add feature to elect and admin-api leader instance to handle all mutation requests. Requests to non-leader instances are forwarded to the leader instance.- New fields have been added to allow for configuration:
 admin_api: leader_election: enabled: <bool> ring: kvstore: <kv.Config> heartbeat_period: <duration> heartbeat_timeout: <duration> tokens_observe_period: <duration> instance_interface_name: <[]string> client_config: <grpcclient.Config>
- [BUGFIX] LBAC: Fix issue where debug logs would not print the selector and instead print selector="unsupported value type".
- [BUGFIX] Admin-Client: Warning logs are no longer created on resource creation.
- [BUGFIX] Ruler: Fix issue where invalid remote-write URLs cause a panic.
- [BUGFIX] Querier: Apply label access filters on multi tenant access policies.
Upstream Cortex details
- Cortex Hash: 003eb33266ca464d7290a938a9d767c36b9a03a4
- Cortex CHANGELOG
v1.1.3 – April 27 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- Docker image: run - docker pull grafana/metrics-enterprise:v1.1.3
- License: Grafana Labs license 
Changelog
- [SECURITY] Alertmanager: Fix a local file disclosure vulnerability when -experimental.alertmanager.enable-apiis used (CVE-2021-31231):- The HTTP Basic auth password_file can be used as an attack vector to send any file content via a webhook.
- The Alertmanager templates can be used as an attack vector to send any file content because the Alertmanager can load any text file specified in the templates list.
 
v1.1.2 – January 20 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- License: Grafana Labs license 
Changelog
- [BUGFIX] Querier: fix default value incorrectly overriding -querier.frontend-addressin single-binary mode.
v1.1.1 – January 14 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- License: Grafana Labs license 
Changelog
- [BUGFIX] Ruler: Minimize gaps on rule evaluations with stale input and enabled ruler evaluation delay.
v1.1.0 – January 12 2021
Links
- Binary (Linux AMD64) 
- Deb (Linux AMD64) 
- RPM (Linux AMD64) 
- License: Grafana Labs License 
Changelog
- [CHANGE] Admin-API: Resources must not be both prefixed and suffixed with the - __characters. If any of your existing resources exist with this naming pattern, they must be deleted and recreated with a new name before upgrading.
- [CHANGE] Graphite: Allow storage schema and storage aggregation configs to be defined per tenant. 
- [CHANGE] Admin-Client: Instance management client calls no longer use object storage - Itercalls when retrieving the latest version of a resource.
- [CHANGE] Graphite: Add API endpoints to explore the available Graphite functions. 
- [CHANGE] Admin: The selectors for label policies are now provided as PromQL label strings instead of typed objects. - Deprecated: - "label_policies": [ { "selector": [ { "name": "env", "value": "dev", "type": "EQ" } ] } ]
- New: - "label_policies": [ { "selector": "{env=\"dev\"}" } ]
 
- [CHANGE] Admin: Operations with an - ADMINscope are no longer restricted to operating on clusters they have as a configured realm.
- [CHANGE] Deprecate - enterprise_featuresconfig section in favor of the Cortex config extension.- Deprecated: - enterprise_features: ruler_s3_request_headers: file: <string> poll_interval: <duration> ruler_remote_write: enabled: <bool> wal_dir: <string>
- New: - ruler: storage: s3: header_map_file_path: <string> header_map_poll_interval: <duration> remote_write: enabled: <bool> wal_dir: <string>
 
- [FEATURE] Ruler: Alerts can now be correctly forwarded to the Alertmanager with enterprise authentication enabled by setting the basic authentication username to - __alertmanager__and the password to a API token with access to every instance.
- [FEATURE] Queries: LBAC enforcement has been added for queries and label value requests. - When GEM is run using the defaultauthentication mode, LBAC policies are specified using theX-Prom-Label-PolicyHTTP header in the format:X-Prom-Label-Policy: <tenant-id>:urlEscaped(<prometheus label selector>). For example, a policy that only allows metrics with the labelenvequal todevfor tenanttest-instancecould specified with the following header:X-Prom-Label-Policy: test-instance:%7Benv=%22dev%22%7D. To specify multiple policies either set the header multiple times or set the header with a single string of multiple policies separated by an unescaped comma.
 
- When GEM is run using the 
- [FEATURE] Admin API: add - label_policiesfield, which contains an array of label matchers to the access policy realm JSON.- { "realms": [ { "instance": "<string>", "cluster": "<string>", "label_policies": [ { "selector": [ { "type": "<enum: EQ | NEQ | RE | NRE>", "name": "<string>", "value": "<string>" } ] } ] } ] }
- [FEATURE] Admin: Add target - tokengento generate tokens for the default or a custom access policy.
- [FEATURE] Admin: Added a default - __admin__access policy that has an- ADMINscope. This policy can be disabled adding the following to the GEM configuration file.- admin_client: disable_default_admin_policy: true
- [FEATURE] Querier: Queries can be federated across multiple tenants. The tenants IDs involved need to be specified separated by a - |character in the- X-Scope-OrgIDrequest header.
- [FEATURE] Add - gatewaytarget that can be configured to proxy requests to microservices and can be used to load balance remote_write requests to the distributors.
- [ENHANCEMENT] AdminAPI: Add scope for read only admin access, - admin:read.
- [ENHANCEMENT] AdminAPI: Add separate set of scopes for alerts and rules. - alerts:read
- alerts:write
- logs:rules:read
- logs:rules:write
- metrics:rules:read
- metrics:rules:write
 
- [ENHANCEMENT] Reduce allocations in Graphite Ingester, when ingesting untagged Graphite metrics. 
- [ENHANCEMENT] Serve Graphite /metrics/find requests by keeping track of all recent metrics in an in-memory index on the Ingesters to reduce latency. 
- [ENHANCEMENT] Add auxiliary Graphite API endpoints to explore tags and obtain auto-complete suggestions for the Grafana query editor. 
- [ENHANCEMENT] Admin API: add ClusterKind support for Logs & Traces. 
- [ENHANCEMENT] Admin API: add scopes for Logs. 
- [ENHANCEMENT] Admin: The bootstrap target no longer needs to be run before being able to start GEM with enterprise features. Every target will now try to perform bootstrapping on startup if it has not already been done. Failure to bootstrap will not prevent GEM running, but enterprise features will not be available. 
- [ENHANCEMENT] Add - grafana_labs_license_expiry_timestampmetric to expose GEM license expiration as a UNIX timestamp, in seconds.
- [BUGFIX] Graphite: Fixing a bug in the request parsing of GET requests on the auto-complete endpoints. 
- [BUGFIX] Graphite: When ingesting datapoints resulting in out-of-order/out-of-bounds/duplicate-sample we need to return status 200 to prevent an indefinite loop. 
- [BUGFIX] Ruler: Fix issue where remote-write rule groups are created then immediately deleted when a rule group name contains the - /delimiter character.
Upstream Cortex changes
- Upstream Cortex hash: c3b8c46fd8fc9a2aa85accbe54cb00be2552dcd9
- Changes since last GEM release
v1.0.2 – October 16 2020
Links
Changelog
- [CHANGE] Update vendored Cortex from v1.4.0 to [v1.4.0-21bad5][21bad5]
- [BUGFIX] Fix potential panic due to writing into a closed chan in the graphite query executor.
- [ENHANCEMENT] Admin: Access policy create operations now enforce valid instance/cluster names for the realms configured on the access policy.
- [ENHANCEMENT] Add -versionflag to GEM.
- [FEATURE] Add config options to rate limit the LIST methods of buckets.
- [FEATURE] Adds the Graphite /render API endpoint, which can be used to query metrics with the Graphite query language.
- [FEATURE] Add config options to specify and poll files to inject arbitrary HTTP headers in requests to S3 for the admin and blocks client.blocks_storage: s3: header_map_file_path: <path to header file> header_map_poll_interval: <duration string> admin_client: storage: s3: header_map_file_path: <path to header file> header_map_poll_interval: <duration string>
- [FEATURE] Adds the Graphite /metrics/find API endpoint, which can be used to obtain lists of metrics matching a given pattern (Grafana query editor auto-complete, dashboard variable population, etc).
- [FEATURE] Add a default access policy option for OpenID Connect tokens.
Upstream Cortex details
- Cortex Hash: [21bad57b346c730d684d6d0205efef133422ab28][21bad5]
- Cortex CHANGELOG
v1.0.1 – October 06 2020
Links
Upstream Cortex details
- Cortex Hash: 23554ce028c090a4a3413ac0e35e5e1dc9fa929f
- Cortex Version: 1.4.0
Changelog
- [CHANGE] Update vendored Cortex to v1.4.0.
v1.0.0 – September 17 2020
Links
Upstream Cortex details
- Cortex Hash: bb5fcc929832f7bd2a6c2df348b387abcb8b961e
- Cortex Version: 1.4.0-rc.0
Changelog
- [BUGFIX] Make config field names consistent.
- [CHANGE] Use Go 1.14.9 to build the project and cut build-image@v0.1.3.
v1.0.0-rc.2 – September 15 2020
Links
Upstream Cortex details
- Cortex Hash: c3a344784a0c8ce70ef2521f543033dee3dce6c6
- Cortex Version: 1.3.1
Changelog
- [BUGFIX] Admin API: Fix panic on start up for admin-apitarget.
v1.0.0-rc.1 – September 04 2020
Links
Upstream Cortex details
- Cortex Hash: 4f6e1e5c48ccad2c1988cf1d36ca522ae0c805ed
- Cortex Version: 1.3.1
Changelog
- [CHANGE] Admin-Client: The storage backend for the admin client no longer defaults to s3. Instead no default is set and the admin client will not start up unless a default is set.
- [CHANGE] The following features will no longer be active unless GEM is started with access to a valid license.- Admin API
- Ruler S3 auth headers
- Ruler API to configure remote write rule groups
 
v0.6.3 – August 20 2020
Links
Upstream Cortex details
- Cortex Hash: 2bda7b94
- Cortex Version: 1.2.1
Changelog
- [CHANGE] Auth: removed auth.enableflag and addauth.typeflag withdefaultandenterpriseoptions.
- [FEATURE] Admin API: Add list endpoint for stored licenses.
v0.6.2 – August 04 2020
Links
Upstream Cortex details
- Cortex Hash: 6db67a4efbbf62b1133fa037a95382a21f752bbf
- Cortex Version: 1.2.1
Changelog
- [CHANGE] Ruler: S3 Headers are no longer protected by a license.







