Grafana Enterprise Metrics downloads
Releases
v2.13.1 – August 15th 2024
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.13.1
- Manifest digest:
sha256:bef3d3390d9b2e2a2208552f3ab98a23a7afa90d98fda8f950d1b23bae99575f
- linux/amd64 digest:
sha256:8b208b8ecefc7905608da445f128e2aa571f9e1eec7c1427ac637e41a5922be8
- linux/arm64 digest:
sha256:f1113c92e13987a7377aa8617a4b361914fde77d3f38e902d79ed6c9333d5445
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.13.1
- Manifest digest:
sha256:a3c4d70ccae452db70f6ad0e3a1d9aec29a29734761781de35afe57020100b92
- linux/amd64 digest:
sha256:7fb83ba3377afe63808ee146e27ef334dd1eaf71557d4bd4fb5873a03b42015c
- linux/arm64 digest:
sha256:484321e918b5af25159f863d492b88697986938841ed6e53850bf9fda68ba499
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [CHANGE] Cross-cluster query federation: log failed requests to remote targets.
- [CHANGE] Active series: the default value for active series idle timeout (
-ingester.active-series-metrics-idle-timeout
) increased from10m
to20m
. - [CHANGE] Active series: active series metrics must now be enabled and their update interval must be set to
20m
; otherwise, GEM fails to start. - [BUGFIX] Cross-cluster query federation: fix issue where time range for label names and label values requests was not propagated to remote targets.
Upstream Grafana Mimir details
- Version: 2.13.0
- Hash: 4775ec156855ba1eb498e564f09abad35053a44f
- Changelog: CHANGELOG.md
v2.13.0 – July 11th 2024
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.13.0
- Manifest digest:
sha256:3a7b42c70e1f9d67b0a3da8bd1540e172708929f98e73a60aa00f0ddd45db3a1
- linux/amd64 digest:
sha256:2dc0140c4adb1c06d7e050ad200ef156701bb3bb99c1b6f7496ce09ba8a52560
- linux/arm64 digest:
sha256:63f07bd37fff257a54111e6dde2e0acd565f52731201bda7cf848f8cf4a2c882
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.13.0
- Manifest digest:
sha256:b21e3489fad6e4334b3f7645b1c219873187c1d4de88fbfed67376d61c458bc0
- linux/amd64 digest:
sha256:0a81b72217b27b09672980ea3e9f8f87545f5eda273189897dbd36648652f041
- linux/arm64 digest:
sha256:9e07d6094c272426325685d7ae7a7047eec97429839283d271518bf6f7b8788a
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [CHANGE] Implemented enhanced usage statistics for more accurate reporting.
- [CHANGE] Build: grafana/enterprise-metrics docker image is now based on gcr.io/distroless/static-debian12 image. Alpine-based docker image is still available as grafana/enterprise-metrics-alpine, until 2.15.
- [CHANGE] Clamp GOMAXPROCS to runtime.NumCPU.
Upstream Grafana Mimir details
- Version: 2.13.0
- Hash: 4775ec156855ba1eb498e564f09abad35053a44f
- Changelog: CHANGELOG.md
v2.12.0 – April 5th 2024
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.12.0
- Manifest digest:
sha256:fb8514119242911882899145bb97339d2a3e281dbc75d9eac974d57d4244d5e4
- linux/amd64 digest:
sha256:3b11a70277cf79422f776da04c0cea365a604227375ed2dc693a1fdc705b74ce
- linux/arm64 digest:
sha256:4d977a53e7213d2854c2576d1820422891646f750101f7dbb0742b83ada9a525
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.12.0
- Manifest digest:
sha256:730142084b03f9f87d6ad7913c20038b170b26a42fe30d61ffec50660a223a1c
- linux/amd64 digest:
sha256:8d2dca5749eddfaaa1bf9a7b5eaf87e5fa12c26a0fc45e7a46bc237c0dc72238
- linux/arm64 digest:
sha256:ef7b1f111d66160f2c67ecf56f928b3efc307630f2b7299e9ac5f56a3bd3ff45
- Manifest digest:
- Standard distroless image:
docker pull grafana/enterprise-metrics-distroless:v2.12.0
- Manifest digest:
sha256:ae917c1032688f262f0a78e0592b06194e70865c0848c22bb17eadc27f1f93c4
- linux/amd64 digest:
sha256:15767ad1d23eb3f08d23a8b0bec4b0902f90fc05c43ab62e1e084b0dc8745eff
- linux/arm64 digest:
sha256:1159ff518547f0234cb746fdb676393bb3f2639de46db85ad7bb059a29785699
- Manifest digest:
- BoringCrypto distroless image:
docker pull grafana/enterprise-metrics-boringcrypto-distroless:v2.12.0
- Manifest digest:
sha256:dc336656922d99f5e5b7556768b55aa2c97254220b3f98da52e7ac5e3ce0bca4
- linux/amd64 digest:
sha256:5192a4d2e9db563450e36b5546ded7ec389df2336c9812ae5040141b336796b8
- linux/arm64 digest:
sha256:8193426a4373d956c4436ad249549d604f17a0931bb19ca4bcc6395b594cdd8d
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [CHANGE] Server: responses with HTTP 4xx status codes are now treated as errors and used in status_code label of request duration metric.
- [CHANGE] Disable SSL certificate expiration collector when using certificates from Vault.
- [ENHANCEMENT] Build GEM based on a distroless image.
- [ENHANCEMENT] Update base image to alpine:3.18.5
- [ENHANCEMENT] Add experimental flag
-graphite.querier.deduplicate-samples-in-raw-interval
to test a fix for duplicate points appearing in Graphite data. This flag will be removed when the correct behavior is determined. - [ENHANCEMENT] Add support for setting graphite querier rate limits to <= 0 to disable individual limits.
- [ENHANCEMENT] Set tracer max tag value length to 16e3 instead of the default 256.
- [BUGFIX] LBAC: caching for
/prometheus/api/v1/label/*
endpoints was not respecting LBAC policies leading to sharing cached responses for different LBAC policies within the same tenant.
Upstream Grafana Mimir details
- Version: 2.12.0
- Hash:
c7aab9e039d63397d2293114ad063b03626e247b
- Changelog: CHANGELOG.md
v2.11.2 – March 6th 2024
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.11.2
- Manifest digest:
sha256:40b0c6a82a98f0ff90c0183089dacbebef56413830e557f7c313ee2d3a6fc67c
- linux/amd64 digest:
sha256:b338df2b83b1b3d4061b2ecba54486359b7f01ed28097cd8b3b8284018c0b86e
- linux/arm64 digest:
sha256:ad021cf4d79cd519d62722f53200598444c3273724c38d9f842fa0374587c87d
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.11.2
- Manifest digest:
sha256:c58bdd0db618b4a1f54eb42f8b66767f5adf5b52cc70092ed1225f2bffc066e6
- linux/amd64 digest:
sha256:a101377e4a35c91cce0bd8cc050223bef608a75dd69b3803b4b777414d2e38fa
- linux/arm64 digest:
sha256:3d5857ea72d71acbfc6fcc00f6b4d40f14a4c21c0c3d328689b749fc450c4921
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Use Go 1.21.8.
- [BUGFIX] Update base image to
alpine:3.18.5
Upstream Grafana Mimir details
- Version: 2.11.0
- Hash:
c8939ea55f0818337e3428924f25dafc6ea256d1
- Changelog: CHANGELOG.md
v2.11.1 – January 9th 2024
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.11.1
- Manifest digest:
sha256:42a73166fbfa331e6124c55db642c1d0609a902be631d1a2b04814632b49c9f7
- linux/amd64 digest:
sha256:8a4264f4edf7703a16919c8e0e0ff51f62c026f15fa059e38beba693200361b9
- linux/arm64 digest:
sha256:96d8789cb22b33b650796554be8dd17548dea57ee526fdadf8386fc4381939b4
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.11.1
- Manifest digest:
sha256:e74f4e64af43d0c06e644feb47823b9aa27ba355a76b4c11489439327592afe8
- linux/amd64 digest:
sha256:bb7dcfd7ae176afb252ab90392b2e4b0ac9c0522ea0983b8042150ea226b24eb
- linux/arm64 digest:
sha256:ed6d25e7593c1106c20d617dbd8ac1c934581bc68dda6e7b7b5a10d98d8e05a2
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [CHANGE] Remove support for using AWS Marketplace licenses (
-license.type=aws-marketplace
). - [CHANGE] Network errors are now responded with 598 status code instead of 504 (originally 408).
- [ENHANCEMENT] Graphite querier: Add support for
maxDataPoints=-1
in/render
endpoint. - [ENHANCEMENT] Update base image to
alpine:3.18.4
- [ENHANCEMENT] Gateway audit logging: support decimal values for sampling rate. For example, setting
admin-api.auditlogging.sample-rate
to0.1
means sampling of 0.1%. - [ENHANCEMENT] Graphite querier: use input time range for querying tag values.
- [ENHANCEMENT] Graphite querier: added new
-graphite.querier.max-query-length
flag to limit the largest time range allowed for queries. - [ENHANCEMENT] Add
-auth.required-for-debug
option for disabling auth on/debug
endpoints. Auth is required by default. - [BUGFIX] Fix federation-frontend panic because of nil ExemplarQueryable
- [BUGFIX] Fix
execution: attempted to read series at index X from ingester chunks stream, but the stream has series with index Y
errors when multiple LBAC label selectors apply to a request and ingester to querier or store-gateway to querier chunks streaming is enabled.
Upstream Grafana Mimir details
- Version: 2.11.0
- Hash:
c8939ea55f0818337e3428924f25dafc6ea256d1
- Changelog: CHANGELOG.md
v2.10.6 – March 6th 2024
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.10.6
- Manifest digest:
sha256:9df8bbe69487e1975831f5c3f9a8479494d902b5cf9e4bbddf73217edb907567
- linux/amd64 digest:
sha256:1cf8c2db208104bfa7ae10c881c22ff40bcd88392bb874cb6baa841a9ee73c0b
- linux/arm64 digest:
sha256:1bc90da7fbf4b0dc998d8c0f2950b3bc5a713997ef6a83f18b7aee4fbfa91c55
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.10.6
- Manifest digest:
sha256:2d6658d1c9e300f138b7cfa17ef790fae1cb1cae9c0ac538c16353a615314fb0
- linux/amd64 digest:
sha256:e3cc9e2d4d339a84a18ef207df179ceaf69da61c214699cf4e2faadf52239bda
- linux/arm64 digest:
sha256:40fd2f3634f86b1d9ee139646f3696963c0b8d82c6305d55d332a1bf0acc8f30
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Use Go 1.21.8.
Upstream Grafana Mimir details
- Version: 2.10.5
- Hash:
f0830941c68406123603422701b9b2e5bc24b87a
- Changelog: CHANGELOG.md
v2.10.5 – December 13th 2023
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.10.5
- Manifest digest:
sha256:2a2b93c5d3dfe177e2e6bcbf496eb92224f46c55e0163fded23cbe83c6f173d1
- linux/amd64 digest:
sha256:d38023d560bc6ba623532e55f5021cfe9da71b34372256bf9cf6300bcc12f95e
- linux/arm64 digest:
sha256:cb2bc8348c3423cbdc5aad80b5d4fc3d352f1246f42562313e500fe1295788ea
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.10.5
- Manifest digest:
sha256:f4476e021c3472bd675e872e3e06c046374db3c2c9c65d5378858b05b593178c
- linux/amd64 digest:
sha256:f71c68cb8f4edf47b7e460b929f7c949cc8f092f819e985687f256b9700b67cb
- linux/arm64 digest:
sha256:7a419080ff7ce974bcc0801aef6048dce4cd0c4494eab378402f92c649031b10
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Update base image to
alpine:3.18.5
- [BUGFIX] Use Go 1.21.5.
Upstream Grafana Mimir details
- Version: 2.10.5
- Hash:
f0830941c68406123603422701b9b2e5bc24b87a
- Changelog: CHANGELOG.md
v2.10.4 – November 14th 2023
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.10.4
- Manifest digest:
sha256:a59219679aa471817a39a34e48bfb0ac371960aa8f53b516f6a03bdf5fc47f5c
- linux/amd64 digest:
sha256:d061004df0d39a5c7c3ce3aea1a4a2b67b8d1f6223ebf2ae0e85d0424e676d62
- linux/arm64 digest:
sha256:fcb88978bb8bde446c06914f92db44190a7501ba8df47cb03d01c7ae59737a47
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.10.4
- Manifest digest:
sha256:e40b88405c0cc35523ff52e75896c9ffc68b78d8e64231bec0e2c68020751a3d
- linux/amd64 digest:
sha256:b873b49989bfce04b2a4f8697136b3560c48233181ce2ebc62507e9a8f35731e
- linux/arm64 digest:
sha256:9ad3f7dc90bcd6186971fc88fc81476ca636dec4a5e09acfa18937e1437c7e68
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Update otelhttp library to v0.44.0 as a mitigation for CVE-2023-45142.
- [BUGFIX] Admin client: fix memory leak.
Upstream Grafana Mimir details
- Version: 2.10.4
- Hash:
d1f4f1291001db7384ba66ca90b4f753309f262d
- Changelog: CHANGELOG.md
v2.10.3 – October 18th 2023
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.10.3
- Manifest digest:
sha256:da9c1ef4fdc779532f9d2d81cc97f3d909b55b19ccc6536aa6a91eb0cc971fdc
- linux/amd64 digest:
sha256:5def0a00df7655dd1127bc9ac8aedb684fb0a45f7a62c27541b0a2e4dc5a42e9
- linux/arm64 digest:
sha256:2b13a29d643a2d987c6a5a0ba693264c233ef387dd3e4db85adefc330b4c9c42
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.10.3
- Manifest digest:
sha256:80a728cbadab32673062e06b3bdd43883f96c638b8a2d95508d82847eb5da858
- linux/amd64 digest:
sha256:89704845628a413eca8e5b773d207321c3be394fe2acdaa62f3ba8851306abe3
- linux/arm64 digest:
sha256:23d8c368a8842c7c781ee8cbc96773f4dfbbe2975b5703c38e460cf6e0aa6053
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Update grpc-go library to 1.57.2-dev to fix a concurrency issue introduced in 1.57.1.
Upstream Grafana Mimir details
- Version: 2.10.3
- Hash:
eb097eae69b4a324cc67e083f9d99489e36d2ab9
- Changelog: CHANGELOG.md
v2.10.2 – October 13th 2023
Important note
Warning
This release contains a known bug in thegrpc-go
library that can drastically affect the performance of grpc servers. We recommend updating to the newer v2.10.3 that fixes this issues.
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.10.2
- Manifest digest:
sha256:fa86c72666322a0663d12778aee1db1147f09514a3b30ac00a2948ba66ecbc5a
- linux/amd64 digest:
sha256:a898416b8aa0967ebdde8308434897841bf4b97e7a1eb5873e376b7d13963da1
- linux/arm64 digest:
sha256:2bc766cf3e49b40398219f514bf6a7f401b3b7e3851bb9c88933c0690ea432ae
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.10.2
- Manifest digest:
sha256:7ca15841d01f83379cafd7a1f4aec8849632c0f25d8b19e4abaf11cb20cd441c
- linux/amd64 digest:
sha256:655776c78d52ebd89d342979ee56d156d5606bd0abdf325fe8f56cd78c30f1b8
- linux/arm64 digest:
sha256:c165171f2858eb45bbb7d6ce27dceb2b815732d387b3a2d5e3b4681c0ca18f2e
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [ENHANCEMENT] Update Go to version 1.21.3, grpc-go to version 1.57.1 and golang.org/x/net to version 0.17 to fix CVE-2023-44487.
Upstream Grafana Mimir details
- Version: 2.10.2
- Hash:
e0ed0f5986389fe5357bc6cba8589b5a1fc5ff3a
- Changelog: CHANGELOG.md
v2.10.1 – September 19th 2023
We have encountered problems during publishing artifacts for Grafana Enterprise Metrics v2.10.0, so we have skipped that version.
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.10.1
- Manifest digest:
sha256:bc3636d33e1564cf5db9fc3d631eb235bcdab832541bbbf4fb8a78e776e60f6d
- linux/amd64 digest:
sha256:9632070af8db99f455380dd0ed45828a2557085c738dcbcf53196c2d87ad2068
- linux/arm64 digest:
sha256:e9721f93c6cf506361184dc6cb93061805fb5fff34f1f026872a2913445ace48
- Manifest digest:
- BoringCrypto image:
docker pull grafana/enterprise-metrics-boringcrypto:v2.10.1
- Manifest digest:
sha256:59def74ac3c5900d205f673cf3448ee15f50d5c3f35b06220a71ac53b69eddd0
- linux/amd64 digest:
sha256:c58764fd32b575d01dbde201389e633b0f51913309055a27b065b9ce84f6f1f7
- linux/arm64 digest:
sha256:a69af77cd0bf74f45947e72f104395d9d233569600b2adc1218072d9c99141e5
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [ENHANCEMENT] Graphite querier: CarbonAPI adds support for
inf
literal forkeepLastValue
andinterpolate
functions. - [ENHANCEMENT] Graphite querier: skip all whitespace during query parsing.
- [ENHANCEMENT] Enterprise Metrics: use new data structure for labels to reduce memory consumption.
- [ENHANCEMENT] Graphite querier:
/functions
api now natively supported instead of being proxied to stock Graphite Web. - [ENHANCEMENT] Enterprise Metrics Ruler: added
insight=true
field to ruler’s evaluation logs. - [ENHANCEMENT] Update base image to
alpine:3.18.3
. - [ENHANCEMENT] Added support to rate limit application logs:
-log.rate-limit-enabled
-log.rate-limit-logs-per-second
-log.rate-limit-logs-per-second-burst
- [ENHANCEMENT] Use Go 1.21.1.
- [BUGFIX] Graphite querier: return 499s instead of 500s when clients cancel requests in certain scenarios.
- [BUGFIX] Graphite querier: correctly support setting
SplitQueriesByInterval
to0
. - [BUGFIX] Graphite querier: fixes a JSON serialization bug when a returned value is infinity.
- [BUGFIX] Graphite querier: fixes handling of holtWinters functions, that are now proxied to Graphite Web.
- [BUGFIX] Graphite querier: Updated rollup aggregation to aggregate NaN or empty sample lists to NaN instead of zero.
- [BUGFIX] Graphite querier: wildcard matchers in patterns are now correctly honored, and series with less nodes than queried are no longer returned as part of the result.
- [BUGFIX] Graphite querier: CarbonAPI’s
groupByNode
function implementation now returns the expected number of series. - [BUGFIX] Graphite querier: CarbonAPI’s
holtWintersConfidenceBands
andholtWintersForecast
functions implementation now return correct results. - [BUGFIX] Graphite querier: CarbonAPI’s
moving*
functions implementation now returns correct results. - [BUGFIX] Graphite querier: CarbonAPI’s
exponentialMovingAverage
functions implementation now returns correct results. - [BUGFIX] Graphite querier: filter out series with fewer nodes than queried when path expressions end with glob patterns.
- [BUGFIX] Graphite querier: handle remote querier disconnections more gracefully, avoiding internal errors in certain scenarios.
- [BUGFIX] Admin client: Fix not invalidating in-memory cache sooner. If admin client cache is in-memory, then revert to the original maximum 1 minute cache expiration before
-admin.client.cache.expiration
was introduced. - [BUGFIX] Admin client: Fix ignoring
-admin-api.leader-election.client
configuration parameters.
Upstream Grafana Mimir details
- Version: 2.10.0
- Hash:
77906f76055188998b0f8f28b89aaeb68ab08feb
- Changelog: CHANGELOG.md
v2.9.4 – December 11th 2023
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.9.4
- Manifest digest:
sha256:9bd1106439216fac3d1e99414fcded85b39b59e00539a60f6a7966f1ecd5e575
- linux/amd64 digest:
sha256:f773e16b65856e754122c3fb5f909a8cd1a68ca94e2239a96c74efb6939a6895
- linux/arm64 digest:
sha256:151ad40bccfe112c6f007b37ca2281d118bb1dc927379943b0bed5c50b65db64
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Update Alpine to 3.18.5 to address CVE-2023-5363.
- [BUGFIX] Update Golang to 1.20.12 to address CVE-2023-39326.
Upstream Grafana Mimir details
- Version: 2.9.3
- Hash:
97e892ac6f17a682e831b744ad96cbc09a3b0d44
- Changelog: CHANGELOG.md
v2.9.3 – November 14th 2023
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.9.3
- Manifest digest:
sha256:e92e1fff77ef0a9d3a11a43d00ac5d9cf8b178b872f4e0e748480cb180138c45
- linux/amd64 digest:
sha256:ff6aa6376432a944739d0f1d42ee05519bb9f91e8f88b23963a2fe3cc919ed3c
- linux/arm64 digest:
sha256:6cda00e4a839c38a33e6e6c9acbeaa0e2fca564692486deb2698f86ee736867b
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Update otelhttp library to v0.44.0 as a mitigation for CVE-2023-45142.
- [BUGFIX] Admin client: fix memory leak.
Upstream Grafana Mimir details
- Version: 2.9.3
- Hash:
97e892ac6f17a682e831b744ad96cbc09a3b0d44
- Changelog: CHANGELOG.md
v2.9.2 – October 19th 2023
Links
Binary (Linux AMD64)
Docker images:
- Standard image:
docker pull grafana/enterprise-metrics:v2.9.2
- Manifest digest:
sha256:1a63c2ded4bd50aec629e86ce64def21ee6610b810f5ba7264a80f088b2aad80
- linux/amd64 digest:
sha256:3857c1d2705c02ff653608c317494416d3c1bb414ee2c81cedf608ad18ac464b
- linux/arm64 digest:
sha256:820766d44a554e8b6cf96ed2b27e6ef1d808df6ac45670354b0f676e6df2eaf0
- Manifest digest:
- Standard image:
License: Grafana Labs license
Changelog
- [BUGFIX] Update grpc-go library to 1.57.2-dev to fix a concurrency issue introduced in 1.57.1.
Upstream Grafana Mimir details
This is correct, Mimir version was not changed for this release.
- Version: 2.9.2
- Hash:
74bc18c39f0c81cd50b979103fd8ade62703f198
- Changelog: CHANGELOG.md
v2.9.1 – October 17th 2023
Links
Binary (Linux AMD64)
Docker image linux/amd64: run
docker pull grafana/enterprise-metrics:v2.9.1@sha256:7774e3243aab96ff87bcb9d0844a81a65e46761243a516983c5f39665e672adf
)Docker image linux/arm64: run
docker pull grafana/enterprise-metrics:v2.9.1@sha256:6142820316fd568cc21c2c6962e738b7766723c930491f73093babf7753743a0
)License: Grafana Labs license
Changelog
- [ENHANCEMENT] Update Go to version 1.20.10, grpc-go to version 1.57.1 and golang.org/x/net to version 0.17 to fix CVE-2023-44487.
Upstream Grafana Mimir details
- Version: 2.9.2
- Hash:
74bc18c39f0c81cd50b979103fd8ade62703f198
- Changelog: CHANGELOG.md
v2.9.0 – June 22nd 2023
Links
Binary (Linux AMD64)
Docker image linux/amd64: run
docker pull grafana/enterprise-metrics:v2.9.0@sha256:6d63e65e9b35ebce301ab2da1896e2448f9e2c8ac7c82a8bbce4af3dcfe98442
)Docker image linux/arm64: run
docker pull grafana/enterprise-metrics:v2.9.0@sha256:3bccd07e3e7a4efc70029c346debfb4636ee548cac6103e5b00a475023831d59
)License: Grafana Labs license
Changelog
- [FEATURE] Admin API: Add experimental audit logging for requests on the admin-api path. Enabled with
admin-api.auditlogging.enabled
. - [ENHANCEMENT] GRPC client config for components can now set three new advanced flags:
connect_timeout
,connect_backoff_base_delay
, andconnect_backoff_max_delay
. - [ENHANCEMENT] Graphite querier: applyByNode and useSeriesAbove functions are now correctly processed by CarbonAPI and not proxied to stock Graphite Web.
- [ENHANCEMENT] Publish a multi-architecture Docker image with x86 and ARM images.
- [ENHANCEMENT] Graphite querier: aliasQuery is now correctly processed by CarbonAPI and not proxied to stock Graphite Web.
- [ENHANCEMENT] Update base image to
alpine:3.18.2
. - [BUGFIX] Graphite querier: count function in summarize now correctly only counts non-NaN values.
- [BUGFIX] Graphite querier: smartSummarize is now correctly processed by CarbonAPI and not proxied to stock Graphite Web.
- [BUGFIX] Graphite querier: hitcount is now correctly processed by CarbonAPI and not proxied to stock Graphite Web.
- [BUGFIX] Graphite querier: certain bad requests no longer return 500 errors.
- [BUGFIX] Make gRPC proxy fail write request immediately if receiving request body times out, not updating
cortex_request_downstream_duration_seconds
metric. - [BUGFIX] Ruler: fix performance of max rule group enforcement when per-tenant max rule group limits are disabled.
Upstream Grafana Mimir details
- Version: 2.9.0
- Hash:
761114d8b026dac77ea94517fd9773632c32b42b
- Changelog: CHANGELOG.md
v2.8.1 – May 8th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.8.1
(digest:sha256:8ff9e898f79388d238c13954e566c30a9bf2cfc3f25d3c689bdfa7da3b09be0d
)License: Grafana Labs license
Changelog
- [BUGFIX] Ruler: fix performance of max rule group enforcement when per-tenant max rule group limits are disabled.
Upstream Grafana Mimir details
- Version: 2.8.0
- Hash:
f917e084d4a5596565b5debd659db0d4a5f9da6d
- Changelog: CHANGELOG.md
v2.8.0 – May 3rd 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.8.0
(digest:sha256:6e33054dbbde73480bded4a556cbbd291602a357a19c118b90910489c361f848
)License: Grafana Labs license
Changelog
- [CHANGE] Graphite querier: New default retry behavior for the querier whenever proxied requests to graphite web fail with a 502 error.
- [CHANGE] Graphite querier: the heavy rate limiting now tries to delay requests instead of dropping them directly and returning an error.
- [CHANGE] Removed
raw_increase
PromQL function. - [FEATURE] Experimental PromQL mapping for pre aggregated metrics: removed
-query-aggregations-enabled
and-query-aggregations-optonal
experimental flags, instead of that- Added
-query-aggregations-middleware-enabled
which can be used to add the middleware to the query path. - Added
-query-aggregations-mapping-enabled-default
which can be used to enable mapping by default. - Per-request
X-Query-Aggregations
header can be used withtrue
/false
values to force mapping regardless of the configuration. - Runtime limits
query_aggregations: mapping_enabled: true/false
configuration can be used to enable mapping on per-tenant basis.
- Added
- [ENHANCEMENT] Include timezone data in Docker image. In particular, this enables the use of timezones in alertmanager silencing rules.
- [ENHANCEMENT] Update all images to use Alpine 3.17.3.
- [BUGFIX] Ruler: fix parsing of Mimir-specific YAML fields under
ruler.query_frontend
.
Upstream Grafana Mimir details
- Version: 2.8.0
- Hash:
f917e084d4a5596565b5debd659db0d4a5f9da6d
- Changelog: CHANGELOG.md
v2.7.2 – May 4th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.7.2
(digest:sha256:886208b41108be72cb98f642f00a5cbfedc886a1fdb387135873ac6e5f2c25b8
)License: Grafana Labs license
Changelog
- [BUGFIX] Updated Go to version 1.20.4 to fix CVE-2023-24539, CVE-2023-24540, CVE-2023-29400.
Upstream Grafana Mimir details
- Version: 2.7.1
- Hash:
dbe4ccd39cb58391e6174441ae11ae25bfebd395
- Changelog: CHANGELOG.md
v2.7.1 – April 20th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.7.1
(digest:sha256:37bfcfb7cd9d287e7259e479b45a596cba548ec950325b9348391eabe03ac961
)License: Grafana Labs license
Changelog
- [BUGFIX] Updated Go to version 1.20.3 to fix CVE-2023-24538.
Upstream Grafana Mimir details
- Version: 2.7.1
- Hash:
dbe4ccd39cb58391e6174441ae11ae25bfebd395
- Changelog: CHANGELOG.md
v2.7.0 – March 17th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.7.0
(digest:sha256:0410f5cf48cef116b1e08b6d8bbc8442e3396c65dafc85b50d5141c10a855a1c
)License: Grafana Labs license
Changelog
- [FEATURE] Adding new PromQL function
raw_increase
which will be used to perform aggregations. - [FEATURE] Added experimental PromQL mapping middleware that adds support for pre-aggregated time series.
- Added
-query-aggregations-enabled
which can be used to add the middleware to the query path. - Added
-query-aggregations-optonal
which can be used to make the query mapping optional unlessX-Query-Aggregations: true
header is passed (defaulttrue
, set tofalse
to enforce).
- Added
- [ENHANCEMENT] Added remote cache support for object storage client.
- The following CLI flags (and their respective YAML config options) have been added:
-admin.client.cache.expiration
-admin.client.cache.type
-admin.client.cache.memcached.addresses
-admin.client.cache.memcached.max-async-buffer-size
-admin.client.cache.memcached.max-async-concurrency
-admin.client.cache.memcached.max-get-multi-batch-size
-admin.client.cache.memcached.max-get-multi-concurrency
-admin.client.cache.memcached.max-idle-connections
-admin.client.cache.memcached.max-item-size
-admin.client.cache.memcached.timeout
-admin.client.cache.redis.endpoint
-admin.client.cache.redis.username
-admin.client.cache.redis.password
-admin.client.cache.redis.db
-admin.client.cache.redis.master-name
-admin.client.cache.redis.connection-pool-size
-admin.client.cache.redis.dial-timeout
-admin.client.cache.redis.idle-timeout
-admin.client.cache.redis.read-timeout
-admin.client.cache.redis.write-timeout
-admin.client.cache.redis.min-idle-connections
-admin.client.cache.redis.max-connection-age
-admin.client.cache.redis.max-async-buffer-size
-admin.client.cache.redis.max-async-concurrency
-admin.client.cache.redis.max-get-multi-batch-size
-admin.client.cache.redis.max-get-multi-concurrency
-admin.client.cache.redis.max-item-size
-admin.client.cache.redis.tls-enabled
-admin.client.cache.redis.tls-ca-path
-admin.client.cache.redis.tls-cert-path
-admin.client.cache.redis.tls-key-path
-admin.client.cache.redis.tls-server-name
-admin.client.cache.redis.tls-insecure-skip-verify
-admin.client.cache.redis.tls-min-version
-admin.client.cache.redis.tls-cipher-suites
-admin.client.cache.refresh-interval
- The following CLI flags (and their respective YAML config options) have been added:
- [ENHANCEMENT] Update all base images from
alpine:3.17.1
toalpine:3.17.2
.
Upstream Grafana Mimir details
- Version: 2.7.1
- Hash:
dbe4ccd39cb58391e6174441ae11ae25bfebd395
- Changelog: CHANGELOG.md
v2.6.1 – April 21st 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.6.1
(digest:sha256:f53020d39b991143cd88b1ef0474ad40b589672eb04f0f379e4e579a388957f5
)License: Grafana Labs license
Changelog
- [ENHANCEMENT] Update all base images from
alpine:3.17.1
toalpine:3.17.3
. - [BUGFIX] Updated Go to version 1.19.8 to fix CVE-2023-24538.
Upstream Grafana Mimir details
- Version: 2.6.0
- Hash:
27698f399fc9e13c6fe0a8c79f882993814fda4a
- Changelog: CHANGELOG.md
v2.6.0 – February 22nd 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.6.0
(digest:sha256:51c6c3f9decc4a4422b0ac34e88a278a6f8a992032747159220935c828f3d372
)License: Grafana Labs license
Changelog
- [CHANGE] Graphite querier: add experimental support for optional rate limitting at the subqueries level, using the
.max-concurrent-sub-queries-per-request
flag. - [CHANGE] Graphite querier: the storage aggregation method set in
storage-aggregation.conf
can no longer be overridden during runtime usingconsolidateBy
when metrictank is used as a render engine for remote queries. - [CHANGE] Graphite querier:
/tags/autoComplete/values
now takes input time range into account. Previously only tag values in the last hour were returned. - [CHANGE] Graphite querier: The
to
/until
parameter for/find
and/tags
endpoints are now respected by the querier, though ifto
/until
is greater than the current time, it’s adjusted to the current time. Previously this value was always overwritten by the current time. - [FEATURE] Graphite querier: Add support for
/metrics/expand
endpoint. - [ENHANCEMENT] Refactor caching logic in the versioned bucket client to reduce the number of requests to object storage.
- [ENHANCEMENT] Update all base images from
alpine:3.16.2
toalpine:3.17.1
. - [BUGFIX] Graphite querier: flush metric name cache for remote queries.
- [BUGFIX] Graphite querier: fix panic when running certain combinations of functions.
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance.
Upstream Grafana Mimir details
- Version: 2.6.0
- Hash:
27698f399fc9e13c6fe0a8c79f882993814fda4a
- Changelog: CHANGELOG.md
v2.5.3 – April 21st 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.5.3
(digest:sha256:000177d4349224b58092e9f800a311a36d1c66376bc77f53a026d6542865dcc8
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix CVE-2023-24538 by upgrading to Go 1.19.8.
Upstream Grafana Mimir details
- Version: 2.5.0
- Hash:
25533fdfcf5d8e26ee8f49bcaf13e30bd678d4b3
- Changelog: CHANGELOG.md
v2.5.2 – February 17th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.5.2
(digest:sha256:760a027454e44cf3c817359eb7c27482b8355d7254dea15b4714c8316a43c011
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance
Upstream Grafana Mimir details
- Version: 2.5.0
- Hash:
25533fdfcf5d8e26ee8f49bcaf13e30bd678d4b3
- Changelog: CHANGELOG.md
v2.5.1 – January 6th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.5.1
(digest:sha256:da2a349151c1fe42c7e952aef1835ce33ab88fbd21f201e0aa059aa8d8e8a4bb
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix empty buildinfo in GEM binary
Upstream Grafana Mimir details
- Version: 2.5.0
- Hash:
25533fdfcf5d8e26ee8f49bcaf13e30bd678d4b3
- Changelog: CHANGELOG.md
v2.5.0 – December 15th 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.5.0
(digest:sha256:ffe4873a520c981a0c3ea2ef0531846cad38cf2a96dc62d6a3432bb0a609c95e
)License: Grafana Labs license
Changelog
- [CHANGE] Flag -*.azure.msi-resource is now ignored, and will be removed in Mimir 2.7. This setting is now made automatically by Azure.
- [CHANGE] Graphite querier: the storage aggregation method set in storage-aggregation.conf can no longer be overridden during runtime using consolidateBy when metrictank is used as a render engine. This matches Graphite’s behavior.
- [CHANGE] Graphite querier: caches default TTL is now lowered to 10 minutes. This is done to keep consistency in the event that out-of-order ingestion is enabled in mimir so that graphite queries answer with latest available data instead of caching responses for days.
- [ENHANCEMENT] Added
.tls-min-version and .tls-cipher-suites flags to configure cipher suites and min TLS version supported by servers. - [ENHANCEMENT] All: Add clustername label to cpu usage metrics (cortex_quota_cpu_count, cortex_quota_gomaxprocs, cortex_quota_cgroup_cpu_max, cortex_quota_cgroup_cpu_period). The value is the cluster name in the GEM license.
- [ENHANCEMENT] Add recording rules to fulfill requirements for all Mimir mixin dashboards. The following recording rules have been added to GEM Self Monitoring to better align with the Mimir mixin: target:cortex_ingester_queried_exemplars:99quantile target:cortex_ingester_queried_exemplars:50quantile target:cortex_ingester_queried_exemplars:avg target:cortex_ingester_queried_exemplars_bucket:sum_rate target:cortex_ingester_queried_exemplars_sum:sum_rate target:cortex_ingester_queried_exemplars_count:sum_rate target_instance:cortex_alertmanager_alerts:sum target_instance:cortex_alertmanager_silences:sum target:cortex_alertmanager_state_replication_total:rate5m target:cortex_alertmanager_state_replication_failed_total:rate5m cortex_alertmanager_alerts_invalid_total:rate5m target:cortex_alertmanager_alerts_received_total:rate5m target:cortex_alertmanager_partial_state_merges_total:rate5m target:cortex_alertmanager_partial_state_merges_failed_total:rate5m target_integration:cortex_alertmanager_notifications_total:rate5m target_integration:cortex_alertmanager_notifications_failed_total:rate5m
- [ENHANCEMENT] Optimise the latest version lookup mechanism for versioned bucket client to reduce count of requests to object storage. Now it uses binary search instead of using sequential search.
- [BUGFIX] Fixed a bug in the Graphite querier where render requests that failed to be processed by the native engine were not being proxied to Graphite web.
Upstream Grafana Mimir details
- Version: 2.5.0
- Hash:
25533fdfcf5d8e26ee8f49bcaf13e30bd678d4b3
- Changelog: CHANGELOG.md
v2.4.2 – April 21st 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.4.2
(digest:sha256:589072e2e9802c820ad3a281c09f46266dd42c3b98fed07ab741d4ee454fcf9f
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix CVE-2023-24538 by upgrading to Go 1.19.8.
Upstream Grafana Mimir details
- Version: 2.4.0
- Hash:
32137ee2c4c41fa649abfb9582e1f33a9e13363b
- Changelog: CHANGELOG.md
v2.4.1 – February 17th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.4.1
(digest:sha256:86bc899a450e1f052e2dd4d7fd55435711068cfefa6d5da541a1fa59feaa69e6
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance
Upstream Grafana Mimir details
- Version: 2.4.0
- Hash:
32137ee2c4c41fa649abfb9582e1f33a9e13363b
- Changelog: CHANGELOG.md
v2.4.0 – November 14th 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.4.0
(digest:sha256:1f56acfb6c9ddbb5d6e961401ba55963ae51752889a1b5536b840837df8f44be
)License: Grafana Labs license
Changelog
- [CHANGE] CarbonAPI is now being used instead of MetricTank as the default native query engine for the Graphite querier.
- [CHANGE] Enterprise metrics docker image no longer requests the CAP_NET_BIND_SERVICE capability as the default HTTP port was changed from 80 to 8080.
- If you set
-server.http-listen-port
or-server.grpc-listen-port
to a value lower than 1024, then you need to modify your configurationWhen using Docker provide the flag
--cap-add net_bind_service
.When using the
mimir-distributed
Helm chart, make sure that all the GEM components have the following additional securityContext setting in their respective values file sections:securityContext: sysctls: - name: net.ipv4.ip_unprivileged_port_start value: "0" # might be set to the lowest listen port number as well
- If you set
- [FEATURE] Added a new flag
-graphite.querier.cache-ttl
to the Graphite querier to configure the TTL of cached metric names and aggregation configs. - [FEATURE] Added optional rate limiting capabilities to the Graphite querier.
- This can be configured using the following flags:
-graphite.querier.rate-limit-enabled
-graphite.querier.rate-limit-qps
-graphite.querier.tenant-rate-limit-qps
-graphite.querier.heavy-rate-limit-qps
- This can be configured using the following flags:
- [ENHANCEMENT] Ruler: Add
<prometheus-http-prefix>/api/v1/status/buildinfo
endpoint. - [ENHANCEMENT] Update all build images to use Go 1.19.2.
- [BUGFIX] Fix CVE-2022-44643
Upstream Grafana Mimir details
- Version: 2.4.0
- Hash:
32137ee2c4c41fa649abfb9582e1f33a9e13363b
- Changelog: CHANGELOG.md
v2.3.3 – April 21st 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.3.3
(digest:sha256:28ff68b1e37ae46b3b9ec749d3a437d6a4d1187c7e32c428a46bd15755d6c96a
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix CVE-2023-24538 by upgrading to Go 1.19.8.
Upstream Grafana Mimir details
- Version: v2.3.1
- Hash:
e18bcff2f8648c4edb8446bff0256a070717684c
- Changelog: CHANGELOG.md
v2.3.2 – February 17th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.3.2
(digest:sha256:dfba678a8b13647634dc9fa021ff1e8d9f23741d2e8590552c5ad22bedd59c81
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix issue where authentication caches were not sized correctly resulting in poor performance
Upstream Grafana Mimir details
- Version: v2.3.1
- Hash:
e18bcff2f8648c4edb8446bff0256a070717684c
- Changelog: CHANGELOG.md
v2.3.1 – November 14th 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.3.1
(digest:sha256:d697519012b4f8307ea3f39774235e99d0e5f9c498c7e93685551597179340b3
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix CVE-2022-44643
Upstream Grafana Mimir details
- Version: v2.3.1
- Hash:
e18bcff2f8648c4edb8446bff0256a070717684c
- Changelog: CHANGELOG.md
v2.3.0 – September 28th 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.3.0
(digest:sha256:0cb46f23551037c8f9df40572d5a09876a04cb59536ff7a06eb558c2e1bf558e
)License: Grafana Labs license
Changelog
- [CHANGE] Gateway: Dial timeout now defaults to 5s instead of 30s.
- [CHANGE] Gateway: Dialing gRPC proxy backends during startup now blocks until the connection is established.
- [FEATURE] Gateway: Add support for TSDB block upload routes.
- [FEATURE] Admin client:
common
config block introduced in Mimir now configures Admin Client in GEM too. - [ENHANCEMENT] Gateway: the CLI flag
-gateway.request.limit
has been added for configuring request limiter middleware. - [ENHANCEMENT] Update all build images to use Go 1.18.6.
- [ENHANCEMENT] Update all images to use Alpine 3.16.2.
- [ENHANCEMENT] Gateway: Dial timeout is now configurable via
-gateway.proxy.*.dial-timeout
. - [BUGFIX] Gateway: Expose /distributor/ring endpoint on the distributors.
- [BUGFIX] LBAC: some query limits would not be applied for requests that use LBAC.
Upstream Grafana Mimir details
- Version: v2.3.1
- Hash:
e18bcff2f8648c4edb8446bff0256a070717684c
- Changelog: CHANGELOG.md
v2.2.2 – April 24th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.2.2
(digest:sha256:1f8eb9f3ed6cc86442cafb394363d241fd27d497da042b7a3ef6a4d59e509bbb
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix CVE-2023-24538 by upgrading to Go 1.19.8.
Upstream Grafana Mimir details
- Version: v2.2.0
- Hash:
65344e2ed2cf305b50de805824026f5c5a6fadcf
- Changelog: CHANGELOG.md
v2.2.0 – July 21st 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.2.0
(digest:sha256:5165f84eeb399c1701757efc5a3f9219422bc43935bf995ea7e3d31417b2d6cb
)License: Grafana Labs license
Changelog
- [CHANGE] Ruler:
/api/v1/rules*
and/prometheus/rules*
configuration endpoints are removed in favour of/prometheus/config/v1/rules*
. Requests through the gateway are unaffected. - [CHANGE] The remote subquerier for the Graphite query proxy is no longer optional
- The following CLI flags (and their respective YAML config options) have been removed:
-graphite.querier.enable-remote-subquerier
-graphite.querier.use-remote-results
- The following CLI flags (and their respective YAML config options) have been removed:
- [CHANGE] The YAML config options for the
datadog.api
have been broken out intodatadog.read_api
anddatadog.write_api
- [ENHANCEMENT] Admin-client: added experimental support for refreshing authentication cache entries before they expire. When enabled, a cache entry is refreshed and its time to live is extended if it is retrieved and has less than or equal to
-auth.cache.refresh.refresh-ttl
time left to live in the cache.- The following CLI flags (and their respective YAML config options) have been added:
-auth.cache.refresh.buffer
-auth.cache.refresh.concurrency
-auth.cache.refresh.enabled
-auth.cache.refresh.refresh-ttl
-auth.cache.refresh.retry-interval
- The following CLI flags (and their respective YAML config options) have been added:
- [ENHANCEMENT] Gateway: Rewrite requests to deleted ruler configuration endpoints to use supported endpoints.
- [BUGFIX] Docs: Make config category labels consistent across command-line help and generated documentation.
Upstream Grafana Mimir details
- Version: v2.2.0
- Hash:
65344e2ed2cf305b50de805824026f5c5a6fadcf
- Changelog: CHANGELOG.md
v2.1.2 – April 24th 2023
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.1.2
(digest:sha256:a5be17c644c96698adb6fa582844de476d1ab65f6b73b4085407da8a8bc11db6
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix CVE-2023-24538 by upgrading to Go 1.19.8.
Upstream Grafana Mimir details
- Version: v2.1.0
- Hash:
3cff860d16e08d14e8aaa10649053a9c0f0f15a7
- Changelog: CHANGELOG.md
v2.1.0 – June 2nd 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.1.0
(digest:sha256:d02650b34c77cb5130b23790c958f658de8a5634f4d66f24dcb631ae7ba34b99
)License: Grafana Labs license
Changelog
- [FEATURE] Ruler: Added support for expression remote evaluation.
- The following CLI flags (and their respective YAML config options) have been added:
-ruler.query-frontend.address
-ruler.query-frontend.auth-token
-ruler.query-frontend.tls-enabled
-ruler.query-frontend.tls-ca-path
-ruler.query-frontend.tls-cert-path
-ruler.query-frontend.tls-key-path
-ruler.query-frontend.tls-server-name
-ruler.query-frontend.tls-insecure-skip-verify
- The following CLI flags (and their respective YAML config options) have been added:
- [ENHANCEMENT] Self-monitoring: Emit OOM kill and page fault metrics as part of self-monitoring.
- [BUGFIX] Ruler API: Ruler Limits are now enforced during rule group creation.
- [BUGFIX] Authentication: Expose internal errors during authentication only in logs, not to clients.
Upstream Grafana Mimir details
- Version: v2.1.0
- Hash:
3cff860d16e08d14e8aaa10649053a9c0f0f15a7
- Changelog: CHANGELOG.md
v2.0.1 – April 14th 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.0.1
(digest:sha256:30c80aa0612aed4e0bab24f9e5c817a112f0bbdfa7b51404a069474d706ceaee
)License: Grafana Labs license
Changelog
- [BUGFIX] Authentication: Only include active tenants when resolving the wildcard tenant (
*
).
Upstream Grafana Mimir details
No changes since GEM v2.0.0:
- Version: v2.0.0
- Hash:
9fd2da5d3dc764fc00e4396a5c0ddd12ccebb00d
- Changelog: CHANGELOG.md
v2.0.0 – April 13th 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v2.0.0
(ID:sha256:43ed80839bd0cb1d799087d5591a8873cfaead182683055bbb8aa207efcf8a5f
, Repo digest:sha256:338bbcf64ea051cc3911908b977ae3b7bb8ed65342e7a2f8df3f781aa0f5e61a
)License: Grafana Labs license
Changelog
- [CHANGE] Admin-API: enable leader election by default
- [CHANGE] Change default value of
instrumentation.enabled
totrue
- [CHANGE] Graphite Querier: The GRPC server is now registered to enable subquerier requests. This requires using the flag EnableRemoteSubquerier.
- [CHANGE] Graphite Querier: The remote read query is now the default behavior. Also, the previous implementation has been removed.
- [CHANGE] Admin-API: Change
auth.type
default fromtrust
toenterprise
- [CHANGE] Limits: The
max_series_per_query
limit has been removed from the Admin API and runtime configuration and is no longer enforced by GEM during queries. - [CHANGE] Graphite: The Graphite Querier and Graphite Write Proxy have been removed from single binary mode (the
all
target). They can still be run using thegraphite-querier
andgraphite-write-proxy
targets, respectively. - [CHANGE] Query-frontend and Graphite Querier: migrated memcached backend client to the same one used in other components (memcached config and metrics are now consistent across all services).
- The following CLI flags (and their respective YAML config options) have been added:
-graphite.querier.metric-name-cache.backend
(set it tomemcached
)-graphite.querier.aggregation-cache.backend
(set it tomemcached
)
- The following CLI flags (and their respective YAML config options) have been changed:
-graphite.querier.metric-name-cache.memcached.hostname
and-graphite.querier.metric-name-cache.memcached.service
: use-graphite.querier.metric-name-cache.memcached.addresses
instead-graphite.querier.aggregation-cache.memcached.hostname
and-graphite.querier.aggregation-cache.memcached.service
: use-graphite.querier.aggregation-cache.memcached.addresses
instead
- The following CLI flags (and their respective YAML config options) have been renamed:
-graphite.querier.metric-name-cache.background.write-back-concurrency
renamed to-graphite.querier.metric-name-cache.memcached.max-async-concurrency
-graphite.querier.metric-name-cache.background.write-back-buffer
renamed to-graphite.querier.metric-name-cache.memcached.max-async-buffer-size
-graphite.querier.metric-name-cache.memcached.batchsize
renamed to-graphite.querier.metric-name-cache.memcached.max-get-multi-batch-size
-graphite.querier.metric-name-cache.memcached.parallelism
renamed to-graphite.querier.metric-name-cache.memcached.max-get-multi-concurrency
-graphite.querier.metric-name-cache.memcached.timeout
renamed to-graphite.querier.metric-name-cache.memcached.timeout
-graphite.querier.metric-name-cache.memcached.max-item-size
renamed to-graphite.querier.metric-name-cache.memcached.max-item-size
-graphite.querier.metric-name-cache.memcached.max-idle-conns
renamed to-graphite.querier.metric-name-cache.memcached.max-idle-connections
-graphite.querier.aggregation-cache.background.write-back-concurrency
renamed to-graphite.querier.aggregation-cache.memcached.max-async-concurrency
-graphite.querier.aggregation-cache.background.write-back-buffer
renamed to-graphite.querier.aggregation-cache.memcached.max-async-buffer-size
-graphite.querier.aggregation-cache.memcached.batchsize
renamed to-graphite.querier.aggregation-cache.memcached.max-get-multi-batch-size
-graphite.querier.aggregation-cache.memcached.parallelism
renamed to-graphite.querier.aggregation-cache.memcached.max-get-multi-concurrency
-graphite.querier.aggregation-cache.memcached.timeout
renamed to-graphite.querier.aggregation-cache.memcached.timeout
-graphite.querier.aggregation-cache.memcached.max-item-size
renamed to-graphite.querier.aggregation-cache.memcached.max-item-size
-graphite.querier.aggregation-cache.memcached.max-idle-conns
renamed to-graphite.querier.aggregation-cache.memcached.max-idle-connections
- The following CLI flags (and their respective YAML config options) have been removed:
-graphite.querier.aggregation-cache.default-validity
: new setting is hardcoded to 7 days-graphite.querier.aggregation-cache.memcached.circuit-breaker-consecutive-failures
: feature removed-graphite.querier.aggregation-cache.memcached.circuit-breaker-interval
: feature removed-graphite.querier.aggregation-cache.memcached.circuit-breaker-timeout
: feature removed-graphite.querier.aggregation-cache.memcached.consistent-hash
: new setting is always enabled-graphite.querier.aggregation-cache.memcached.update-interval
: new setting is hardcoded to 30s-graphite.querier.metric-name-cache.default-validity
and-frontend.memcached.expiration
: new setting is hardcoded to 7 days-graphite.querier.metric-name-cache.memcached.circuit-breaker-consecutive-failures
: feature removed-graphite.querier.metric-name-cache.memcached.circuit-breaker-interval
: feature removed-graphite.querier.metric-name-cache.memcached.circuit-breaker-timeout
: feature removed-graphite.querier.metric-name-cache.memcached.consistent-hash
: new setting is always enabled-graphite.querier.metric-name-cache.memcached.update-interval
: new setting is hardcoded to 30s
- The following metrics have been changed:
cortex_cache_dropped_background_writes_total{name}
changed tothanos_memcached_operation_skipped_total{name, operation, reason}
cortex_cache_value_size_bytes{name, method}
changed tothanos_memcached_operation_data_size_bytes{name}
cortex_cache_request_duration_seconds{name, method, status_code}
changed tothanos_memcached_operation_duration_seconds{name, operation}
cortex_cache_fetched_keys{name}
changed tothanos_cache_memcached_requests_total{name}
cortex_cache_hits{name}
changed tothanos_cache_memcached_hits_total{name}
cortex_memcache_request_duration_seconds{name, method, status_code}
changed tothanos_memcached_operation_duration_seconds{name, operation}
cortex_memcache_client_servers{name}
changed tothanos_memcached_dns_provider_results{name, addr}
cortex_memcache_client_set_skip_total{name}
changed tothanos_memcached_operation_skipped_total{name, operation, reason}
cortex_dns_lookups_total
changed tothanos_memcached_dns_lookups_total
- For all metrics the value of the “name” label has changed from
frontend.memcached
tofrontend-cache
. - Above mentioned metrics are now also available with name=
metric-name
and name=aggregations
for caches used by Graphite Querier.
- The following metrics have been removed:
cortex_cache_background_queue_length{name}
- The following CLI flags (and their respective YAML config options) have been added:
- [CHANGE] Compactor:
-compactor.compaction-strategy
option removed. The only compactor that can be now used is “split and merge” compactor. - [CHANGE] Graphite: Enabled distributed subqueries by default and renamed
remote_write
YAML flags.-graphite.querier.use-remote-results
and-graphite.querier.enable-remote-subquerier
now default totrue
. This means by default subqueries will be distributed across queriers.remote_write
YAML flags have been renamed:keepalive
has been renamed tokeep_alive
maxidleconns
has been renamed tomax_idle_conns
maxconns
has been renamed tomax_conns
skiplabelvalidation
has been renamed toskip_label_validation
.
- [FEATURE] Admin-API Deletion Markers:
- Update
status
field in tenants - Add
status
field to access policies and tokens - Add new Admin API v3 endpoints with soft-deletion of entities
- /admin/api/v3/accesspolicies
- /admin/api/v3/clusters
- /admin/api/v3/features
- /admin/api/v3/licenses
- /admin/api/v3/tenants
- /admin/api/v3/tokens
- List endpoints only return entities in active status
- Update HTTP authentication layer to only authorize requests of active entities
- Update storage cache logic to only store the object’s latest version
- Add v3 endpoints to gateway routes
- Update
- [FEATURE] Graphite Write Proxy: Added
-graphite.remote-write-proxy.enabled
,-graphite.remote-write-proxy.write-endpoint
and-graphite.write-proxy.skip-label-validation
to enhance the internal series write performance of the graphite writer. It’s recommended to enable this flag on every installation as soon as possible because it will become a default configuration in future releases. - [FEATURE] Ruler: Added federated rule groups support.
- Exposed
cortex_ruler_sync_unauthorized_groups
metric to track the number of skipped rule groups during storage synchronizations.
- Exposed
- [FEATURE] Divide configuration parameters into categories “basic”, “advanced”, and “experimental”. Only flags in the basic category are shown when invoking
-help
, whereas-help-all
will include flags in all categories (basic, advanced, experimental). - [FEATURE] Datadog: Added experimental support for ingesting and querying Datadog metrics by adding a Datadog translation layer on top of GEM.
- [FEATURE] Gateway: Forward requests to deprecated and removed endpoints in Mimir 2.0 (grafana/mimir#763) to their non-legacy equivalents.
- [ENHANCEMENT] Update all build images to use Go 1.17.8.
- [ENHANCEMENT] Admin-API: Allow the
max_global_exemplars_per_user
limit to be set via the Admin API. - [ENHANCEMENT] Admin-API: Enable
compactor_blocks_retention_period
to be set on a per-tenant basis via the Admin API. - [ENHANCEMENT] Querier: Apply Label Based Access Policy (LBAC) rules to exemplar endpoints.
- [ENHANCEMENT] Federation frontend: Add
bearer_token
configuration for proxy targets. - [ENHANCEMENT] Self-monitoring: Add support for emitting exemplars as part of self-monitoring metrics.
- [ENHANCEMENT] Federation frontend: Return richer error when downstream data source is failed.
- [BUGFIX] Graphite: no need to configure Mimir’s queryable when starting only
-target=graphite-querier
. - [BUGFIX] Graphite: When configured with enterprise authentication, requests sent to cortex remote read api now forward authorization headers if present.
- [BUGFIX] LBAC: Filter label values using LBAC policies correctly.
- [BUGFIX] Authentication: HTTP 500 errors are now returned for transient errors while attempting to authenticate user requests.
- [BUGFIX] Authentication: Do not cache transient errors while attempting to authenticate user requests.
- [BUGFIX] Config: Enterprise configuration extensions now appear in the
/config
endpoint - [BUGFIX] Admin: Validate the access policy name used for token generation.
- [BUGFIX] Admin: Fixed a cosmetic issue that could report an incorrect license expiration timestamp in the metric
grafana_labs_license_expiry_timestamp
if multiple valid licenses exist in local storage and object storage. - [BUGFIX] Gateway: All Alertmanager endpoints are correctly proxied to the alertmanager backend proxy.
Previously, only the
/alertmanager
endpoint was proxied. Users were able to authenticate but not access the alerts UI page at/alertmanager/#/alerts
.
Upstream Grafana Mimir details
- Version: v2.0.0
- Hash:
9fd2da5d3dc764fc00e4396a5c0ddd12ccebb00d
- Changelog: CHANGELOG.md
v1.7.1 – November 14th 2022
Links
Binary (Linux AMD64)
Docker image: run
docker pull grafana/enterprise-metrics:v1.7.1
(digest:sha256:84576bd0bab9beb98f6c93e6b9d91dc4efc3e5434747c43ca2bd84863219c8c6
)License: Grafana Labs license
Changelog
- [BUGFIX] Fix CVE-2022-44643
v1.7.0 – January 6th 2022
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.7.0
(digest:sha256:286ce03b3dcd50c7924ee6860d58b2bd7986c9548cc6fe6207d23b0212883c33
)License: Grafana Labs license
Changelog
- [FEATURE] Admin-API: Added support for Azure Storage
- [ENHANCEMENT] Federation Frontend: Propagate requests’ bearer token when it is present.
- [ENHANCEMENT] Federation Frontend: Support TLS configuration for targets.
v1.6.2 – January 6th 2022
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.6.2
(digest:sha256:48fef5ef7a339d766274a37448e1c3745fde53ec0e2f4eab1a8a093a786d41d2
)License: Grafana Labs license
Changelog
- [BUGFIX] GEM update from v1.5.0 (or older) to v.1.6+ will not invalidate tenant limits set via API anymore.
v1.6.1 – November 18th 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.6.1
(digest:sha256:66f9eb4cee53df7b95860b1d094cae1dca88e1724de3695fec0449f92fe1db90
)License: Grafana Labs license
Changelog
- [BUGFIX] Admin-API: Make sure that read-path limits inherit defaults from global limits.
v1.6.0 – November 15th 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.6.0
(digest:sha256:1e01fe4d792b53b9a4d37c38a612c2027582d6d7248f567ed31e2ed6102c035d
)License: Grafana Labs license
Changelog
- [CHANGE] Admin-client: Rename the “default” auth method to “trust”.
- [CHANGE] License: Deprecated flag
-bootstrap.license.path
has been removed. The new flag to use for specifying a license is-license.path
. - [CHANGE] Ruler: endpoints for listing rules (
/api/v1/rules
,/api/v1/rules/{namespace}
) now return HTTP status code 200 and an empty map when there are no rules instead of an HTTP 404 and plain text error message. - [CHANGE] Query-frontend: added
sharded
label tocortex_query_seconds_total
metric. - [CHANGE] Query-frontend: changed the flag name for controlling query sharding total shards from
-querier.total-shards
to-frontend.query-sharding-total-shards
. - [CHANGE] Flag
-querier.parallelise-shardable-queries
has been renamed to-query-frontend.parallelize-shardable-queries
- [CHANGE] Querier/ruler: Option
-querier.ingester-streaming
has been removed. Querier/ruler now always use streaming method to query ingesters. - [CHANGE] Limits: Option
-ingester.max-samples-per-query
is now deprecated. YAML fieldmax_samples_per_query
is no longer supported. It required-querier.ingester-streaming
option to be set to false, but since-querier.ingester-streaming
is removed (always defaulting to true), the limit using it was removed as well. - [CHANGE] Limits: Set the default max number of inflight ingester push requests (
-ingester.instance-limits.max-inflight-push-requests
) to 30000 in order to prevent clusters from being overwhelmed by request volume or temporary slow-downs. - [CHANGE] Update Go version to 1.16.9.
- [CHANGE] Admin-API: Require that tenant updates include the
status
field. - [FEATURE] Querier: Added label names cardinality endpoint
<prefix>/api/v1/cardinality/label_names
that is disabled by default. Can be enabled/disabled via the CLI flag-querier.cardinality-analysis-enabled
or its respective YAML config option. Configurable on a per-tenant basis. - [FEATURE] Querier: Added label values cardinality endpoint
<prefix>/api/v1/cardinality/label_values
that is disabled by default. Can be enabled/disabled via the CLI flag-querier.cardinality-analysis-enabled
or its respective YAML config option. Configurable on a per-tenant basis. - [FEATURE] Compactor: added support for a new compaction strategy
-compactor.compaction-strategy=split-and-merge
. When thesplit-and-merge
compactor is used, source blocks for a given tenant are grouped into-compactor.split-groups
number of groups. Each group of blocks is then compacted separately, and is split into-compactor.split-and-merge-shards
shards (configurable on a per-tenant basis). Compaction of each tenant shards can be horizontally scaled. Number of compactors that work on jobs for single tenant can be limited by using-compactor.compactor-tenant-shard-size
parameter, or per-tenantcompactor_tenant_shard_size
override. - [FEATURE] Query Frontend: Updated experimental querysharding for the blocks storage. You can now enabled querysharding for blocks storage (
-store.engine=blocks
) by setting-query-frontend.parallelize-shardable-queries
totrue
. The following additional config and exported metrics have been added.- New config options:
-frontend.query-sharding-total-shards
: The amount of shards to use when doing parallelisation via query sharding.-frontend.query-sharding-max-sharded-queries
: The max number of sharded queries that can be run for a given received query. 0 to disable limit.-blocks-storage.bucket-store.series-hash-cache-max-size-bytes
: Max size - in bytes - of the in-memory series hash cache in the store-gateway.-blocks-storage.tsdb.series-hash-cache-max-size-bytes
: Max size - in bytes - of the in-memory series hash cache in the ingester.
- New exported metrics:
cortex_bucket_store_series_hash_cache_requests_total
cortex_bucket_store_series_hash_cache_hits_total
cortex_frontend_query_sharding_rewrites_succeeded_total
cortex_frontend_sharded_queries_per_query
- Renamed metrics:
cortex_frontend_mapped_asts_total
tocortex_frontend_query_sharding_rewrites_attempted_total
- Modified metrics:
- added
sharded
label tocortex_query_seconds_total
- added
- When query sharding is enabled, the following querier config must be set on query-frontend too:
-querier.max-concurrent
-querier.timeout
-querier.max-samples
-querier.at-modifier-enabled
-querier.default-evaluation-interval
-querier.active-query-tracker-dir
-querier.lookback-delta
- Sharding can be dynamically controlled per request using the
Sharding-Control: 64
header. (0 to disable) - Sharding can be dynamically controlled per tenant using the limit
query_sharding_total_shards
. (0 to disable) - Added
sharded_queries
count to the “query stats” log. - Number of shards is adjusted to be compatible with number of compactor shards used by split-and-merge compactor. Querier can use this to avoid querying blocks that cannot have series in given query shard. This only works when using split-and-merge compactor.
- New config options:
- [FEATURE] Graphite: Added
-graphite.querier.remote-read-enabled
and-graphite.querier.query-address
to enhance the internal query performance of the graphite querier. It’s recommended to enable this flag on every installation as soon as possible because it will become a default configuration in future releases. - [FEATURE] Ingester: Enable snapshotting of in-memory TSDB on disk during shutdown via
-blocks-storage.tsdb.memory-snapshot-on-shutdown
. - [FEATURE] Query-Frontend: Added
-query-frontend.cache-unaligned-requests
option to cache responses for requests that do not have step-aligned start and end times. This can improve speed of repeated queries, but can also pollute cache with results that are never reused. - [ENHANCEMENT] Admin-client: Make the cluster_name configuration optional.
- [ENHANCEMENT] Admin-API: Add new Admin API v2 endpoints that replace the term ‘instance’ used in version v1 with the term ’tenant’
- /admin/api/v2/accesspolicies
- /admin/api/v2/clusters
- /admin/api/v2/features
- /admin/api/v2/licenses
- /admin/api/v2/tenants
- /admin/api/v2/tokens
- [ENHANCEMENT] LBAC: Optimize filtering when using single selector in LBAC policy by passing matchers to downstream querier.
- [ENHANCEMENT] Distributor: reduce latency when HA-Tracking by doing KVStore updates in the background.
- [ENHANCEMENT] Compactor: when sharding is enabled, skip already planned compaction jobs if the tenant doesn’t belong to the compactor instance anymore.
- [ENHANCEMENT] Compactor: Blocks cleaner will ignore users that it no longer “owns” when sharding is enabled, and user ownership has changed since last scan.
- [ENHANCEMENT] Query federation: improve performance in MergeQueryable by memoizing labels.
- [ENHANCEMENT] Querier / store-gateway: optimized regex matchers.
- [ENHANCEMENT] Query-frontend: added
cortex_query_frontend_non_step_aligned_queries_total
to track the total number of range queries with start/end not aligned to step. - [ENHANCEMENT] Compactor: added
-compactor.compaction-jobs-order
support to configure which compaction jobs should run first for a given tenant (in case there are multiple ones). Supported values are:smallest-range-oldest-blocks-first
(default),newest-blocks-first
(not supported bydefault
compaction strategy). - [ENHANCEMENT] Add option (
-querier.label-values-max-cardinality-label-names-per-request
) to configure the maximum number of label names allowed to be queried in a single<prefix>/api/v1/cardinality/label_values
API call. - [ENHANCEMENT] Make distributor inflight push requests count include background calls to ingester.
- [ENHANCEMENT] Store-gateway: added an in-memory LRU cache for chunks attributes. Can be enabled setting
-blocks-storage.bucket-store.chunks-cache.attributes-in-memory-max-items=X
whereX
is the max number of items to keep in the in-memory cache. The following new metrics are exposed:cortex_cache_memory_requests_total
cortex_cache_memory_hits_total
cortex_cache_memory_items_count
- [ENHANCEMENT] Store-gateway: log index cache requests to tracing spans.
- [ENHANCEMENT] Ingester: reduce CPU and memory utilization if remote write requests contains a large amount of “out of bounds” samples.
- [ENHANCEMENT] Ingester: reduce CPU and memory utilization when querying chunks from ingesters.
- [ENHANCEMENT] Querier: when fetching data for specific query-shard, we can ignore some blocks based on compactor-shard ID, since sharding of series by query sharding and compactor is the same. Added metrics:
cortex_querier_blocks_found_total
cortex_querier_blocks_queried_total
cortex_querier_blocks_with_compactor_shard_but_incompatible_query_shard_total
- [ENHANCEMENT] Querier&Ruler: reduce cpu usage, latency and peak memory consumption.
- [ENHANCEMENT] Overrides Exporter: Add
max_fetched_chunks_per_query
limit to the default and per-tenant limits exported as metrics. - [BUGFIX] License: Fixed initialization of AWS subscription manager so it creates a cluster object if not present when running GEM as AWS Marketplace product.
- [BUGFIX] Admin-API: Change the way per-instance limits are stored to avoid breaking changes between versions.
- [BUGFIX] Self-monitoring: Ensure system rules adhere to the sharding configuration of the rulers.
- [BUGFIX] Graphite: fixed
invalid label
error when querying metrics with dashes in the tags. - [BUGFIX] Authentication: Fix caching behavior to ensure tokens are eventually removed from the cache.
- [BUGFIX] Authentication: Enforce that instances must exist even when using wildcard access policies.
- [BUGFIX] Admin-API: Expose metrics
cortex_admin_api_clients
andcortex_admin_client_is_leader
for leader election correctly. - [BUGFIX] Limits: Fix the way
cortex_limits_admin_store_last_update_timestamp_seconds
is set to emit a correct UNIX timestamp. - [BUGFIX] Alertmanager: don’t replace user configurations with blank fallback configurations (when enabled), particularly during scaling up/down instances when sharding is enabled.
- [BUGFIX] Query-frontend: Ensure query_range requests handled by the query-frontend return JSON formatted errors.
- [BUGFIX] Query-frontend: don’t reuse cached results for queries that are not step-aligned.
- [BUGFIX] Querier: fixed UserStats endpoint. When zone-aware replication is enabled,
MaxUnavailableZones
param is used instead ofMaxErrors
, so settingMaxErrors = 0
doesn’t make the Querier wait for all Ingesters responses.
v1.5.1 – September 21st 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.5.1
(digest:sha256:079ed9d61a7ab0953afbfa76de8ab2d38d44ac17e630446bab4084b4aba0c2e4
)License: Grafana Labs license
Changelog
- [ENHANCEMENT] Add ADFS compatibility to our OIDC auth.
- [BUGFIX] Ruler: Use predictable names for Ruler WALs ensuring they are used after crashes and cleaned up.
v1.5.0 – August 24th 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.5.0
(digest:sha256:b0d98ffe49df461a524743a49dca26952a59c9c007231035e52f0a06e5003fff
)License: Grafana Labs license
Changelog
- [CHANGE] Alertmanager: allowed to configure the experimental receivers firewall on a per-tenant basis. The following CLI flags (and their respective YAML config options) have been changed and moved to the limits config section:
-alertmanager.receivers-firewall.block.cidr-networks
renamed to-alertmanager.receivers-firewall-block-cidr-networks
-alertmanager.receivers-firewall.block.private-addresses
renamed to-alertmanager.receivers-firewall-block-private-addresses
- [CHANGE] Memberlist: Expose default configuration values to the command line options. Note that setting these explicitly to zero will no longer cause the default to be used. If the default is desired, then do set the option. The following are affected:
-memberlist.stream-timeout
-memberlist.retransmit-factor
-memberlist.pull-push-interval
-memberlist.gossip-interval
-memberlist.gossip-nodes
-memberlist.gossip-to-dead-nodes-time
-memberlist.dead-node-reclaim-time
- [CHANGE] Authentication: Access Policy names passed via a JWT token in the OIDC auth flow will be downcased before being matched against Access Policies in GEM. This improves interoperability between GEM and other systems since GEM only allows lowercase characters in Access Policy names
- [CHANGE] Change default value of
-server.grpc.keepalive.min-time-between-pings
from5m
to10s
and-server.grpc.keepalive.ping-without-stream-allowed
totrue
. - [CHANGE] Changed
-alertmanager.storage.type
default value fromconfigdb
tolocal
. - [CHANGE] Changed
-ruler.storage.type
default value fromconfigdb
tolocal
. - [CHANGE] Cortex chunks storage has been deprecated and it’s now in maintenance mode: all Cortex users are encouraged to migrate to the blocks storage. No new features will be added to the chunks storage. The default Cortex configuration still runs the chunks engine; please check out the blocks storage doc on how to configure Cortex to run with the blocks storage.
- [CHANGE] Dependency: update go-redis from v8.2.3 to v8.9.0.
- [CHANGE] Deprecated the
bootstrap
target in favor of thetokengen
target. - [CHANGE] Enable strict JSON unmarshal for
pkg/util/validation.Limits
struct. The customUnmarshalJSON()
will now fail if the input has unknown fields. - [CHANGE] Graphite: proxy no longer generates generic metrics metadata. This helps to reduce ingestion rate as counted by Cortex and used for limits.
- [CHANGE] Ingester: Change default value of
-ingester.active-series-metrics-enabled
totrue
. This incurs a small increase in memory usage, between 1.2% and 1.6% as measured on ingesters with 1.3M active series. - [CHANGE] License: Flag
-bootstrap.license.path
has been deprecated in favor of-license.path
. - [CHANGE] Memberlist: the
memberlist_kv_store_value_bytes
has been removed due to values no longer being stored in-memory as encoded bytes. - [CHANGE] Querier / ruler: Change
-querier.max-fetched-chunks-per-query
configuration to limit to maximum number of chunks that can be fetched in a single query. The number of chunks fetched by ingesters AND long-term storare combined should not exceed the value configured on-querier.max-fetched-chunks-per-query
. - [CHANGE] Querier / ruler: deprecated
-store.query-chunk-limit
CLI flag (and its respective YAML config optionmax_chunks_per_query
) in favour of-querier.max-fetched-chunks-per-query
(and its respective YAML config optionmax_fetched_chunks_per_query
). The new limit specifies the maximum number of chunks that can be fetched in a single query from ingesters and long-term storage: the total number of actual fetched chunks could be 2x the limit, being independently applied when querying ingesters and long-term storage. - [CHANGE] Query-frontend: Enable query stats by default, they can still be disabled with
-frontend.query-stats-enabled=false
. - [CHANGE] Removed
configdb
support from Ruler and Alertmanager backend storages. - [CHANGE] Removed
log_messages_total
metric. - [CHANGE] Removed query sharding for the chunks storage. Query sharding is now only supported for blocks storage.
- [CHANGE] Renamed metric
deprecated_flags_inuse_total
asdeprecated_flags_used_total
. - [CHANGE] Renamed metric
experimental_features_in_use_total
asexperimental_features_used_total
. - [CHANGE] Some files and directories on local disk now have stricter permissions, and are only readable by owner, but not group or others.
- [CHANGE] The example Kubernetes manifests (stored at
k8s/
) have been removed due to a lack of proper support and maintenance. - [CHANGE] Update Go version to 1.16.6.
- [FEATURE] Added flag
-debug.block-profile-rate
to enable goroutine blocking events profiling. - [FEATURE] Alertmanager: Added
-alertmanager.max-config-size-bytes
limit to control size of configuration files that Cortex users can upload to Alertmanager via API. This limit is configurable per-tenant. - [FEATURE] Alertmanager: Added
-alertmanager.max-templates-count
and-alertmanager.max-template-size-bytes
options to control number and size of templates uploaded to Alertmanager via API. These limits are configurable per-tenant. - [FEATURE] Alertmanager: Added rate-limits to notifiers. Rate limits used by all integrations can be configured using
-alertmanager.notification-rate-limit
, while per-integration rate limits can be specified via-alertmanager.notification-rate-limit-per-integration
parameter. Both shared and per-integration limits can be overwritten using overrides mechanism. These limits are applied on individual (per-tenant) alertmanagers. Rate-limited notifications are failed notifications. It is possible to monitor rate-limited notifications via newcortex_alertmanager_notification_rate_limited_total
metric. - [FEATURE] Alertmanager: support negative matchers, time-based muting - upstream release notes.
- [FEATURE] Allow for reporting CPU time usage to AWS Marketplace metering service in case GEM is running as AWS Marketplace container product.
- [FEATURE] Collect and store CPU time usage reports in Admin store, which can later be used to submit to metering services, such as the AWS Marketplace API
- [FEATURE] Querier/Ruler: Added new
-querier.max-fetched-chunk-bytes-per-query
flag. When Cortex is running with blocks storage, the max chunk bytes limit is enforced in the querier and ruler and limits the size of all aggregated chunks returned from ingesters and storage as bytes for a query. - [FEATURE] Querier: Added new
-querier.max-fetched-series-per-query
flag. When Cortex is running with blocks storage, the max series per query limit is enforced in the querier and applies to unique series received from ingesters and store-gateway (long-term storage). - [FEATURE] Query Frontend: Add
cortex_query_fetched_chunks_total
per-user counter to expose the number of chunks fetched as part of queries. This metric can be enabled with the-frontend.query-stats-enabled
flag (or its respective YAML config optionquery_stats_enabled
). - [FEATURE] Query Frontend: Add
cortex_query_fetched_series_total
andcortex_query_fetched_chunks_bytes_total
per-user counters to expose the number of series and bytes fetched as part of queries. These metrics can be enabled with the-frontend.query-stats-enabled
flag (or its respective YAML config optionquery_stats_enabled
). - [FEATURE] Query Frontend: Add experimental querysharding for the block storage. You can now enabled querysharding for block storage (
-store.engine
) by setting-querier.parallelise-shardable-queries
totrue
. - [FEATURE] Ruler Storage: S3 header extensions were added to the new ruler storage S3 config block.
- [FEATURE] Ruler: Add new
-ruler.query-stats-enabled
which when enabled will report thecortex_ruler_query_seconds_total
as a per-user metric that tracks the sum of the wall time of executing queries in the ruler in seconds. - [FEATURE] When running GEM as AWS Marketplace container product then the Go runtime variable
GOMAXPROCS
is automatically set to match the container CPU quota, in case Kubernetes CPU resource limits are set. - [FEATURE] Alertmanager: The experimental sharding feature is now considered complete. Detailed information about the configuration options can be found here for alertmanager and here for the alertmanager storage. To use the feature:
- Ensure that a remote storage backend is configured for Alertmanager to store state using
-alertmanager-storage.backend
, and flags related to the backend. Note that thelocal
andconfigdb
storage backends are not supported. - Ensure that a ring store is configured using
-alertmanager.sharding-ring.store
, and set the flags relevant to the chosen store type. - Enable the feature using
-alertmanager.sharding-enabled
. - Note the prior addition of a new configuration option
-alertmanager.persist-interval
. This sets the interval between persisting the current alertmanager state (notification log and silences) to object storage. See the configuration file reference for more information.
- Ensure that a remote storage backend is configured for Alertmanager to store state using
- [ENHANCEMENT] Add Cassandra support.
- [ENHANCEMENT] Add timeout for waiting on compactor to become ACTIVE in the ring.
- [ENHANCEMENT] Added
tenant_ids
tag to tracing spans - [ENHANCEMENT] Added option
-distributor.excluded-zones
to exclude ingesters running in specific zones both on write and read path. - [ENHANCEMENT] Added zone-awareness support to alertmanager for use when sharding is enabled. When zone-awareness is enabled, alerts will be replicated across availability zones.
- [ENHANCEMENT] Admin-API: Add a new endpoint for returning product and feature information at /admin/api/v1/features
- [ENHANCEMENT] Admin-API: Allow admin-api to operate for read-only request when no license is present.
- [ENHANCEMENT] Alertmanager: Added
-alertmanager.max-alerts-count
and-alertmanager.max-alerts-size-bytes
to control max number of alerts and total size of alerts that a single user can have in Alertmanager’s memory. Adding more alerts will fail with a log message and incrementingcortex_alertmanager_alerts_insert_limited_total
metric (per-user). These limits can be overrided by using per-tenant overrides. Current values are tracked incortex_alertmanager_alerts_limiter_current_alerts
andcortex_alertmanager_alerts_limiter_current_alerts_size_bytes
metrics. - [ENHANCEMENT] Alertmanager: Added
-alertmanager.max-dispatcher-aggregation-groups
option to control max number of active dispatcher groups in Alertmanager (per tenant, also overrideable). When the limit is reached, Dispatcher produces log message and increasescortex_alertmanager_dispatcher_aggregation_group_limit_reached_total
metric. - [ENHANCEMENT] Alertmanager: Cleanup persisted state objects from remote storage when a tenant configuration is deleted.
- [ENHANCEMENT] Authentiation: OIDC integration now supports a JWT with multiple roles. When present, these roles will be rolled up into a “virtual” access policy that provides metrics read access to the union of instances contained in those roles.
- [ENHANCEMENT] Blocks storage: support ingesting exemplars and querying of exemplars. Enabled by setting new CLI flag
-blocks-storage.tsdb.max-exemplars=<n>
or config optionblocks_storage.tsdb.max_exemplars
to positive value. - [ENHANCEMENT] Distributor: Added distributors ring status section in the admin page.
- [ENHANCEMENT] Etcd: Added username and password to etcd config.
- [ENHANCEMENT] Expose CPU quota information (number of cores, cgroup quota) as Prometheus metrics.
- [ENHANCEMENT] Expose error counters and timestamps of CPU usage reporting as Prometheus metrics when AWS Marketplace meterting is enabled.
- [ENHANCEMENT] Expose value of GOMAXPROCS as Prometheus metrics.
- [ENHANCEMENT] Facilitate running GEM Docker image as a non-root user. Usage is documented in the Kubernetes deployment documentation.
- [ENHANCEMENT] Ingester: Added option
-ingester.ignore-series-limit-for-metric-names
with comma-separated list of metric names that will be ignored in max series per metric limit. - [ENHANCEMENT] Ingester: added option
-ingester.readiness-check-ring-health
to disable the ring health check in the readiness endpoint. - [ENHANCEMENT] License: Added flag
-license.type
that is used to specify that the APP is running through AWS Marketplace. - [ENHANCEMENT] License: Implemented
/licenses
endpoint that responds with static list of licenses that replaces default implementation if the APP is running through AWS Marketplace. - [ENHANCEMENT] License: Implemented logic to check if AWS Marketplace subscription is active instead of checking license file if the APP is running through AWS Marketplace.
- [ENHANCEMENT] Memberlist: expose configuration of memberlist packet compression via
-memberlist.compression=enabled
. - [ENHANCEMENT] Memberlist: optimized receive path for processing ring state updates, to help reduce CPU utilization in large clusters.
- [ENHANCEMENT] Node-API: Added TSDB block metadata to the exportable debug archive.
- [ENHANCEMENT] Node-API: Register a new endpoint for fetching a compressed debug file containing config and version information at /node/api/v1/debug-export.
- [ENHANCEMENT] Node-API: Register a new endpoint for fetching version information about the nodes at /node/api/v1/version.
- [ENHANCEMENT] Querier now can use the
LabelNames
call with matchers, if matchers are provided in the/labels
API call, instead of using the more expensiveMetricsForLabelMatchers
call as before. This can be enabled by enabling the-querier.query-label-names-with-matchers-enabled
flag once the ingesters are updated to this version. In the future this is expected to become the default behavior. - [ENHANCEMENT] Reduce memory used by streaming queries, particularly in ruler.
- [ENHANCEMENT] Ring, query-frontend: Avoid using automatic private IPs (APIPA) when discovering IP address from the interface during the registration of the instance in the ring, or by query-frontend when used with query-scheduler. APIPA still used as last resort with logging indicating usage.
- [ENHANCEMENT] Ruler: added
rule_group
label to metricscortex_prometheus_rule_group_iterations_total
andcortex_prometheus_rule_group_iterations_missed_total
. - [ENHANCEMENT] Scanner: add support for DynamoDB (v9 schema only).
- [ENHANCEMENT] Scanner: retry failed uploads.
- [ENHANCEMENT] Storage: Added the ability to disable Open Census within GCS client (e.g
-gcs.enable-opencensus=false
). - [ENHANCEMENT] Store-gateway: added
-store-gateway.sharding-ring.wait-stability-min-duration
and-store-gateway.sharding-ring.wait-stability-max-duration
support to store-gateway, to wait for ring stability at startup. - [ENHANCEMENT] Wildcard Datasource: Wildcard “*” datasources are now supported in datasource urls for GEM. This allows an action to have access to all instances in all access policies associated with the provided token. If that set of instances includes a wildcard “*”, then access is expanded to all instances in the cluster.
- [ENHANCEMENT] Added instrumentation to Redis client, with the following metrics:
cortex_rediscache_request_duration_seconds
- [ENHANCEMENT] Include additional limits in the per-tenant override exporter. The following limits have been added to the
cortex_overrides
metric:max_fetched_series_per_query
max_fetched_chunk_bytes_per_query
ruler_max_rules_per_rule_group
ruler_max_rule_groups_per_tenant
- [ENHANCEMENT] License Manager: Added functionality to regularly check the local license file and sync it to the license storage backend.
- Added metrics
grafana_labs_license_syncs_total
andgrafana_labs_license_sync_failures_total
.
- Added metrics
- [ENHANCEMENT] Ring: allow experimental configuration of disabling of heartbeat timeouts by setting the relevant configuration value to zero. Applies to the following:
-distributor.ring.heartbeat-timeout
-ring.heartbeat-timeout
-ruler.ring.heartbeat-timeout
-alertmanager.sharding-ring.heartbeat-timeout
-compactor.ring.heartbeat-timeout
-store-gateway.sharding-ring.heartbeat-timeout
- [ENHANCEMENT] Ring: allow heartbeats to be explicitly disabled by setting the interval to zero. This is considered experimental. This applies to the following configuration options:
-distributor.ring.heartbeat-period
-ingester.heartbeat-period
-ruler.ring.heartbeat-period
-alertmanager.sharding-ring.heartbeat-period
-compactor.ring.heartbeat-period
-store-gateway.sharding-ring.heartbeat-period
- [ENHANCEMENT] Alertmanager: introduced new metrics to monitor operation when using
-alertmanager.sharding-enabled
:cortex_alertmanager_state_fetch_replica_state_total
cortex_alertmanager_state_fetch_replica_state_failed_total
cortex_alertmanager_state_initial_sync_total
cortex_alertmanager_state_initial_sync_completed_total
cortex_alertmanager_state_initial_sync_duration_seconds
cortex_alertmanager_state_persist_total
cortex_alertmanager_state_persist_failed_total
- [ENHANCEMENT] Memberlist: introduced new metrics to aid troubleshooting tombstone convergence:
memberlist_client_kv_store_value_tombstones
memberlist_client_kv_store_value_tombstones_removed_total
memberlist_client_messages_to_broadcast_dropped_total
- [ENHANCEMENT] Ruler: added new metrics for tracking total number of queries and push requests sent to ingester, as well as failed queries and push requests. Failures are only counted for internal errors, but not user-errors like limits or invalid query. This is in contrast to existing
cortex_prometheus_rule_evaluation_failures_total
, which is incremented also when query or samples appending fails due to user-errors.cortex_ruler_write_requests_total
cortex_ruler_write_requests_failed_total
cortex_ruler_queries_total
cortex_ruler_queries_failed_total
- [BUGFIX] Graphite: Fix handling of consolidateBy and make aggregation method part of aggregation cache key.
- [BUGFIX] Alertmanager: fix Alertmanager status page if clustering via gossip is disabled or sharding is enabled.
- [BUGFIX] Authentication: fix handling of missing instances, or when instance has no matching access policy, by properly returning a 401 instead of crashing.
- [BUGFIX] Compactor: fixed panic while collecting Prometheus metrics.
- [BUGFIX] Graphite: Apply the max-points-per-req-hard limit correctly.
- [BUGFIX] Graphite: Fix race in index.json API endpoint which lead to incomplete results.
- [BUGFIX] HA Tracker: when cleaning up obsolete elected replicas from KV store, tracker didn’t update number of cluster per user correctly.
- [BUGFIX] Ingester: fix issue where runtime limits erroneously override default limits.
- [BUGFIX] Ingester: fixed infrequent panic caused by a race condition between TSDB mmap-ed head chunks truncation and queries.
- [BUGFIX] Ingester: fixed ingester stuck on start up (LEAVING ring state) when
-ingester.heartbeat-period=0
and-ingester.unregister-on-shutdown=false
. - [BUGFIX] Invalidate cached authentication tokens when they are deleted from object storage.
- [BUGFIX] Make multiple Get requests instead of MGet on Redis Cluster.
- [BUGFIX] Memberlist: fix to setting the default configuration value for
-memberlist.retransmit-factor
when not provided. This should improve propagation delay of the ring state (including, but not limited to, tombstones). Note that if the configuration is already explicitly given, this fix has no effect. - [BUGFIX] Purger: fix
Invalid null value in condition for column range
caused bynil
value in range for WriteBatch query. - [BUGFIX] Querier: Fix issue where samples in a chunk might get skipped by batch iterator.
- [BUGFIX] Querier: fix queries failing with “at least 1 healthy replica required, could only find 0” error right after scaling up store-gateways until they’re ACTIVE in the ring.
- [BUGFIX] Query-frontend: Fix 401s during
query_range
requests when enterprise authentication is used. The workaround involving disabling enterprise authentication on the querier can now be removed. - [BUGFIX] Ruler: Fix bug in rule forwarding with remote write which could cause filling up the disk because it was not truncated.
- New flags called
-ruler.remote-write.wal-truncate-frequency
,-ruler.remote-write.min-wal-time
and-ruler.remote-write.max-wal-time
have been added.
- New flags called
- [BUGFIX] Ruler: Honor the evaluation delay for the
ALERTS
andALERTS_FOR_STATE
series. - [BUGFIX] Ruler: fix
/ruler/rule_groups
endpoint doesn’t work when used with object store. - [BUGFIX] Ruler: fix startup in single-binary mode when the new
ruler_storage
is used. - [BUGFIX] Ruler: fixed counting of PromQL evaluation errors as user-errors when updating
cortex_ruler_queries_failed_total
. - [BUGFIX] Store-gateway: when blocks sharding is enabled, do not load all blocks in each store-gateway in case of a cold startup, but load only blocks owned by the store-gateway replica.
- [BUGFIX] Upgrade Prometheus. TSDB now waits for pending readers before truncating Head block, fixing the
chunk not found
error and preventing wrong query results.
v1.4.2 – July 21st 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.4.2
(digest:sha256:385b563669a5ba4a459f833a2c356884b757de719e43369ead0c5dc59cb11d94
)License: Grafana Labs license
Changelog
- [SECURITY] Prevent path traversal attack from users able to control the HTTP header
X-Scope-OrgID
. (CVE-2021-36157)- Users only have control of the HTTP header when GEM is configured with
flags
-auth.type=default
and-tenant-federation.enabled=false
- Users only have control of the HTTP header when GEM is configured with
flags
- [SECURITY] Update build image to use Go 1.16.6. (CVE-2021-34558) #1874
- [BUGFIX] Ruler: Register remote write metrics correctly. #1814
Upstream Cortex details
- Cortex Hash:
2210ebb7052a9efb99d0e4dc53043a3f5d806d00
v1.4.1 – June 29th 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.4.1
(digest:sha256:d1d17bfe2ec984b093b9da1ab8cdea1f764f24f16b38557d719254c4e64c9f9a
)License: Grafana Labs license
Changelog
- [BUGFIX] Update the GEM build image to use Alpine 3.14, python 3.9 and gsutil 4.52.
Upstream Cortex details
- Cortex Hash:
98dd0c4d69576fdfaf2b9bfd7aa475e835e11429
v1.4.0 – June 28th 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.4.0
(digest:sha256:ff38e0544d805bfd1450a1f033ed79585252a4444d247e0e4c649625619215ab
)License: Grafana Labs license
Changelog
- [CHANGE] Breaking: Verify token issuer when using OIDC authentication. Includes a breaking change for users of OIDC authentication. #1571
- Before this change the configuration of OIDC authentication required the OIDC provider’s
jwks_uri
to be set in the configuration flagauth.admin.oidc.url
. This flag has been deprecated. - A new flag named
auth.admin.oidc.issuer-url
has been added, and it must be set to the URL of the OIDC provider. For example:-auth.admin.oidc.issuer-url=https://accounts.google.com
Note: This is not simply a rename of the old flag; you also need to update the value. The defined issuer is required to provide the OIDC discovery endpoint (/.well-known/openid-configuration
)
- Before this change the configuration of OIDC authentication required the OIDC provider’s
- [CHANGE] Breaking: The GEM/GEL Ruler can now be accessed by access policies with rules read/write permissions, which are no longer metrics/logs specific #1366 & #1403
- Before this change, there were metric rule specific permissions
metrics:rules:read
andmetrics:rules:write
. - The data representation for this change in object storage is backwards compatible, so no change is needed for existing access policies using the new rules.
- The JSON representation for these rules is not backwards compatible, and so any JSON interactions with the API that specified the strings
metrics:rules:read
ormetrics:rules:write
must be updated to the stringsrules:read
andrules:write
respectively. - This breaking change applies to the GEM Plugin as well, so please update to version v3.0.X.
- Before this change, there were metric rule specific permissions
- [CHANGE] Remove
enterprise_features
config block entirely. #1453 - [CHANGE] Alertmanager: deprecated
-alertmanager.storage.*
CLI flags (and their respective YAML config options) in favour of-alertmanager-storage.*
. This change doesn’t apply toalertmanager.storage.path
andalertmanager.storage.retention
. - [CHANGE] Blocks storage: removed the config option
-blocks-storage.bucket-store.index-cache.postings-compression-enabled
, which was deprecated. Postings compression is always enabled. - [CHANGE] GEM now fails fast on startup if it is unable to connect to the ring backend.
- [CHANGE] Querier / ruler: deprecated
-store.query-chunk-limit
CLI flag (and its respective YAML config optionmax_chunks_per_query
) in favor of-querier.max-fetched-chunks-per-query
(and its respective YAML configuration optionmax_fetched_chunks_per_query
). The new limit specifies the maximum number of chunks that can be fetched in a single query from ingesters and long-term storage: the total number of chunks that are actually fetched, in the worst case, can be twice the limit because the limit is applied to ingesters as well as long-term storage. - [CHANGE] Query frontend: removed the configuration option
-querier.compress-http-responses
, which was deprecated. Instead, use-api.response-compression-enabled
. - [CHANGE] Runtime-config / overrides: removed the config options
-limits.per-user-override-config
(use-runtime-config.file
) and-limits.per-user-override-period
(use-runtime-config.reload-period
), both deprecated. - [FEATURE] Add embedded recording rules to the Enterprise Ruler to support building dashboards and
alerts from internal metrics written directly to GEM itself via a distributor. #1459
- To enable or disable the feature, use the
-instrumentation.enabled
flag or associatedenabled
setting on theinstrumentation
configuration block. The feature is disabled by default.
- To enable or disable the feature, use the
- [FEATURE] Add the ability to write internal metrics directly to GEM itself via a distributor. #1281
- To configure, or enabled or disabled the feature, user the
-instrumentation.enabled
flag and associated other flags or theinstrumentation
configuration block:The feature is disabled by default.instrumentation: enabled: false flush_period: 15s write_timeout: 10s distributor_client: address: dns:///:9095 connect_timeout: 5s tls_enabled: false tls_cert_path: tls_key_path: tls_ca_path: tls_server_name: tls_insecure_skip_verify:
- To configure, or enabled or disabled the feature, user the
- [FEATURE] Self-monitoring: expose filesystem usage metrics to source the disk utilization panel in the self-monitoring resource dashboards #1618
- [FEATURE] Add an experimental GEM component
federation-frontend
, which can be used to federate queries between multiple GEM clusters. #1274 - [FEATURE] Querier: Added new
-querier.max-fetched-series-per-query
flag. When GEM is running with blocks storage, the max series per query limit is enforced in the querier and applies to unique series received from ingesters and store-gateway (long-term storage). - [FEATURE] Querier/Ruler: Added new
-querier.max-fetched-chunk-bytes-per-query
flag. When GEM is running with blocks storage, the max chunk bytes limit is enforced in the querier and ruler and limits the size of all aggregated chunks returned from ingesters and storage as bytes for a query. - [ENHANCEMENT] Introduce configuration parameter to limit how many points we process per query. #1292
- [ENHANCEMENT] Adding API endpoints via which a user can post / get their storage schemas / aggregations. #1389
- [ENHANCEMENT] Admin-API: Listing mutable resources now includes a comma separated list of versions for those resources in the
ETag
header #1419 - [ENHANCEMENT] Admin-API: Updating a mutable resources now allows a wildcard value (
"*"
) to be passed as theIf-Match
header, which allows the updating of any current version #1449 - [ENHANCEMENT] The
/config
HTTP endpoint now also returns GEM specific options alongside regular Cortex configuration. #1380 - [BUGFIX] Fix LBAC regular expression matchers #1305
- [BUGFIX] Validate all fields of JWT tokens used for auth, except the issuer. #1500
- [BUGFIX] Ruler: ensure the S3 rule storage flags properly maps to the upstream flags. #1460
- [BUGFIX] Admin-API: rejecting update requests when access policies have empty scopes or realms. #1447
- [BUGFIX] Updated licenses are now persisted to object storage, fixing the responses from the license API which would show old license information. #1568
- [BUGFIX] Validate all fields of JWT tokens used for auth, except the issuer. #1500
- [BUGFIX] OAuth: Don’t use default access policy when an invalid JWT claim is provided. #1635
- [BUGFIX] Authentiation: Invalidate cached authentication tokens when they are deleted from object storage. #1703
Upstream Cortex details
- Cortex Hash:
98dd0c4d69576fdfaf2b9bfd7aa475e835e11429
- Cortex Commits
v1.3.1 – Jul 21st 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.3.1
(digest:sha256:e03a7ae061d5f617490812a6f45c6362fdc9ef79010555a207ebee2174ef9b23
)License: Grafana Labs license
Changelog
- [SECURITY] Prevent path traversal attack from users able to control the HTTP header
X-Scope-OrgID
. (CVE-2021-36157)- Users only have control of the HTTP header when GEM is configured with
flags
-auth.type=default
and-tenant-federation.enabled=false
- Users only have control of the HTTP header when GEM is configured with
flags
- [SECURITY] Update build image to use Go 1.16.6. (CVE-2021-34558) #1874
- [BUGFIX] Update the GEM build image to use Alpine 3.14, python 3.9 and gsutil 4.52. #1781
- [BUGFIX] Ruler: Register remote write metrics correctly. #1814
Upstream Cortex details
- Cortex Hash:
64592254fe91c86e903882947a58d572a316884d
v1.3.0 – April 26th 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.3.0
License: Grafana Labs license
Changelog
- [SECURITY] Alertmanager: Fix a local file disclosure vulnerability when
-experimental.alertmanager.enable-api
is used (CVE-2021-31231):- The HTTP Basic auth password_file can be used as an attack vector to send any file content via a webhook.
- The Alertmanager templates can be used as an attack vector to send any file content because the Alertmanager can load any text file specified in the templates list.
- [CHANGE] Admin API: Concurrent requests to the same resource are no longer allowed. If two requests are issued to create, update, or delete the same resource, then the first one to achieve a lock executes and the second one returns a conflict error. This is handled per process. To enforce this behavior on multiple processes, use leader election. #1186
- [CHANGE] Admin API: all errors encountered during the processing of HTTP requests are converted to GRPC errors in order to determine the correct HTTP status to return. This enforces consistency for leader election, because some requests are handled internally, and others are forwarded to other instances. #1217
- [CHANGE] Admin API: all mutation operations (
PUT
/DELETE
) now require anIf-Match
header to be set (an integer between""
such as"27"
) to verify that the correct version of the resource is being modified and prevent against race conditions. You can find the current version of a resource in theETag
header that is returned when that resource is read (viaGET
) or updated (viaPUT
). - [FEATURE] Admin API: you can set per-instance resource limits via the Admin API. This is enabled by default. #1173
- You can enable or disable this feature by using the
-admin-api.limits.enabled
or-admin-api.limits.refresh-period
flags. Also, you can configure this feature by using theadmin_api
configuration block:admin_api: limits: enabled: true refresh_period: 1m
- You can enable or disable this feature by using the
- [ENHANCEMENT] Upgrade build image to use Go 1.16.3. #1294
- [ENHANCEMENT] Admin client: Add
cortex_admin_client_is_leader
gauge metric to determine when the client considers itself the leader. #1175 - [ENHANCEMENT] Admin API: update an access policy via the Admin API using a
PUT
request. #1139 - [ENHANCEMENT] Admin API: Update an instance via the Admin API using a
PUT
request. #1180 - [ENHANCEMENT] Gateway: Forward
/multitenant_alertmanager/ring
and/ruler/ring
routes to thealertmanager
andruler
proxy backends. #1144 - [BUGFIX] Graphite: Fix aggregation cache to generate cache keys using correct input data. #963
- [BUGFIX] Authentication: Fix issue where all requests would trigger a panic if authentication is enabled but no admin client is configured. A error is now printed instead. #1106
Upstream Cortex details
- Cortex Hash:
2d8477c4a325ce5071676e906efcee4adb687513
- Cortex Commits
v1.2.1 – April 27 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.2.1
License: Grafana Labs license
Changelog
- [SECURITY] Alertmanager: Fix a local file disclosure vulnerability when
-experimental.alertmanager.enable-api
is used (CVE-2021-31231):- The HTTP Basic auth password_file can be used as an attack vector to send any file content via a webhook.
- The Alertmanager templates can be used as an attack vector to send any file content because the Alertmanager can load any text file specified in the templates list.
v1.2.0 – March 10 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.2.0
License: Grafana Labs license
Changelog
- [CHANGE] Gateway: Remove purger proxy configuration, which is not a supported target for blocks clusters.
- [CHANGE] Auth: Override authentication flags have been renamed:
- The
auth.override-admin-token
flag has been changed toauth.override.token
. - The
auth.override-admin-token-file
flag has been changed toauth.override.token-file
.
- The
- [FEATURE] Gateway: Improve the
gateway
target to support unique TLS configurations and write timeouts for each backend.- New fields have been added to allow for configuration:
gateway: proxy: default: tls: tls_cert_path: <string> tls_key_path: <string> tls_ca_path: <string> tls_insecure_skip_verify: <bool> distributor: read_timeout: <duration> write_timeout: <duration> tls: ...
- New fields have been added to allow for configuration:
- [FEATURE] Compactor: Introduced
time-sharding
compaction strategy.
- [ENHANCEMENT] Distributor: Wrap remote writes in distributor to sample and log them as business intelligence events.
- [ENHANCEMENT] Metrics emitted for TLS certificate expiration now reflect certificates being reloaded.
- [ENHANCEMENT] Remove the Graphite Auto Complete Index and use Cortex index instead.
- [ENHANCEMENT] Add Graphite API endpoint /metrics/index.json.
- [ENHANCEMENT] Distributor: Wrap remote writes in distributor to sample and log them as business intelligence events.
- [ENHANCEMENT] Call Cortex Distributor over gRPC from Graphite Write Proxy (formerly Graphite Distributor)
- [ENHANCEMENT] Admin API: Add feature to elect and admin-api leader instance to handle all mutation requests. Requests to non-leader instances are forwarded to the leader instance.
- New fields have been added to allow for configuration:
admin_api: leader_election: enabled: <bool> ring: kvstore: <kv.Config> heartbeat_period: <duration> heartbeat_timeout: <duration> tokens_observe_period: <duration> instance_interface_name: <[]string> client_config: <grpcclient.Config>
- [BUGFIX] LBAC: Fix issue where debug logs would not print the selector and instead print
selector="unsupported value type"
. - [BUGFIX] Admin-Client: Warning logs are no longer created on resource creation.
- [BUGFIX] Ruler: Fix issue where invalid remote-write URLs cause a panic.
- [BUGFIX] Querier: Apply label access filters on multi tenant access policies.
Upstream Cortex details
- Cortex Hash:
003eb33266ca464d7290a938a9d767c36b9a03a4
- Cortex CHANGELOG
v1.1.3 – April 27 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
Docker image: run
docker pull grafana/metrics-enterprise:v1.1.3
License: Grafana Labs license
Changelog
- [SECURITY] Alertmanager: Fix a local file disclosure vulnerability when
-experimental.alertmanager.enable-api
is used (CVE-2021-31231):- The HTTP Basic auth password_file can be used as an attack vector to send any file content via a webhook.
- The Alertmanager templates can be used as an attack vector to send any file content because the Alertmanager can load any text file specified in the templates list.
v1.1.2 – January 20 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
License: Grafana Labs license
Changelog
- [BUGFIX] Querier: fix default value incorrectly overriding
-querier.frontend-address
in single-binary mode.
v1.1.1 – January 14 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
License: Grafana Labs license
Changelog
- [BUGFIX] Ruler: Minimize gaps on rule evaluations with stale input and enabled ruler evaluation delay.
v1.1.0 – January 12 2021
Links
Binary (Linux AMD64)
Deb (Linux AMD64)
RPM (Linux AMD64)
License: Grafana Labs License
Changelog
[CHANGE] Admin-API: Resources must not be both prefixed and suffixed with the
__
characters. If any of your existing resources exist with this naming pattern, they must be deleted and recreated with a new name before upgrading.[CHANGE] Graphite: Allow storage schema and storage aggregation configs to be defined per tenant.
[CHANGE] Admin-Client: Instance management client calls no longer use object storage
Iter
calls when retrieving the latest version of a resource.[CHANGE] Graphite: Add API endpoints to explore the available Graphite functions.
[CHANGE] Admin: The selectors for label policies are now provided as PromQL label strings instead of typed objects.
Deprecated:
"label_policies": [ { "selector": [ { "name": "env", "value": "dev", "type": "EQ" } ] } ]
New:
"label_policies": [ { "selector": "{env=\"dev\"}" } ]
[CHANGE] Admin: Operations with an
ADMIN
scope are no longer restricted to operating on clusters they have as a configured realm.[CHANGE] Deprecate
enterprise_features
config section in favor of the Cortex config extension.Deprecated:
enterprise_features: ruler_s3_request_headers: file: <string> poll_interval: <duration> ruler_remote_write: enabled: <bool> wal_dir: <string>
New:
ruler: storage: s3: header_map_file_path: <string> header_map_poll_interval: <duration> remote_write: enabled: <bool> wal_dir: <string>
[FEATURE] Ruler: Alerts can now be correctly forwarded to the Alertmanager with enterprise authentication enabled by setting the basic authentication username to
__alertmanager__
and the password to a API token with access to every instance.[FEATURE] Queries: LBAC enforcement has been added for queries and label value requests.
- When GEM is run using the
default
authentication mode, LBAC policies are specified using theX-Prom-Label-Policy
HTTP header in the format:X-Prom-Label-Policy: <tenant-id>:urlEscaped(<prometheus label selector>)
. For example, a policy that only allows metrics with the labelenv
equal todev
for tenanttest-instance
could specified with the following header:X-Prom-Label-Policy: test-instance:%7Benv=%22dev%22%7D
. To specify multiple policies either set the header multiple times or set the header with a single string of multiple policies separated by an unescaped comma.
- When GEM is run using the
[FEATURE] Admin API: add
label_policies
field, which contains an array of label matchers to the access policy realm JSON.{ "realms": [ { "instance": "<string>", "cluster": "<string>", "label_policies": [ { "selector": [ { "type": "<enum: EQ | NEQ | RE | NRE>", "name": "<string>", "value": "<string>" } ] } ] } ] }
[FEATURE] Admin: Add target
tokengen
to generate tokens for the default or a custom access policy.[FEATURE] Admin: Added a default
__admin__
access policy that has anADMIN
scope. This policy can be disabled adding the following to the GEM configuration file.admin_client: disable_default_admin_policy: true
[FEATURE] Querier: Queries can be federated across multiple tenants. The tenants IDs involved need to be specified separated by a
|
character in theX-Scope-OrgID
request header.[FEATURE] Add
gateway
target that can be configured to proxy requests to microservices and can be used to load balance remote_write requests to the distributors.[ENHANCEMENT] AdminAPI: Add scope for read only admin access,
admin:read
.[ENHANCEMENT] AdminAPI: Add separate set of scopes for alerts and rules.
alerts:read
alerts:write
logs:rules:read
logs:rules:write
metrics:rules:read
metrics:rules:write
[ENHANCEMENT] Reduce allocations in Graphite Ingester, when ingesting untagged Graphite metrics.
[ENHANCEMENT] Serve Graphite /metrics/find requests by keeping track of all recent metrics in an in-memory index on the Ingesters to reduce latency.
[ENHANCEMENT] Add auxiliary Graphite API endpoints to explore tags and obtain auto-complete suggestions for the Grafana query editor.
[ENHANCEMENT] Admin API: add ClusterKind support for Logs & Traces.
[ENHANCEMENT] Admin API: add scopes for Logs.
[ENHANCEMENT] Admin: The bootstrap target no longer needs to be run before being able to start GEM with enterprise features. Every target will now try to perform bootstrapping on startup if it has not already been done. Failure to bootstrap will not prevent GEM running, but enterprise features will not be available.
[ENHANCEMENT] Add
grafana_labs_license_expiry_timestamp
metric to expose GEM license expiration as a UNIX timestamp, in seconds.[BUGFIX] Graphite: Fixing a bug in the request parsing of GET requests on the auto-complete endpoints.
[BUGFIX] Graphite: When ingesting datapoints resulting in out-of-order/out-of-bounds/duplicate-sample we need to return status 200 to prevent an indefinite loop.
[BUGFIX] Ruler: Fix issue where remote-write rule groups are created then immediately deleted when a rule group name contains the
/
delimiter character.
Upstream Cortex changes
- Upstream Cortex hash:
c3b8c46fd8fc9a2aa85accbe54cb00be2552dcd9
- Changes since last GEM release
v1.0.2 – October 16 2020
Links
Changelog
- [CHANGE] Update vendored Cortex from v1.4.0 to [v1.4.0-21bad5][21bad5]
- [BUGFIX] Fix potential panic due to writing into a closed chan in the graphite query executor.
- [ENHANCEMENT] Admin: Access policy create operations now enforce valid instance/cluster names for the realms configured on the access policy.
- [ENHANCEMENT] Add
-version
flag to GEM. - [FEATURE] Add config options to rate limit the LIST methods of buckets.
- [FEATURE] Adds the Graphite /render API endpoint, which can be used to query metrics with the Graphite query language.
- [FEATURE] Add config options to specify and poll files to inject arbitrary HTTP headers in requests to S3 for the admin and blocks client.
blocks_storage: s3: header_map_file_path: <path to header file> header_map_poll_interval: <duration string> admin_client: storage: s3: header_map_file_path: <path to header file> header_map_poll_interval: <duration string>
- [FEATURE] Adds the Graphite /metrics/find API endpoint, which can be used to obtain lists of metrics matching a given pattern (Grafana query editor auto-complete, dashboard variable population, etc).
- [FEATURE] Add a default access policy option for OpenID Connect tokens.
Upstream Cortex details
- Cortex Hash: [21bad57b346c730d684d6d0205efef133422ab28][21bad5]
- Cortex CHANGELOG
v1.0.1 – October 06 2020
Links
Upstream Cortex details
- Cortex Hash: 23554ce028c090a4a3413ac0e35e5e1dc9fa929f
- Cortex Version: 1.4.0
Changelog
- [CHANGE] Update vendored Cortex to v1.4.0.
v1.0.0 – September 17 2020
Links
Upstream Cortex details
- Cortex Hash: bb5fcc929832f7bd2a6c2df348b387abcb8b961e
- Cortex Version: 1.4.0-rc.0
Changelog
- [BUGFIX] Make config field names consistent.
- [CHANGE] Use Go 1.14.9 to build the project and cut
build-image@v0.1.3
.
v1.0.0-rc.2 – September 15 2020
Links
Upstream Cortex details
- Cortex Hash: c3a344784a0c8ce70ef2521f543033dee3dce6c6
- Cortex Version: 1.3.1
Changelog
- [BUGFIX] Admin API: Fix panic on start up for
admin-api
target.
v1.0.0-rc.1 – September 04 2020
Links
Upstream Cortex details
- Cortex Hash: 4f6e1e5c48ccad2c1988cf1d36ca522ae0c805ed
- Cortex Version: 1.3.1
Changelog
- [CHANGE] Admin-Client: The storage backend for the admin client no longer defaults to
s3
. Instead no default is set and the admin client will not start up unless a default is set. - [CHANGE] The following features will no longer be active unless GEM is started with access to a valid license.
- Admin API
- Ruler S3 auth headers
- Ruler API to configure remote write rule groups
v0.6.3 – August 20 2020
Links
Upstream Cortex details
- Cortex Hash: 2bda7b94
- Cortex Version: 1.2.1
Changelog
- [CHANGE] Auth: removed
auth.enable
flag and addauth.type
flag withdefault
andenterprise
options. - [FEATURE] Admin API: Add list endpoint for stored licenses.
v0.6.2 – August 04 2020
Links
Upstream Cortex details
- Cortex Hash: 6db67a4efbbf62b1133fa037a95382a21f752bbf
- Cortex Version: 1.2.1
Changelog
- [CHANGE] Ruler: S3 Headers are no longer protected by a license.