Release notes for Grafana Kubernetes Monitoring
Grafana Kubernetes Monitoring includes these elements, which all have their own releases:
- Frontend GUI of app (also called plugin)
- Backend mixin (also called integration), covering the alert and recording rules
- Helm chart for the default and other configurations of Kubernetes Monitoring - For release notes on the Helm chart, refer to Grafana Kubernetes Monitoring Helm chart release notes.
Frontend release notes
The following are release notes for the Kubernetes Monitoring GUI.
2.22.0
10/22/25
Type | Description |
---|---|
Feature | Jobs: Added jobs and cron jobs monitoring. |
Enhancement | Filtering: Enabled column filtering on all tables. |
Enhancement | Activation: Changed name of activation tab to Manage app. |
Fix | Workbench: Removed filter from columns with a checkbox. |
Fix | Node table queries: Used node filter for accuracy. |
Fix | Column headings: Removed incorrect units. |
2.20.0
8/29/25
Type | Description |
---|---|
Enhancement | Integrations: Added a Kubernetes filter to the Connections page. |
Enhancement | Configuration: Added the ability to activate Kubernetes Monitoring. |
2.19.8
8/21/25
Type | Description |
---|---|
Enhancement | Integrations: Removed integrations from the Configuration page. |
2.19.7
8/19/25
Type | Description |
---|---|
Enhancement | Configuration: Removed the selection in configuration to switch on Application Observability now that users can activate it from the product. |
8/5/25
Type | Description |
---|---|
Enhancement | Asserts: Added new term ‘insights’ instead of ‘assertions’. |
Fix | OTel: Updated the cluster events query to improve support with OpenTelemetry Collectors. |
Fix | Scrape intervals: Removed hard-coded time interval so that panels reflect custom scrape interval. |
2.19.5
7/25/25
Type | Description |
---|---|
Enhancement | Asserts: Added health indicator icons to lists of components. |
Fix | Pod CPU and memory: Reverted fix from 2.19.3 due to bugs it was causing. |
Fix | Container memory: Fixed container memory panels to aggregate correctly. |
Fix | Cluster list: Corrected the number of nodes for a cluster in the info section. |
2.19.3
7/16/25
Type | Description |
---|---|
Enhancement | Cost overview: Added ability to jump to list of Clusters from Cost Overview page. |
Enhancement | Integrations: Added a message on the Integrations page that integrations will soon be accessible from the Connections page. |
Fix | Pods: Include all Pods if no pod value specified and use podName value if available on Pod CPU and Memory queries to prevent no data from showing. |
2.19.2
7/3/25
Type | Description |
---|---|
Feature | GPU: Added GPU tabs and panels to Cluster and Node pages. |
Feature | EC2 link: Added link to navigate between EC2 instance in Cloud Provider Observability and Kubernetes Monitoring. |
Enhancement | Cost: Added tooltips for cost panels to clarify underlying calculations and their time frame. |
Enhancement | User experience: Streamlined descriptions of panels for clarity and brevity. |
Fix | Storage: Changed inode values to percentages. |
Fix | Workloads: Fixed query to ensure all barepods workloads are listed, whether desired Pods or ready Pods. |
Fix | CPU and memory tabs: Corrected the calculation for the Efficiency Usage/Requests panel on the CPU and Memory tabs of the Node detail page. |
Fix | Argo Rollout Pods: Removed duplicate query to prevent incorrect count. |
Fix | Container page: Fixed tab links to navigate to appropriate pages. |
2.19.1
6/11/25
Type | Description |
---|---|
Enhancement | Serverless and Windows Nodes: Added support for showing Node memory usage on platforms that cannot use Node Exporter (such as GKE Autopilot and AWS EKS Fargate), and Windows. |
Enhancement | Logs table: Added the ability for a user to rollover a line in the logs table and click an icon to see the full log message. |
Fix | Queries: Fixed queries for Cluster names with a . , which causes results to be empty. |
Fix | Non-Kubernetes Nodes: Prevented the app from selecting Nodes that are not Kubernetes Nodes to be counted and displayed. |
2.19.0
6/4/25
Type | Description |
---|---|
Enhancement | Memory: Added a memory tab and panels to Node, Namespace, Workload, Pod, and Cluster detail pages. |
Enhancement | Workloads: Added the capability to view nonstandard workloads that contain cronjobs, bare pods, static pods, Argo Rollouts, Strimzi Pod sets, and others. |
Enhancement | Logs: Created a table format to display logs and events. |
2.18.0
5/21/25
Type | Description |
---|---|
Enhancement | CPU tabs: Added CPU tabs to the Pod, Workload, and Namespace detail pages. |
Enhancement | GKE Autopilot and AWS EKS Fargate: Used Kubelet Resource Metrics (node_cpu_usage_seconds_total ) to show CPU usage for platforms that cannot use Node Exporter, including GKE Autopilot and AWS EKS Fargate. |
Fix | Link to Pod: Fixed ability to go to a specific pod from the Network Bandwidth by pod panel on the Node details page on the Network tab. |
Fix | URL: Fixed variables not synching with URL. |
Fix | Metrics status page: Corrected spacing between panels. |
Fix | DNS: Added a . to the end of hostnames to prevent DNS confusion. |
Fix | OTLP: Included the OTLP Gateway destination in the Kubernetes Helm chart to route metrics, logs, and traces using the OTLP format. |
Fix | Outlier detection: Changed Outlier detection to use podNames to prevent incorrect counts of outlier Pods. |
2.17.0
5/6/25
Type | Description |
---|---|
Feature | CPU details: Added CPU tab for Node details page and Cluster details page. |
Fix | Table headers: Made table headers visible for light mode. |
Fix | Pods: Fixed Pods with the same name on different Clusters to be differentiated from each other. |
Fix | Table columns: Make it more apparent table columns can be resized. |
Fix | Data accuracy: Ensure that CPU and memory requests only look at active pods, not completed or failed. |
Fix | Terraform files: Ensure use of strings for usernames in Terraform files to ensure compatibility for Helm chart values. |
2.16.0
4/23/25
Type | Description |
---|---|
Enhancement | Storage: Added IOPS and throughput panels on all Storage tabs. |
Fix | Cost: Prevented duplicate cAdvisor scrape jobs from affecting cost allocation. |
Fix | Alerts: Fixed namespace drop-down menu on Alerts page from not being populated. |
Fix | Pod names: Corrected formatting of Pod names when using Explore from Pod lists. |
Fix | Memory usage: Prevented completed and pending pods from affecting memory usage queries on the Pod details page. |
2.15.1
4/17/25
Type | Description |
---|---|
Enhancement | Alerts: Added capability to filter alerts on Alert page by Node. |
Enhancement | Outlier detection: Outlier detection for memory automatically runs when you visit a workload details page for multi-Pod workloads, and links to Explore to show the outlier Pod(s) using more memory. |
Fix | Integration: Correct the configuration snippet for the Core DNS integration. |
2.15.0
4/14/25
Type | Description |
---|---|
Feature | Outlier detection: Outlier detection automatically runs when you visit a workload details page for multi-Pod workloads, and links to Explore to show the outlier Pod(s) using more CPU. |
Enhancement | Detail pages: CPU and memory columns and prediction buttons for each row have been added to the Cluster, workload, and namespace detail pages. |
Fix | Events and logs status check: Fixed status check and Explore links for events and logs on the Metrics status tab. |
Fix | Overview page: Fixed the All alert links on the home page. |
Fix | Overview page: Corrected the workload count on the overview page to more closely correlate to the workload count shown on the workloads list page. |
Fix | Fleet Management: Fixed reference to Fleet Management Terraform variables. |
2.14.1
4/2/25
Type | Description |
---|---|
Enhancement | Performance: Decreased the training and inference latency of the machine learning functionality to show results quicker. |
Enhancement | Alerts: Added filtering by alert name. |
Enhancement | Overview page: After filtering by Cluster or namespace, you can click on the count of Clusters, Nodes, namespaces, or workloads and show only those objects within the scope of the filter. |
Enhancement | User experience: On Workload and Namespace list pages, made the Cluster names clickable and added graphs to the counts. |
Fix | Workload counts: Corrected calculations on workload counts on the Overview page. |
Fix | List pages: Restored the alerts table column. |
Fix | Alloy logs instance: Set the Pod security context for the Alloy logs instance on OpenShift. |
Fix | OpenShift: Corrected the port names for OpenShift so that Alloy can discover targets to scrape. |
Fix | EKS Fargate: Removed mounting of /var/log and /var/lib/docker/containers. |
2.14.0
3/24/25
Type | Description |
---|---|
Feature | Troubleshooting: Added a debug panel showing the source of a panel’s metrics, available from the panel menu. |
Enhancement | Performance: Added a view for each list page when queries extend beyond performance capabilities which degrade page loading. This view includes a message to search or filter for specific components. |
Fix | Performance: Stopped unrelated panels from reloading when the logs data source is changed. |
Fix | Accuracy: Stopped counting containers that are not running, which caused CPU limits and requests to appear higher than they should. |
Fix | Platforms: Disabled unnecessary components for specific platforms: nodelogs on Fargate, and profiling, Beyla, and Windows Exporter on Autopilot. |
2.13.2
3/18/25
Type | Description |
---|---|
Fix | Fleet Management: Fixed interaction with Fleet Management so that Alloy installation shows positive on Metrics Status page. |
Fix | Fleet Management: Changed configuration to store secret for Fleet Management in secret instead of environmental variable. |
2.13.1
3/13/25
Type | Description |
---|---|
Fix | Removed stabilityLevel: public-preview in relation to Alloy from the code generated on the Configuration page. |
Fix | Prevented a race condition causing HTML error messages to appear. |
Fix | Asserts: Restored RCA workbench linking. |
2.13.0
3/9/25
Type | Description |
---|---|
Feature | Configuration: Added toggle in easy deploy for users to add Fleet Management for management of Alloy deployments. |
Enhancement | Cluster detail page: Added buttons to allow users to navigate to workloads and namespaces within the cluster. |
Fix | Memory queries: Switched to Working Set Size (WSS) instead of Resident Set Size (RSS) to increase accuracy of memory statistics and match kubectl and Linux Out-of-Memory behavior. |
2.12.3
2/20/25
Type | Description |
---|---|
Fix | Configuration and status tabs: Fixed issue of configuration state not persisting when user switches between the tabs. |
Fix | URL stability: Maintain URL on refresh of page. |
Fix | Integrations: Updated integrations to work with Kubernetes Monitoring Helm chart version 2. |
2.12.2
2/6/25
Type | Description |
---|---|
Fix | Workload details page: Fixed TypeError to prevent unsafe access to data source. |
2.12.1
2/6/25
Type | Description |
---|---|
Fix | Receiver URLs: Updated URLs to match the version 2 Alloy deployment. |
Fix | Tempo and Logs: Updated configuration to fix Tempo protocol and autopilot for logs. |
2.12.0
2/5/25
Type | Description |
---|---|
Feature | Storage: Added storage tabs and graphs to Cluster, Node, workload, namespace, and Pod detail pages. |
Enhancement | Configuration: Updated the easy deployment configuration page to use Helm chart v2. |
Enhancement | Recording rules: Added the install/uninstall button to the Workload Recording Rule status for the cloud-provisioned hosted metrics data source. |
Fix | Prediction button: Disabled if the time range selected is less than two hours. |
Fix | Data source drop-down: Removed machine learning metrics data source from the drop-down menu. |
Fix | Data source drop-down: Added verification that the data source is valid prior to displaying in the drop-down menu. |
Fix | Workload list page: Fixed to not display all columns if filters result in more than 1,000 results. |
2.11.2
1/15/25
Type | Description |
---|---|
Fix | Time range: Ensured the time range selected in Kubernetes Monitoring is transferred to Application Observability when the View application layer button is clicked. |
Fix | Performance loading: Resolved issues related to rerendering of pages. |
Fix | Filters: Ensured when time range is changed that all filters are not returned to the default. |
2.11.1
1/9/25
Type | Description |
---|---|
Enhancement | User experience: Added the tool tip to the Copy icon. |
Fix | Node count: Fixed incorrect Node count on Kubernetes Overview page due to empty node labels. |
Fix | Performance loading: Improve loading performance of tables in detail pages. |
Fix | Memory data: Rollback to the original recommendation for memory requests to equal memory limits. |
2.11.0
1/6/25
Type | Description |
---|---|
Enhancement | Performance: If Workload list page result is over 1,000 workloads, the page will load quickly but no usage or cost data will show. A message will display suggesting the list be filtered, and include a link to search. |
Fix | Metrics status: Detect duplicate container metrics. |
Fix | Data source: Filter the data source drop-down menu to be limited to Grafana Cloud data sources. |
2.10.0
12/9/24
Type | Description |
---|---|
Feature | Performance: Improve performance by performing a code split. |
Fix | Loop error: Fixed a history loop error and page breaking. |
2.9.0
11/25/24
Type | Description |
---|---|
Feature | Search: Added the ability to search for any Kubernetes object. |
Fix | Metrics status: Show the status checks even if no clusters were detected. |
2.8.8
11/12/24
Type | Description |
---|---|
Fix | Erroneous no data error: Added a null check to prevent the no data message from displaying due to UID not found for a database. |
2.8.7
11/6/24
Type | Description |
---|---|
Fix | Asserts: Fix to ensure Kubernetes Monitoring is aware Asserts is enabled. |
2.8.6
11/5/24
Type | Description |
---|---|
Fix | Asserts: Fix the time range so that it is synchronized between Kubernetes Monitoring and Asserts. |
2.8.5
10/30/24
Type | Description |
---|---|
Fix | Updated link to the Application Observability GUI. |
Fix | Properly align Cluster columns. |
2.8.4
10/17/24
Type | Description |
---|---|
Enhancement | User experience: Created Cluster, namespace, and Pod columns for alerts on home page. |
Enhancement | User experience: On the Metrics status tab, added the ability to select the status of all or many Clusters. |
Fix | Instead of displaying no data on the home page should an outage occur, the home page will show historical data prior to the outage and current data post-outage. |
Fix | GUI: Fixed various anomolies with the GUI, including the location of the navigation drop-down menu and the width of certain columns. |
2.8.3
9/23/24
Type | Description |
---|---|
Fix | Alerts: Hide panel menu item for creating alerts if a user is not an administrator. |
2.8.2
9/12/24
Type | Description |
---|---|
Enhancement | User experience: Restricted the namespace field for the quick configuration process to only valid characters. |
Enhancement | User experience: Added the link to the Cluster below the title on detail pages. |
Enhancement | Troubleshooting: Added the Helm chart to the Metrics status page. |
Enhancement | User experience: Added waiting reasons and colors for Pods that are either waiting due to startup or due to a problem. |
Enhancement | Troubleshooting: Added the waiting reason for Pods that are waiting instead of running. |
Enhancement | Alerts: Added the ability to create an alert item from a panel. |
Fixes | Fixed the container memory limit on Container detail pages to be accurate. Ensured logs and events are not listed when there are no Pods. Fixed ability to remove a ’no data’ filter when a time range is changed. |
2.8.1
8/25/24
Type | Description |
---|---|
Feature | New platform: Added Azure AKS as a separate platform for configuration. |
Enhancement | Metrics status: Added statuses for recording rules and KSM job label. |
Enhancements | User experience: Moved up the required backend installations step to prevent misconfigurations. Restricted integrations view to administrators only and improved message to non-administrative user. Added links to documentation on Overview, Cost, Alerts, and Configuration pages. Adjusted the width of filters and moved copy button closer to item for ease of use. Refresh home page when data starts being sent. |
Enhancement | Code snippets for integrations: Added configuration code snippets for Apache CouchDB, Apache Hadoop, Apache Tomcat, and IBM DB2. |
Fix | CPU statistics: Fixed issue that was causing the memory legend to show in relation to CPU stats. |
2.8.0
8/6/24
Type | Description |
---|---|
Feature | Energy use: Added energy metrics by using third-party Kepler open source software. |
Fix | Duplicates: De-duplication of Pods and containers. |
2.7.0
7/29/24
Type | Description |
---|---|
Enhancement | User experience: Added a tab for logs and events on every detail page, including the Cluster detail page. |
Enhancement | Integrations: Added snippets for the Alloy and Varnish integrations. |
Enhancement | Logs: Made log queries work with OTel-style labels. |
Fixes | Correct time range format on home page. Ensured Kubernetes Monitoring opens in correct time range from Asserts. |
2.6.1
7/16/24
Type | Description |
---|---|
Enhancement | Alerts: Added the ability to click on the alert name from the Kubernetes Overview page. |
Enhancement | Integrations: Added configuration snippets for Squid and Apache Mesos. |
Fix | Prevented continuous rendering of data in the Safari browser. |
2.6.0
7/10/24
Type | Description |
---|---|
Feature | Home page: Upgraded the home page to better show the status of healthy Kubernetes objects as well as any potential issues. |
Feature | Asserts button: Added Assertions button to details pages. |
Enhancement | User experience enhancements: Added quick ways to navigate within the app, including the drop-down menu for all main pages, the overview and network tabs, deep linking from the home page CPU and memory graphs to the Cluster, improved Helm chart configuration process, and visual tags to indicate the type of Kubernetes object for each view. |
2.5.0
6/25/24
Type | Description |
---|---|
Feature | Network panels: Added Network Bandwidth and Network Saturation data. |
Feature | Cost overview panels: Added a 90-day view of total compute cost, average cost per Pod, and average Pod count on the Cost Overview page. |
Enhancement | OpenTelemetry: Added support for displaying events coming from the OpenTelemetry Kubernetes Events Receiver. |
Fix | Workloads with same name: Corrected CPU and memory usage data for workloads with the same name but in different Clusters. |
2.4.0
6/18/24
Type | Description |
---|---|
Feature | Deleted objects: Added the ability to find deleted Clusters, Nodes, Pods, containers, workloads, and namespaces. |
Enhancement | User experience: Improved “no data” messages on home page. |
Enhancement | Metrics: Host metrics generation. |
Fix | Error messages: Fixed query to prevent an invalid error message. |
Fix | Update host metrics switch language and disable CRDs on OpenShift. |
2.3.5
6/12/24
Type | Description |
---|---|
Enhancement | Integration: Added Apache Airflow configuration snippets for integration. |
Fix | Duplicates: Removed duplicate data showing on home page. |
Fix | Alerts: Improved alert count accuracy for certain kinds of Pod types. |
2.3.4
6/7/24
Type | Description |
---|---|
Enhancement | Added time range selector to home snapshot page. |
2.3.3
6/3/24
Type | Description |
---|---|
Enhancement | Integrations: Added configuration snippets for the integrations Wildfly and OpenSearch. |
Enhancement | Performance: Improved load time for queries spanning longer time ranges. |
Fix | Refresh and Cluster filters: Fixed so refresh does not cause filters to be removed. |
Fix | Cardinality page: Ensured the Series per namespace column data link is accurate. |
Fix | Documentation links: Updated and corrected documentation links on Kubernetes Monitoring main page (that is shown prior to configuring). |
Fix | Workload type: Ensured different workloads that are similarly named accurately show the type of workload and the data for the workload. |
2.3.2
5/21/24
Type | Description |
---|---|
Enhancement | Performance: Improved loading performance of CPU and memory usage tables. |
Fix | Added a null check for provider to avoid errors on Cluster list page. |
2.3.1
5/16/24
Type | Description |
---|---|
Enhancement | Integration: Added configuration snippets for the Apache Solr integration. |
Fix | Fixed Helm chart integration snippets to be compatible with Alloy. |
Fix | Fixed recording rule to include ReplicaSets, static Pods, and bare Pods so that missing workloads now show and incorrectly labeled workloads are now labeled correctly. This also fixes workload usage and cost data. |
2.2.5
4/25/24
Type | Description |
---|---|
Enhancement | Static Pod and Pod type: Added the static Pod label to all static Pods and the Pod label to bare/unmanaged Pods . |
Enhancement | Integrations: Added flow configuration snippets for Velero and MS SQL Server integrations. |
Fix | Fixed link to alert rule after automatic refresh. |
2.2.4
4/22/24
Type | Description |
---|---|
Enhancement | Data source persistence in URL: Persisted the Loki data source in the URL so that accurate links can be shared. |
Fix | Added check to prevent query errors. |
2.2.3
4/18/24
Type | Description |
---|---|
Feature | Refresh control: Added refresh and auto-refresh choices to all pages. |
2.2.2
4/17/24
Type | Description |
---|---|
Feature | Alerts: Added the Alerts column to the Nodes and Cluster list pages. |
Enhancement | URL data source persistence: Persisted Prometheus data source in the URL so that accurate links can be shared. |
Enhancement | Integration: Added configuration snippets for OracleDB integration. |
2.2.1
4/15/24
Type | Description |
---|---|
Feature | Alloy update: Replace Grafana Agent with Grafana Alloy. |
Feature | Cardinality: Added the Cardinality page to replace the Kubernetes Cardinality dashboard. |
Feature | Alerts: Added the Alerts column to these pages: Pod detail, Pod list, and Workloads list. |
Enhancement | Integration: Added configuration snippets for TensorFlow Serving integration. |
Enhancement | User experience: Added the ability to select multiple clusters for filtering. |
Fix | Removed logs panel from Node detail page for future refactoring. |
Fix | Corrected improper indent in configuration snippet of IBM MQ integration. |
Fix | Added kube_deployment as well as kube_replicaset to ensure all workload types are collected. |
2.2.0
4/8/24
Type | Description |
---|---|
Fix | Refactored workloads to use more efficient queries. |
2.1.3
4/4/24
Type | Description |
---|---|
Enhancement | Added the workload type when available to filter the list of Pods to improve search performance. |
Fix | Enhanced the multi-line regular expression to support a broader collection of logs. |
2.1.2
4/2/24
Type | Description |
---|---|
Fix | Reverted from support for OTel log labels. |
2.1.1
4/2/24
Type | Description |
---|---|
Feature | Added interactivity with Sift automated machine learning investigation tool. |
Enhancement | Added configuration snippets for the Couchbase and IBM MQ integrations. |
Enhancement | Added support for OTel log labels. |
Fix | Combined target_info with traces_target_info metrics to ensure all applications are found. |
2.1.0
3/25/24
Type | Description |
---|---|
Enhancement | Configuration: Added configuration snippets for the PgBouncer integration. Added link to extraConfig information in the Helm documents from the code snippet areas of an integration. |
Enhancement | Dashboards: Removed dashboards due to Angular deprecation. |
Enhancement | Messaging: Improve wording of default messages when a data source is updated for a view. |
Fix | Persist Cluster URL values to correct broken links. |
2.0.4
3/13/24
Type | Description |
---|---|
Feature | Asserts workbench: Added RCA Workbench button to allow navigating to Asserts. |
Enhancement | Explore: Expanded the explore feature to the list of all Pods. Improved the user experience for exploring data. |
Fix | Removed inaccurate timestamp for namespace creation on Namespace details page. |
Fix | Corrected the query for phase state, which is shown by expanding a list item on the Namespaces page. |
Fix | Add the http prefix to OTLP endpoints on the Cluster configuration tab when OTel receivers are selected. |
Fix | Persisted the data source between views. |
2.0.3
3/6/24
Type | Description |
---|---|
Feature | Added the capability to navigate to Application Observability. |
Feature | On the Cluster list page, you can click the Explore button to view the queries used in each column of the usage and cost data. This enables you not only to understand the data, but debug any issues or create your own queries or dashboards |
Feature | Added support for the Istio integration. |
Feature | Added a warning icon and tooltip on column headers of lists to indicate error detection and query failure. |
Feature | With the easy configuration using the Helm chart, added the capability to enable OTel and Zipkin receivers. |
Enhancement | Added the filter toolbar to list tables for better user experience. |
Enhancement | Added duplicate metrics detection to Node Exporter, Windows Exporter, OpenCost, cAdvisor, and kubelet. When there is an error, a yellow warning icon appears next to the item on the Metrics status page. |
Enhancement | Refined queries for “OOMKilled” containers to prevent counting of duplicate containers, specifically when a container is being killed and another is being created in its place. |
Enhancement | The link to examine an alert opens in a new tab so that context within Kubernetes Monitoring is not lost. |
Enhancement | Updated URL so that any shared URL includes any variables and the time range selected. |
Enhancement | Release note text in the app links to the release notes in documentation. |
Enhancement | Breadcrumbs now include page tab names. |
Enhancement | Corrected the query for unallocated resources (PV) panel. |
Fix | Created an additional check for deep links related to breadcrumbs display. |
Fix | Refined queries to ensure de-duplication of metrics, including removing reliance of uid label and having node/instance joins also use the Cluster label. |
Fix | Corrected queries for the Explore button on the Workloads page. |
Fix | Fixed CPU usage prediction graph to display correctly within window. |
2.0.2
2/15/24
Type | Description |
---|---|
Enhancement | Refined de-duplication of KSM kube_node_status_capacity , and added support for node with or without trailing port numbers. |
Fix | Created more accurate cost calculations in the case of mixed node and instance labels. |
2.0.1
2/15/24
Type | Description |
---|---|
Enhancement | Added the workloads column to the list of namespaces. |
Fix | Corrected the calculation for CPU average queries. |
2.0.0
2/6/24
Type | Description |
---|---|
Feature | The Alerts page provides a centralized location to view all alerts related to your infrastructure and the applications running within it. Graphs show alerts firing by Cluster and namespace, as well as by alert severity. The alert name and severity also display on the home page in the list of Pods in trouble. On the Namespaces and Workloads tabs, a column for each item shows the amount of firing alerts. |
Feature | The ClickHouse integration is available for use with Kubernetes Monitoring. |
Enhancement | On the home page, you can click the Cluster count to navigate directly to the list of Clusters on the Clusters tab, and click the Node count to navigate directly to the list of Nodes on the Nodes tab. |
Enhancement | On usage cost/efficiency tables, the tables are sorted by default alphabetically by the name of the item (Cluster, namespace, and so on). |
Enhancement | To improve loading performance, the namespaces, workloads, and Nodes lists are filtered by default by the first Cluster in the filter list. |
Enhancement | You can click the Explore button on any item on the Metrics status tab to troubleshoot configuration issues and customize queries for status. |
Enhancement | To prevent deployment failure due to timeout, increased the timeout for the Helm chart to five minutes. |
Enhancement | An informational banner displays on the Dashboards page to indicate the prebuilt dashboards are being deprecated and removed, with the exception of the Cardinality dashboard. |
Fix | Resolved a duplicate series error on the container details page. |
Fix | Improved the status panel failure detection for kube-state-metrics. |
Fix | When you click Start Sending Data, the Configuration page displays. |
1.7.0
1/22/24
Type | Description |
---|---|
Feature | You can view usage and cost graphs and data on the Pod details page. |
Feature | When using EKS on Fargate, Pod logs are made available by using the Kubernetes API to gather them. |
Feature | Added or updated support for these integrations: Aerospike, ActiveMQ, Cassandra, Apache HTTP Server Project, cert-manager, ClickHouse, CoreDNS, GitLab, Kafka, MongoDB, NGINX, and PrestoDB . |
Enhancement | On the Configuration page, you can add an expiration date to the token and view the scope of the token. |
Fix | Corrected the following broken links: workload links on the Pod list of the Node details page, Cluster status link on the Configuration page, and links when Cluster name contains a slash character. |
1.6.2
1/10/24
Type | Description |
---|---|
Enhancement | User experience enhancements: The home page displays the home page name on the breadcrumbs, and filter options display alphabetically. |
Enhancement | You can copy any object in the table lists by clicking the copy button within the row. |
1.6.1
12/19/23
Type | Description |
---|---|
Feature | IBM Cloud is available as a platform that you can select when using the configuration wizard. |
Enhancement | You can choose Loki from the data source selector in all detailed views. |
Enhancement | A link to documentation for understanding CPU limits is available on detail pages showing CPU limits set. |
Enhancement | All IEC units show in standard form. |
Enhancement | Throughout Kubernetes Monitoring, you have the copy icon available to copy any text so that you can use it in your own query, use it in the command line, send it to others, and so on. |
Fix | Fixed empty Cost panels for customers who use forward slashes in Cluster names by changing to pipe formatter. |
Fix | Corrected inconsistencies in the number of Nodes displayed by using an instant query for any single value result instead of using a range query. This improves app performance and data accuracy. |
Fix | Fixed accuracy of CPU usage data on the Node details page. |
Fix | Improved accuracy of machine learning prediction by using query variables to dynamically set parameters for calculation. |
1.6.0
12/10/23
Type | Description |
---|---|
Feature | Cluster detail page: Added optimization panel showing CPU and memory utilization, and cost information for the time range selected. |
Feature | Node detail page: Added optimization panel showing CPU and memory utilization, and cost information for the time range selected. |
Enhancement | Cost page: All panels on the Cost page provide the Explore button so you can view the query, debug issues, or create your own dashboard. |
Fix | Storage savings: Fixed storage results, which displayed multiple series errors in cases where a PersistentVolumeClaim was present on more than one node within the selected time range. |
Fix | Container details page: Fixed events, which weren’t displaying. |
1.5.0
11/29/23
Type | Description |
---|---|
Feature | Container detail page: Added page. This page includes CPU and memory utilization, and cost information. The restart and termination panels are adjacent to each other to allow quicker correlation of restart and termination events. |
Feature | Namespace detail page: Added optimization panel showing CPU and memory utilization, and cost information. |
Enhancement | Workloads detail page: Added optimization panel showing CPU and memory utilization, and cost information. Updated the feature for detecting outlier CPU usage in the Pods to uses the time range selected in the time selector instead the latest/current time. |
Fix | Pod detail page: Updated the CPU and memory queries for accuracy. Removed duplicate containers that were sometimes listing. |
Fix | Home page: Decreased load time for the Pods in trouble panel, and filtered only for issues that are currently active. |
Fix | Added Zipkin port configuration to ensure traces work properly. |
Fix | Updated internal release process and documents. |
1.4.1
11/10/23
Type | Description |
---|---|
Feature | Configuration wizard: Added a selector for the platform. |
Feature | Pod detail page: Added an optimization panel containing CPU and memory utilization and cost. |
Fix | Time range selector: Changed to use the time zone of the user, to ensure accuracy of time range data. |
Fix | Use new metrics receiver flow component in the integration code snippets. |
Fix | Fixed missing data source for k3d environment. |
1.4.0
11/3/23
Type | Description |
---|---|
Enhancement | Time range selector: Added to all appropriate pages to allow a specific time range for data to be displayed. |
Enhancement | Metrics status page: Improved for better user experience. |
Enhancement | Configuration wizard: Improved visibility of OpenCost configuration information. |
Enhancement | Pod details page: Used Scenes to build panels on this page. |
Fix | Default instructions in Github for integration: Fixed the appropriate name for Prometheus component. |
Fix | Error message for testing integration: Revised for validity and comprehension. |
Fix | Workload key and data list fixes: Revised to use correct key on workloads list. |
Fix | Kafka integration: Revised job name to kafka for integration to work properly. |
Fix | Status queries: Adjusted to show consistent results. |
Backend release notes
The following are release notes for the Kubernetes Monitoring backend.
2.2.0
June 2025
Type | Description |
---|---|
Enhancement | Alert rules: Added the cluster label to any alert description where it was missing. Added KubePdbNotEnoughHealthyPods , KubeNodePressure , and KubeNodeEviction . Updated KubeletTooManyPods to de-duplicate Pod counts and improve evaluation performance. Updated KubeStatefulSetReplicasMismatch to also fire when Pods cannot be created. Updated KubeNodeNotReady and KubeNodeReadinessFlapping to filter out cordoned Nodes. Removed KubeAPIDown and KubeletDown alerts as they are misleading. |
Enhancement | Recording rules: Added node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate5m . Updated namespace_workload_pod:kube_pod_owner:relabel to support more workloads. Updated all apiserver rules to support both Prometheus v2 and v3 historgram bucket formats. Updated apiserver_request:availability30d to prevent null values. Deprecated node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate . |
2.1.2
December 2024
Type | Description |
---|---|
Enhancement | Alert rules: Make KubeDaemonSetRolloutStuck alert configurable. Added instance label to KubeAggregatedAPIErrors. Added cluster label to KubeClientCertificateExpiration description. |
Enhancement | Build process/tooling: Introduced automated Go dependency updates. Security updates CVE-2024-45337 and GO-2024-3333. |
Fix | Recording rules: Corrected a data calculation in cluster_verb_scope:apiserver_request_sli_duration_seconds_count:increase.*. |
2.1.1
November 2024
Type | Description |
---|---|
Enhancement | Alert rules: To better support multi-cluster environments, addedcluster label to KubeAPIErrorBudgetBurn , KubeAPITerminatedRequests , and KubeClientCertificateExpiration . |
Enhancement | Recording rules: Improved data consistency for cluster_verb_scope_le:apiserver_request_sli_duration_seconds_bucket:increase1h , cluster_verb_scope_le:apiserver_request_sli_duration_seconds_bucket:increase30d , cluster_verb_scope:apiserver_request_sli_duration_seconds_count:increase1h , and cluster_verb_scope:apiserver_request_sli_duration_seconds_count:increase30d . |
Enhancement | Build/process tooling: Updated dependencies to address vulnerabilities. |
2.1.0
September 2024
Type | Description |
---|---|
Enhancement | Added alert groups: - kubernetes-storage (five KubePersistent.* alert rules)- kube-apiserver-slos (four KubeAPIErrorBudgetBurn alert rules)- kubernetes-system-apiserver (six system API Server alert rules, including KubeAggregatedAPIDown ). |
Enhancement | Added recording rule groups: - kube-apiserver-.*.rules (32 *apiserver.* recording rules)- kubelet.rules (including node_quantile:kubelet_pleg_relist_duration_seconds:histogram_quantile )- kube-scheduler.rules (nine cluster_quantile:scheduler_.*:histogram_quantile recording rules). |
Enhancement | Updated upstream Kubernetes mixin: Split k8s.rules.container_resource group into four groups to improve performance. |
2.0.0
March 2024
Type | Description |
---|---|
Enhancement | Removed dashboards from Kubernetes Monitoring now that all views are available in the application. |
1.1.2
January 2023
Type | Description |
---|---|
Fix | Removed duplicate recording rule group. |
Fix | Corrected broken links on Home dashboard. |
1.1.1
November 2023
Type | Description |
---|---|
Fix | Disabled “all” and multi-select on Events Data Source variable. |
Enhancement | Split k8s.rules group into seven groups to improve evaluation performance. |
1.1.0
October 2023
Type | Description |
---|---|
Feature | You can focus on cardinality exclusively related to Kubernetes by using the prebuilt Kubernetes Monitoring Cardinality dashboard. |
1.0.0
October 2023
Type | Description |
---|---|
Enhancement | Added job label to KubePodNotReady alert to be compatible with new Grafana Alerting structure. |
0.0.14
July 2023
Type | Description |
---|---|
Feature | Added recording rules for Windows Exporter, for compatibility with node-exporter queries. |
Enhancement | Restored events selector to prevent logs from being returned as events. |
Enhancement | Updated upstream Kubernetes mixin. |
0.0.13
May 2023
Type | Description |
---|---|
Enhancement | Added ability to use any (non-empty) job label on dashboards panels and rules. |
Enhancement | Added Node Exporter to configuration status panel. |
Enhancement | Added duplicate installation detection to configuration status panel. |
Enhancement | Added informational tooltip to configuration status panel. |
0.0.12
April 2023
Type | Description |
---|---|
Fix | Updated upstream agent-operator jsonnet-libs to inherit a fix for relabeling related to hitting a labeling limit with Loki. |
0.0.11
March 2023
Type | Description |
---|---|
Fix | Corrected template variables on Efficiency dashboard. |
0.0.10
February 2023
Type | Description |
---|---|
Fix | Removed the beta tag from the Events panel on the homepage dashboard. |
0.0.9
February 2023
Type | Description |
---|---|
Fix | Corrected Cluster aggregation for some recording rules. |
Helm chart release notes
For release notes on the Helm chart, refer to Grafana Kubernetes Monitoring Helm chart release notes.