Backend mixin (also called integration), covering the alert and recording rules
Helm chart for the default and other configurations of Kubernetes Monitoring
Frontend release notes
The following are release notes for the Kubernetes Monitoring GUI.
2.28.2
03/25/26
Type
Description
Fix
Continuous profiling: Fixed profiling time range to use the dashboard time range instead of a hardcoded 24-hour lookback, and disabled the “View profiles” button when the specific profile type doesn’t exist for a workload.
Fix
Continuous profiling: Fixed Pyroscope service name matching for namespace/workload format when multiple service names share the workload name as a substring.
Fix
Configuration: Renamed configuration wizard step labels to match instructions (“Stack information” to “Stack and platform”, “Platform and features” to “Monitoring type”).
2.28.1
03/16/26
Type
Description
Enhancement
Continuous profiling: Added Pyroscope integration with embedded flame graphs on workload and Pod detail pages, and direct links to Profiles Drilldown from CPU and Memory tabs.
Enhancement
Configuration: Renamed and clarified all four configuration wizard step labels for consistency, and reworded the step two instructions to better match the workflow order.
Fix
Search: Fixed search page not loading previous search terms when opened from a shared or bookmarked URL.
Fix
Configuration: Fixed the Integration Hub onboarding flow where pasting an existing access token did not populate the token in the generated helm upgrade command.
Fix
Configuration: Fixed the OpenCost documentation link not displaying as a clickable link in the cost metrics alert.
Fix
Security: Updated dependencies to address CVEs.
2.28.0
03/11/26
Type
Description
Feature
Configuration: Redesigned the configuration page into four steps for a clearer setup experience.
Feature
Health: Added a new Health page showing a live snapshot of active issues across all Clusters.
Enhancement
OpenTelemetry: As part of adding support for OpenTelemetry, added CPU metrics for OTel data sources in Node, Namespace, and Cluster views.
Fix
Error messaging: Improved error messages for recording rule operations to show the actual error instead of a generic failure message.
Fix
Metrics accuracy: Panels now filter out internal bookkeeping metrics reported by the container runtime (cAdvisor), so CPU and memory values reflect only your actual application containers.
Fix
Workload metrics: Used workload-level recording rules instead of Pod filtering for more accurate workload metrics.
Fix
Panel queries: Added unique query reference IDs to prevent data conflicts in panels with multiple queries sharing the same runner.
2.27.2
03/04/26
Type
Description
Fix
Predictions: Dynamically load prediction modules to prevent page crashes if a prediction module fails to load.
2.27.1
03/03/26
Type
Description
Feature
Search: Redesigned the search page for better user experience.
Enhancement
OpenTelemetry: As part of adding support for OpenTelemetry, added CPU limit metrics for OTel data sources in Namespace, Node, and Cluster views so you can compare CPU usage against configured limits.
Enhancement
Configuration: Added a disabled state to feature toggles that aren’t supported on the selected platform.
Fix
Logs: Fixed Loki data source picker to only show Grafana Cloud Loki data sources, which previously could show non-Grafana-Cloud data sources that wouldn’t work correctly.
Fix
Events: Added a dedicated query for object_name and object_kind fields in event logs so that event details display correctly.
Fix
Workload detail: Renamed “Pods” label to “Pods in time range” to more accurately reflect the metric.
2.27.0
02/23/26
Type
Description
Enhancement
OpenTelemetry: Added support for OpenTelemetry kubeletstatsreceiver CPU metrics.
Fix
IOPS by Pod: Fixed write IOPS to use correct container_fs_writes_total metric.
Fix
Pod filtering: Fixed page breaks with many Pods by using recording rule instead of URL parameters.
Fix
OpenTelemetry events: Fixed events labels not displaying correctly for OpenTelemetry events.
Fix
Configuration: Changed default Cluster name to ‘my-cluster’ in onboarding instructions.
2.26.0
02/10/26
Type
Description
Feature
Manage costs: Redesigned the costs and savings view.
Enhancement
Logs: Use node filter (if available) for OTel logs in node logs view for more accurate filtering.
Enhancement
Configuration instructions: Added cluster name to the simplified Alloy Helm instructions for better clarity.
Fix
Alerts: Removed node filter in pods table alerts to prevent incorrect filtering.
2.25.3
02/03/26
Type
Description
Feature
Search: Added ability to search by container image name, making it easier to find all instances of a specific image.
Enhancement
Alert detection: Enhanced alert detection to support both Prometheus Alertmanager-based alerts and Grafana-managed alerts during the alerting platform migration.
Enhancement
Knowledge Graph integration: Added banner to enable and navigate to Knowledge Graph for root cause analysis when not enabled.
2.25.2
01/27/26
Type
Description
Feature
Pod count: Added Pod count panel to workload detail pages.
Fix
Logs: Fallback to default logs query if no data found in any of the queries.
2.25.1
01/27/26
Type
Description
Fix
Logs: Fallback to OTel logs queries as expected.
Fix
Security: Updated dependencies to satisfy CVEs and updated eslint.
2.25.0
01/20/26
Type
Description
Feature
Logs: Replaced logs with new drilldown logs extension for improved log exploration.
Enhancement
Network metrics: Added network metrics for Windows nodes.
Fix
Logs: Use query runner as data provider to get the logs query.
Fix
Security: Updated @remix-run/router to v1.23.2 to fix vulnerability.
Fix
Pod metrics: Removed by label filter in Pod level request and memory sets for more accurate metrics.
Fix
Cronjob: Fixed cronjob filtering to properly filter by namespace.
2.24.3
12/11/25
Type
Description
Feature
Configuration: Added the capability to enable or disable forwarding traces to application receivers.
Enhancement
Changed the default view in the Logs and Events tab from the table view to the logs list view.
Fix
Jobs: Fixed Reason column in the Pods table under Jobs.
Fix
Jobs: Fixed an issue where empty or missing schedules could cause errors when displaying Cronjob schedules.
Fix
Workloads list: Fixed workload counts on the namespace list so that they now accurately reflect the number of workloads in each namespace.
2.24.2
11/20/25
Type
Description
Feature
EC2 instances: Added links showing which EC2 instances correspond to which Cluster nodes.
Enhancement
Explore link: The Explore link opens in a new browser tab, offering stability of the GUI with large datasets.
Fix
Jobs: Fixed job URL creation to use the job name instead of owner kind, ensuring more accurate links for Kubernetes jobs.
2.24.1
11/20/25
Type
Description
Fix
Knowledge graph: ObjectDetailExtension now uses the Kubernetes workload concept instead of entityName. This ensures displayed details are more accurate and terminology matches current Kubernetes standards.
Fix
Jobs: - Fixed job URL creation to use the job name instead of owner kind, ensuring more accurate links for Kubernetes jobs.
2.24.0
11/13/25
Type
Description
Feature
Access control: Added the ability to use RBAC roles.
Fix
Jobs: Assigned a default owner if no owner type exists, to ensure job links work properly.
2.23.0
10/30/25
Type
Description
Feature
Grafana Assistant: Added Grafana Assistant icons to most panels and added the Investigate with Assistant button to all detail pages.
2.22.0
10/22/25
Type
Description
Feature
Jobs: Added jobs and cron jobs monitoring.
Enhancement
Filtering: Enabled column filtering on all tables.
Enhancement
Activation: Changed name of activation tab to Manage app.
Fix
Workbench: Removed filter from columns with a checkbox.
Fix
Node table queries: Used node filter for accuracy.
Fix
Column headings: Removed incorrect units.
2.20.0
8/29/25
Type
Description
Enhancement
Integrations: Added a Kubernetes filter to the Connections page.
Enhancement
Configuration: Added the ability to activate Kubernetes Monitoring.
2.19.8
8/21/25
Type
Description
Enhancement
Integrations: Removed integrations from the Configuration page.
2.19.7
8/19/25
Type
Description
Enhancement
Configuration: Removed the selection in configuration to switch on Application Observability now that users can activate it from the product.
8/5/25
Type
Description
Enhancement
Asserts: Added new term ‘insights’ instead of ‘assertions’.
Fix
OTel: Updated the cluster events query to improve support with OpenTelemetry Collectors.
Fix
Scrape intervals: Removed hard-coded time interval so that panels reflect custom scrape interval.
2.19.5
7/25/25
Type
Description
Enhancement
Asserts: Added health indicator icons to lists of components.
Fix
Pod CPU and memory: Reverted fix from 2.19.3 due to bugs it was causing.
Fix
Container memory: Fixed container memory panels to aggregate correctly.
Fix
Cluster list: Corrected the number of nodes for a cluster in the info section.
2.19.3
7/16/25
Type
Description
Enhancement
Cost overview: Added ability to jump to list of Clusters from Cost Overview page.
Enhancement
Integrations: Added a message on the Integrations page that integrations will soon be accessible from the Connections page.
Fix
Pods: Include all Pods if no pod value specified and use podName value if available on Pod CPU and Memory queries to prevent no data from showing.
2.19.2
7/3/25
Type
Description
Feature
GPU: Added GPU tabs and panels to Cluster and Node pages.
Feature
EC2 link: Added link to navigate between EC2 instance in Cloud Provider Observability and Kubernetes Monitoring.
Enhancement
Cost: Added tooltips for cost panels to clarify underlying calculations and their time frame.
Enhancement
User experience: Streamlined descriptions of panels for clarity and brevity.
Fix
Storage: Changed inode values to percentages.
Fix
Workloads: Fixed query to ensure all barepods workloads are listed, whether desired Pods or ready Pods.
Fix
CPU and memory tabs: Corrected the calculation for the Efficiency Usage/Requests panel on the CPU and Memory tabs of the Node detail page.
Fix
Argo Rollout Pods: Removed duplicate query to prevent incorrect count.
Fix
Container page: Fixed tab links to navigate to appropriate pages.
2.19.1
6/11/25
Type
Description
Enhancement
Serverless and Windows Nodes: Added support for showing Node memory usage on platforms that cannot use Node Exporter (such as GKE Autopilot and AWS EKS Fargate), and Windows.
Enhancement
Logs table: Added the ability for a user to rollover a line in the logs table and click an icon to see the full log message.
Fix
Queries: Fixed queries for Cluster names with a ., which causes results to be empty.
Fix
Non-Kubernetes Nodes: Prevented the app from selecting Nodes that are not Kubernetes Nodes to be counted and displayed.
2.19.0
6/4/25
Type
Description
Enhancement
Memory: Added a memory tab and panels to Node, Namespace, Workload, Pod, and Cluster detail pages.
Enhancement
Workloads: Added the capability to view nonstandard workloads that contain cronjobs, bare pods, static pods, Argo Rollouts, Strimzi Pod sets, and others.
Enhancement
Logs: Created a table format to display logs and events.
2.18.0
5/21/25
Type
Description
Enhancement
CPU tabs: Added CPU tabs to the Pod, Workload, and Namespace detail pages.
Enhancement
GKE Autopilot and AWS EKS Fargate: Used kubelet Resource Metrics (node_cpu_usage_seconds_total) to show CPU usage for platforms that cannot use Node Exporter, including GKE Autopilot and AWS EKS Fargate.
Fix
Link to Pod: Fixed ability to go to a specific Pod from the Network Bandwidth by Pod panel on the Node details page on the Network tab.
Fix
URL: Fixed variables not synching with URL.
Fix
Metrics status page: Corrected spacing between panels.
Fix
DNS: Added a . to the end of hostnames to prevent DNS confusion.
Fix
OTLP: Included the OTLP Gateway destination in the Kubernetes Helm chart to route metrics, logs, and traces using the OTLP format.
Fix
Outlier detection: Changed Outlier detection to use podNames to prevent incorrect counts of outlier Pods.
2.17.0
5/6/25
Type
Description
Feature
CPU details: Added CPU tab for Node details page and Cluster details page.
Fix
Table headers: Made table headers visible for light mode.
Fix
Pods: Fixed Pods with the same name on different Clusters to be differentiated from each other.
Fix
Table columns: Make it more apparent table columns can be resized.
Fix
Data accuracy: Ensure that CPU and memory requests only look at active pods, not completed or failed.
Fix
Terraform files: Ensure use of strings for usernames in Terraform files to ensure compatibility for Helm chart values.
2.16.0
4/23/25
Type
Description
Enhancement
Storage: Added IOPS and throughput panels on all Storage tabs.
Fix
Cost: Prevented duplicate cAdvisor scrape jobs from affecting cost allocation.
Fix
Alerts: Fixed namespace drop-down menu on Alerts page from not being populated.
Fix
Pod names: Corrected formatting of Pod names when using Explore from Pod lists.
Fix
Memory usage: Prevented completed and pending pods from affecting memory usage queries on the Pod details page.
2.15.1
4/17/25
Type
Description
Enhancement
Alerts: Added capability to filter alerts on Alert page by Node.
Enhancement
Outlier detection: Outlier detection for memory automatically runs when you visit a workload details page for multi-Pod workloads, and links to Explore to show the outlier Pod(s) using more memory.
Fix
Integration: Correct the configuration snippet for the Core DNS integration.
2.15.0
4/14/25
Type
Description
Feature
Outlier detection: Outlier detection automatically runs when you visit a workload details page for multi-Pod workloads, and links to Explore to show the outlier Pod(s) using more CPU.
Enhancement
Detail pages: CPU and memory columns and prediction buttons for each row have been added to the Cluster, workload, and namespace detail pages.
Fix
Events and logs status check: Fixed status check and Explore links for events and logs on the Metrics status tab.
Fix
Overview page: Fixed the All alert links on the home page.
Fix
Overview page: Corrected the workload count on the overview page to more closely correlate to the workload count shown on the workloads list page.
Fix
Fleet Management: Fixed reference to Fleet Management Terraform variables.
2.14.1
4/2/25
Type
Description
Enhancement
Performance: Decreased the training and inference latency of the machine learning functionality to show results quicker.
Enhancement
Alerts: Added filtering by alert name.
Enhancement
Overview page: After filtering by Cluster or namespace, you can click on the count of Clusters, Nodes, namespaces, or workloads and show only those objects within the scope of the filter.
Enhancement
User experience: On Workload and Namespace list pages, made the Cluster names clickable and added graphs to the counts.
Fix
Workload counts: Corrected calculations on workload counts on the Overview page.
Fix
List pages: Restored the alerts table column.
Fix
Alloy logs instance: Set the Pod security context for the Alloy logs instance on OpenShift.
Fix
OpenShift: Corrected the port names for OpenShift so that Alloy can discover targets to scrape.
Fix
EKS Fargate: Removed mounting of /var/log and /var/lib/docker/containers.
2.14.0
3/24/25
Type
Description
Feature
Troubleshooting: Added a debug panel showing the source of a panel’s metrics, available from the panel menu.
Enhancement
Performance: Added a view for each list page when queries extend beyond performance capabilities which degrade page loading. This view includes a message to search or filter for specific components.
Fix
Performance: Stopped unrelated panels from reloading when the logs data source is changed.
Fix
Accuracy: Stopped counting containers that are not running, which caused CPU limits and requests to appear higher than they should.
Fix
Platforms: Disabled unnecessary components for specific platforms: nodelogs on Fargate, and profiling, Beyla, and Windows Exporter on Autopilot.
2.13.2
3/18/25
Type
Description
Fix
Fleet Management: Fixed interaction with Fleet Management so that Alloy installation shows positive on Metrics Status page.
Fix
Fleet Management: Changed configuration to store secret for Fleet Management in secret instead of environmental variable.
2.13.1
3/13/25
Type
Description
Fix
Removed stabilityLevel: public-preview in relation to Alloy from the code generated on the Configuration page.
Fix
Prevented a race condition causing HTML error messages to appear.
Fix
Asserts: Restored RCA workbench linking.
2.13.0
3/9/25
Type
Description
Feature
Configuration: Added toggle in easy deploy for users to add Fleet Management for management of Alloy deployments.
Enhancement
Cluster detail page: Added buttons to allow users to navigate to workloads and namespaces within the cluster.
Fix
Memory queries: Switched to Working Set Size (WSS) instead of Resident Set Size (RSS) to increase accuracy of memory statistics and match kubectl and Linux Out-of-Memory behavior.
2.12.3
2/20/25
Type
Description
Fix
Configuration and status tabs: Fixed issue of configuration state not persisting when user switches between the tabs.
Fix
URL stability: Maintain URL on refresh of page.
Fix
Integrations: Updated integrations to work with Kubernetes Monitoring Helm chart version 2.
2.12.2
2/6/25
Type
Description
Fix
Workload details page: Fixed TypeError to prevent unsafe access to data source.
2.12.1
2/6/25
Type
Description
Fix
Receiver URLs: Updated URLs to match the version 2 Alloy deployment.
Fix
Tempo and Logs: Updated configuration to fix Tempo protocol and autopilot for logs.
2.12.0
2/5/25
Type
Description
Feature
Storage: Added storage tabs and graphs to Cluster, Node, workload, namespace, and Pod detail pages.
Enhancement
Configuration: Updated the easy deployment configuration page to use Helm chart v2.
Enhancement
Recording rules: Added the install/uninstall button to the Workload Recording Rule status for the cloud-provisioned hosted metrics data source.
Fix
Prediction button: Disabled if the time range selected is less than two hours.
Fix
Data source drop-down: Removed machine learning metrics data source from the drop-down menu.
Fix
Data source drop-down: Added verification that the data source is valid prior to displaying in the drop-down menu.
Fix
Workload list page: Fixed to not display all columns if filters result in more than 1,000 results.
2.11.2
1/15/25
Type
Description
Fix
Time range: Ensured the time range selected in Kubernetes Monitoring is transferred to Application Observability when the View application layer button is clicked.
Fix
Performance loading: Resolved issues related to rerendering of pages.
Fix
Filters: Ensured when time range is changed that all filters are not returned to the default.
2.11.1
1/9/25
Type
Description
Enhancement
User experience: Added the tool tip to the Copy icon.
Fix
Node count: Fixed incorrect Node count on Kubernetes Overview page due to empty node labels.
Fix
Performance loading: Improve loading performance of tables in detail pages.
Fix
Memory data: Rollback to the original recommendation for memory requests to equal memory limits.
2.11.0
1/6/25
Type
Description
Enhancement
Performance: If Workload list page result is over 1,000 workloads, the page will load quickly but no usage or cost data will show. A message will display suggesting the list be filtered, and include a link to search.
Data source: Filter the data source drop-down menu to be limited to Grafana Cloud data sources.
2.10.0
12/9/24
Type
Description
Feature
Performance: Improve performance by performing a code split.
Fix
Loop error: Fixed a history loop error and page breaking.
2.9.0
11/25/24
Type
Description
Feature
Search: Added the ability to search for any Kubernetes object.
Fix
Metrics status: Show the status checks even if no clusters were detected.
2.8.8
11/12/24
Type
Description
Fix
Erroneous no data error: Added a null check to prevent the no data message from displaying due to UID not found for a database.
2.8.7
11/6/24
Type
Description
Fix
Asserts: Fix to ensure Kubernetes Monitoring is aware Asserts is enabled.
2.8.6
11/5/24
Type
Description
Fix
Asserts: Fix the time range so that it is synchronized between Kubernetes Monitoring and Asserts.
2.8.5
10/30/24
Type
Description
Fix
Updated link to the Application Observability GUI.
Fix
Properly align Cluster columns.
2.8.4
10/17/24
Type
Description
Enhancement
User experience: Created Cluster, namespace, and Pod columns for alerts on home page.
Enhancement
User experience: On the Metrics status tab, added the ability to select the status of all or many Clusters.
Fix
Instead of displaying no data on the home page should an outage occur, the home page will show historical data prior to the outage and current data post-outage.
Fix
GUI: Fixed various anomolies with the GUI, including the location of the navigation drop-down menu and the width of certain columns.
2.8.3
9/23/24
Type
Description
Fix
Alerts: Hide panel menu item for creating alerts if a user is not an administrator.
2.8.2
9/12/24
Type
Description
Enhancement
User experience: Restricted the namespace field for the quick configuration process to only valid characters.
Enhancement
User experience: Added the link to the Cluster below the title on detail pages.
Enhancement
Troubleshooting: Added the Helm chart to the Metrics status page.
Enhancement
User experience: Added waiting reasons and colors for Pods that are either waiting due to startup or due to a problem.
Enhancement
Troubleshooting: Added the waiting reason for Pods that are waiting instead of running.
Enhancement
Alerts: Added the ability to create an alert item from a panel.
Fixes
Fixed the container memory limit on Container detail pages to be accurate. Ensured logs and events are not listed when there are no Pods. Fixed ability to remove a ’no data’ filter when a time range is changed.
2.8.1
8/25/24
Type
Description
Feature
New platform: Added Azure AKS as a separate platform for configuration.
Enhancement
Metrics status: Added statuses for recording rules and KSM job label.
Enhancements
User experience: Moved up the required backend installations step to prevent misconfigurations. Restricted integrations view to administrators only and improved message to non-administrative user. Added links to documentation on Overview, Cost, Alerts, and Configuration pages. Adjusted the width of filters and moved copy button closer to item for ease of use. Refresh home page when data starts being sent.
Enhancement
Code snippets for integrations: Added configuration code snippets for Apache CouchDB, Apache Hadoop, Apache Tomcat, and IBM DB2.
Fix
CPU statistics: Fixed issue that was causing the memory legend to show in relation to CPU stats.
2.8.0
8/6/24
Type
Description
Feature
Energy use: Added energy metrics by using third-party Kepler open source software.
Fix
Duplicates: Deduplication of Pods and containers.
2.7.0
7/29/24
Type
Description
Enhancement
User experience: Added a tab for logs and events on every detail page, including the Cluster detail page.
Enhancement
Integrations: Added snippets for the Alloy and Varnish integrations.
Enhancement
Logs: Made log queries work with OTel-style labels.
Fixes
Correct time range format on home page. Ensured Kubernetes Monitoring opens in correct time range from Asserts.
2.6.1
7/16/24
Type
Description
Enhancement
Alerts: Added the ability to click on the alert name from the Kubernetes Overview page.
Enhancement
Integrations: Added configuration snippets for Squid and Apache Mesos.
Fix
Prevented continuous rendering of data in the Safari browser.
2.6.0
7/10/24
Type
Description
Feature
Home page: Upgraded the home page to better show the status of healthy Kubernetes objects as well as any potential issues.
Feature
Asserts button: Added Assertions button to details pages.
Enhancement
User experience enhancements: Added quick ways to navigate within the app, including the drop-down menu for all main pages, the overview and network tabs, deep linking from the home page CPU and memory graphs to the Cluster, improved Helm chart configuration process, and visual tags to indicate the type of Kubernetes object for each view.
2.5.0
6/25/24
Type
Description
Feature
Network panels: Added Network Bandwidth and Network Saturation data.
Feature
Cost overview panels: Added a 90-day view of total compute cost, average cost per Pod, and average Pod count on the Cost Overview page.
Enhancement
OpenTelemetry: Added support for displaying events coming from the OpenTelemetry Kubernetes Events Receiver.
Fix
Workloads with same name: Corrected CPU and memory usage data for workloads with the same name but in different Clusters.
2.4.0
6/18/24
Type
Description
Feature
Deleted objects: Added the ability to find deleted Clusters, Nodes, Pods, containers, workloads, and namespaces.
Enhancement
User experience: Improved “no data” messages on home page.
Enhancement
Metrics: Host metrics generation.
Fix
Error messages: Fixed query to prevent an invalid error message.
Fix
Update host metrics switch language and disable CRDs on OpenShift.
2.3.5
6/12/24
Type
Description
Enhancement
Integration: Added Apache Airflow configuration snippets for integration.
Fix
Duplicates: Removed duplicate data showing on home page.
Fix
Alerts: Improved alert count accuracy for certain kinds of Pod types.
2.3.4
6/7/24
Type
Description
Enhancement
Added time range selector to home snapshot page.
2.3.3
6/3/24
Type
Description
Enhancement
Integrations: Added configuration snippets for the integrations Wildfly and OpenSearch.
Enhancement
Performance: Improved load time for queries spanning longer time ranges.
Fix
Refresh and Cluster filters: Fixed so refresh does not cause filters to be removed.
Fix
Cardinality page: Ensured the Series per namespace column data link is accurate.
Fix
Documentation links: Updated and corrected documentation links on Kubernetes Monitoring main page (that is shown prior to configuring).
Fix
Workload type: Ensured different workloads that are similarly named accurately show the type of workload and the data for the workload.
2.3.2
5/21/24
Type
Description
Enhancement
Performance: Improved loading performance of CPU and memory usage tables.
Fix
Added a null check for provider to avoid errors on Cluster list page.
2.3.1
5/16/24
Type
Description
Enhancement
Integration: Added configuration snippets for the Apache Solr integration.
Fix
Fixed Helm chart integration snippets to be compatible with Alloy.
Fix
Fixed recording rule to include ReplicaSets, static Pods, and bare Pods so that missing workloads now show and incorrectly labeled workloads are now labeled correctly. This also fixes workload usage and cost data.
2.2.5
4/25/24
Type
Description
Enhancement
Static Pod and Pod type: Added the static Pod label to all static Pods and the Pod label to bare/unmanaged Pods .
Enhancement
Integrations: Added flow configuration snippets for Velero and MS SQL Server integrations.
Fix
Fixed link to alert rule after automatic refresh.
2.2.4
4/22/24
Type
Description
Enhancement
Data source persistence in URL: Persisted the Loki data source in the URL so that accurate links can be shared.
Fix
Added check to prevent query errors.
2.2.3
4/18/24
Type
Description
Feature
Refresh control: Added refresh and auto-refresh choices to all pages.
2.2.2
4/17/24
Type
Description
Feature
Alerts: Added the Alerts column to the Nodes and Cluster list pages.
Enhancement
URL data source persistence: Persisted Prometheus data source in the URL so that accurate links can be shared.
Enhancement
Integration: Added configuration snippets for OracleDB integration.
2.2.1
4/15/24
Type
Description
Feature
Alloy update: Replace Grafana Agent with Grafana Alloy.
Feature
Cardinality: Added the Cardinality page to replace the Kubernetes Cardinality dashboard.
Feature
Alerts: Added the Alerts column to these pages: Pod detail, Pod list, and Workloads list.
Enhancement
Integration: Added configuration snippets for TensorFlow Serving integration.
Enhancement
User experience: Added the ability to select multiple clusters for filtering.
Fix
Removed logs panel from Node detail page for future refactoring.
Fix
Corrected improper indent in configuration snippet of IBM MQ integration.
Fix
Added kube_deployment as well as kube_replicaset to ensure all workload types are collected.
2.2.0
4/8/24
Type
Description
Fix
Refactored workloads to use more efficient queries.
2.1.3
4/4/24
Type
Description
Enhancement
Added the workload type when available to filter the list of Pods to improve search performance.
Fix
Enhanced the multi-line regular expression to support a broader collection of logs.
2.1.2
4/2/24
Type
Description
Fix
Reverted from support for OTel log labels.
2.1.1
4/2/24
Type
Description
Feature
Added interactivity with Sift automated machine learning investigation tool.
Enhancement
Added configuration snippets for the Couchbase and IBM MQ integrations.
Enhancement
Added support for OTel log labels.
Fix
Combined target_info with traces_target_info metrics to ensure all applications are found.
2.1.0
3/25/24
Type
Description
Enhancement
Configuration: Added configuration snippets for the PgBouncer integration. Added link to extraConfig information in the Helm documents from the code snippet areas of an integration.
Enhancement
Dashboards: Removed dashboards due to Angular deprecation.
Enhancement
Messaging: Improve wording of default messages when a data source is updated for a view.
Fix
Persist Cluster URL values to correct broken links.
2.0.4
3/13/24
Type
Description
Feature
Asserts workbench: Added RCA Workbench button to allow navigating to Asserts.
Enhancement
Explore: Expanded the explore feature to the list of all Pods. Improved the user experience for exploring data.
Fix
Removed inaccurate timestamp for namespace creation on Namespace details page.
Fix
Corrected the query for phase state, which is shown by expanding a list item on the Namespaces page.
Fix
Add the http prefix to OTLP endpoints on the Cluster configuration tab when OTel receivers are selected.
Fix
Persisted the data source between views.
2.0.3
3/6/24
Type
Description
Feature
Added the capability to navigate to Application Observability.
Feature
On the Cluster list page, you can click the Explore button to view the queries used in each column of the usage and cost data. This enables you not only to understand the data, but debug any issues or create your own queries or dashboards
Added a warning icon and tooltip on column headers of lists to indicate error detection and query failure.
Feature
With the easy configuration using the Helm chart, added the capability to enable OTel and Zipkin receivers.
Enhancement
Added the filter toolbar to list tables for better user experience.
Enhancement
Added duplicate metrics detection to Node Exporter, Windows Exporter, OpenCost, cAdvisor, and kubelet. When there is an error, a yellow warning icon appears next to the item on the Metrics status page.
Enhancement
Refined queries for “OOMKilled” containers to prevent counting of duplicate containers, specifically when a container is being killed and another is being created in its place.
Enhancement
The link to examine an alert opens in a new tab so that context within Kubernetes Monitoring is not lost.
Enhancement
Updated URL so that any shared URL includes any variables and the time range selected.
Enhancement
Release note text in the app links to the release notes in documentation.
Enhancement
Breadcrumbs now include page tab names.
Enhancement
Corrected the query for unallocated resources (PV) panel.
Fix
Created an additional check for deep links related to breadcrumbs display.
Fix
Refined queries to ensure deduplication of metrics, including removing reliance of uid label and having node/instance joins also use the Cluster label.
Fix
Corrected queries for the Explore button on the Workloads page.
Fix
Fixed CPU usage prediction graph to display correctly within window.
2.0.2
2/15/24
Type
Description
Enhancement
Refined deduplication of KSM kube_node_status_capacity, and added support for node with or without trailing port numbers.
Fix
Created more accurate cost calculations in the case of mixed node and instance labels.
2.0.1
2/15/24
Type
Description
Enhancement
Added the workloads column to the list of namespaces.
Fix
Corrected the calculation for CPU average queries.
2.0.0
2/6/24
Type
Description
Feature
The Alerts page provides a centralized location to view all alerts related to your infrastructure and the applications running within it. Graphs show alerts firing by Cluster and namespace, as well as by alert severity. The alert name and severity also display on the home page in the list of Pods in trouble. On the Namespaces and Workloads tabs, a column for each item shows the amount of firing alerts.
Feature
The ClickHouse integration is available for use with Kubernetes Monitoring.
Enhancement
On the home page, you can click the Cluster count to navigate directly to the list of Clusters on the Clusters tab, and click the Node count to navigate directly to the list of Nodes on the Nodes tab.
Enhancement
On usage cost/efficiency tables, the tables are sorted by default alphabetically by the name of the item (Cluster, namespace, and so on).
Enhancement
To improve loading performance, the namespaces, workloads, and Nodes lists are filtered by default by the first Cluster in the filter list.
Enhancement
You can click the Explore button on any item on the Metrics status tab to troubleshoot configuration issues and customize queries for status.
Enhancement
To prevent deployment failure due to timeout, increased the timeout for the Helm chart to five minutes.
Enhancement
An informational banner displays on the Dashboards page to indicate the prebuilt dashboards are being deprecated and removed, with the exception of the Cardinality dashboard.
Fix
Resolved a duplicate series error on the container details page.
Fix
Improved the status panel failure detection for kube-state-metrics.
Fix
When you click Start Sending Data, the Configuration page displays.
1.7.0
1/22/24
Type
Description
Feature
You can view usage and cost graphs and data on the Pod details page.
Feature
When using EKS on Fargate, Pod logs are made available by using the Kubernetes API to gather them.
Feature
Added or updated support for these integrations: Aerospike, ActiveMQ, Cassandra, Apache HTTP Server Project, cert-manager, ClickHouse, CoreDNS, GitLab, Kafka, MongoDB, NGINX, and PrestoDB .
Enhancement
On the Configuration page, you can add an expiration date to the token and view the scope of the token.
Fix
Corrected the following broken links: workload links on the Pod list of the Node details page, Cluster status link on the Configuration page, and links when Cluster name contains a slash character.
1.6.2
1/10/24
Type
Description
Enhancement
User experience enhancements: The home page displays the home page name on the breadcrumbs, and filter options display alphabetically.
Enhancement
You can copy any object in the table lists by clicking the copy button within the row.
1.6.1
12/19/23
Type
Description
Feature
IBM Cloud is available as a platform that you can select when using the configuration wizard.
Enhancement
You can choose Loki from the data source selector in all detailed views.
Enhancement
A link to documentation for understanding CPU limits is available on detail pages showing CPU limits set.
Enhancement
All IEC units show in standard form.
Enhancement
Throughout Kubernetes Monitoring, you have the copy icon available to copy any text so that you can use it in your own query, use it in the command line, send it to others, and so on.
Fix
Fixed empty Cost panels for customers who use forward slashes in Cluster names by changing to pipe formatter.
Fix
Corrected inconsistencies in the number of Nodes displayed by using an instant query for any single value result instead of using a range query. This improves app performance and data accuracy.
Fix
Fixed accuracy of CPU usage data on the Node details page.
Fix
Improved accuracy of machine learning prediction by using query variables to dynamically set parameters for calculation.
1.6.0
12/10/23
Type
Description
Feature
Cluster detail page: Added optimization panel showing CPU and memory utilization, and cost information for the time range selected.
Feature
Node detail page: Added optimization panel showing CPU and memory utilization, and cost information for the time range selected.
Enhancement
Cost page: All panels on the Cost page provide the Explore button so you can view the query, debug issues, or create your own dashboard.
Fix
Storage savings: Fixed storage results, which displayed multiple series errors in cases where a PersistentVolumeClaim was present on more than one node within the selected time range.
Fix
Container details page: Fixed events, which weren’t displaying.
1.5.0
11/29/23
Type
Description
Feature
Container detail page: Added page. This page includes CPU and memory utilization, and cost information. The restart and termination panels are adjacent to each other to allow quicker correlation of restart and termination events.
Feature
Namespace detail page: Added optimization panel showing CPU and memory utilization, and cost information.
Enhancement
Workloads detail page: Added optimization panel showing CPU and memory utilization, and cost information. Updated the feature for detecting outlier CPU usage in the Pods to uses the time range selected in the time selector instead the latest/current time.
Fix
Pod detail page: Updated the CPU and memory queries for accuracy. Removed duplicate containers that were sometimes listing.
Fix
Home page: Decreased load time for the Pods in trouble panel, and filtered only for issues that are currently active.
Fix
Added Zipkin port configuration to ensure traces work properly.
Fix
Updated internal release process and documents.
1.4.1
11/10/23
Type
Description
Feature
Configuration wizard: Added a selector for the platform.
Feature
Pod detail page: Added an optimization panel containing CPU and memory utilization and cost.
Fix
Time range selector: Changed to use the time zone of the user, to ensure accuracy of time range data.
Fix
Use new metrics receiver flow component in the integration code snippets.
Fix
Fixed missing data source for k3d environment.
1.4.0
11/3/23
Type
Description
Enhancement
Time range selector: Added to all appropriate pages to allow a specific time range for data to be displayed.
Enhancement
Metrics status page: Improved for better user experience.
Enhancement
Configuration wizard: Improved visibility of OpenCost configuration information.
Enhancement
Pod details page: Used Scenes to build panels on this page.
Fix
Default instructions in Github for integration: Fixed the appropriate name for Prometheus component.
Fix
Error message for testing integration: Revised for validity and comprehension.
Fix
Workload key and data list fixes: Revised to use correct key on workloads list.
Fix
Kafka integration: Revised job name to kafka for integration to work properly.
Fix
Status queries: Adjusted to show consistent results.
Backend release notes
The following are release notes for the Kubernetes Monitoring backend.
2.2.4
February 2026
Type
Description
Enhancement
Alert detection: Re-enabled support for Grafana-managed alerts. Alert queries now check both Prometheus Alertmanager (ALERTS metric) and Grafana-managed alerts (GRAFANA_ALERTS metric) to ensure all alerts are detected. During the transition period, you may temporarily see duplicate alerts for the same condition.
2.2.3
January 2026
Type
Description
Enhancement
Updated kubernetes-mixin integration to 1.4.2.
2.2.2
December 2025
Type
Description
Enhancement
Alert rules: Updated KubePodNotReady to remove Failed from phase label filter; updated KubeContainerWaiting from aggregated metrics using sum to using raw metrics; updated KubeCPUOverCommit and KubeMemoryOvercommit to differentiate between Non-HA and HA clusters; updated KubeQuotaAlmostFull, KubeQuotaFullyUsed, KubeQuotaExceeded to do explicit join instead of ignore; updated CPUThrottlingHigh to deduplicate with topk; updated KubeletPodStartUpLatencyHigh with topk for deduplication.
Enhancement
Recording rules: Updated scheduler_e2e_scheduling_duration_seconds bucket to scheduler_scheduling_attempt_duration_seconds_bucket (recording rule name also updated); updated scheduler_binding_duration_seconds_bucket to scheduler_pod_scheduling_sli_duration_seconds_bucket (recording rule name also updated); updated the kubelet PLEG node_quantile:kubelet_pleg_relist_duration_seconds:histogram_quantile recording rule to deduplicate with max by.
2.2.1
December 2025
Type
Description
Fix
Temporarily disabled Grafana-managed alerts.
2.2.0
June 2025
Type
Description
Enhancement
Alert rules: Added the cluster label to any alert description where it was missing. Added KubePdbNotEnoughHealthyPods, KubeNodePressure, and KubeNodeEviction. Updated KubeletTooManyPods to deduplicate Pod counts and improve evaluation performance. Updated KubeStatefulSetReplicasMismatch to also fire when Pods cannot be created. Updated KubeNodeNotReady and KubeNodeReadinessFlapping to filter out cordoned Nodes. Removed KubeAPIDown and KubeletDown alerts as they are misleading.
Enhancement
Recording rules: Added node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate5m. Updated namespace_workload_pod:kube_pod_owner:relabel to support more workloads. Updated all apiserver rules to support both Prometheus v2 and v3 histogram bucket formats. Updated apiserver_request:availability30d to prevent null values. Deprecated node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate.
2.1.2
December 2024
Type
Description
Enhancement
Alert rules: Made the KubeDaemonSetRolloutStuck alert configurable. Added instance label to KubeAggregatedAPIErrors. Added cluster label to KubeClientCertificateExpiration description.
Enhancement
Build process/tooling: Introduced automated Go dependency updates. Security updates CVE-2024-45337 and GO-2024-3333.
Fix
Recording rules: Corrected a data calculation in cluster_verb_scope:apiserver_request_sli_duration_seconds_count:increase.*.
2.1.1
November 2024
Type
Description
Enhancement
Alert rules: To better support multi-cluster environments, added cluster label to KubeAPIErrorBudgetBurn, KubeAPITerminatedRequests, and KubeClientCertificateExpiration.
Enhancement
Recording rules: Improved data consistency for cluster_verb_scope_le:apiserver_request_sli_duration_seconds_bucket:increase1h, cluster_verb_scope_le:apiserver_request_sli_duration_seconds_bucket:increase30d, cluster_verb_scope:apiserver_request_sli_duration_seconds_count:increase1h, and cluster_verb_scope:apiserver_request_sli_duration_seconds_count:increase30d.
Enhancement
Build/process tooling: Updated dependencies to address vulnerabilities.
2.1.0
September 2024
Type
Description
Enhancement
Added alert groups: - kubernetes-storage (five KubePersistent.* alert rules) - kube-apiserver-slos (four KubeAPIErrorBudgetBurn alert rules) - kubernetes-system-apiserver (six system API Server alert rules, including KubeAggregatedAPIDown).
Added the ability to enrich metrics with pod or namespace labels.
Feature
Added the ability to set protobufMessage and a shortcut for the remote_write protocol.
Feature
Added a feature for gathering logs using PodLogs objects.
Enhancement
Set CRI as default logs processor if runtime is unset.
Enhancement
Automatically set required environment variables when enabling remote config.
Enhancement
Updated Windows Exporter to 0.12.3, Beyla to 1.11.0, and Prometheus Operator Object CRDs to 26.0.1.
3.7.5
Type
Description
Feature
Added configuration to specify the overrides section of the Span Logs Connector component for the Application Observability chart.
Enhancement
Updated Alloy Operator to 0.4.1.
3.7.4
Type
Description
Enhancement
Improved OTLP destination protocol validation.
Enhancement
Updated Prometheus Operator Object CRDs.
Fix
Fixed indentation issues for Beyla relabel.
Fix
Added erofs to node-exporter filesystem exclusions.
3.7.3
Type
Description
Feature
Added eBPF sample rate for profiling.
Feature
Added the ability to skip cluster metrics ServiceMonitor checks.
Enhancement
Updated Node Exporter.
Enhancement
Made the loki.process CRI stage maxPartialLines configurable.
3.7.2
Type
Description
Feature
Added cloudProvider configuration support to MySQL and PostgreSQL databaseObservability.
Enhancement
Updated Node Exporter and Alloy Operator.
3.7.1
Type
Description
Enhancement
Updated Node Exporter and kube-state-metrics.
Fix
Fixed MySQL and PostgreSQL integrations for missing log destinations and secrets.
3.7.0
Type
Description
Feature
Added an integration for Istio sidecar and service metrics.
Feature
Added an integration for PostgreSQL, including support for Database Observability.
Feature
Added the ability to set otel_annotations flag for the k8sattributes processor.
Feature
Added more options to the secretFilter component in the pod logs features.
Enhancement
Deployed beyla-k8s-cache with 1 replica by default in auto-instrumentation feature chart.
Enhancement
Updated Alloy Operator and Beyla.
Enhancement
Added check for the presence of kube-state-metrics or Node Exporter ServiceMonitors if clusterMetrics and prometheusOperatorObjects features are enabled.
3.6.2
Type
Description
Enhancement
Updated Alloy Operator to 0.3.14.
Fix
Fixed extra quotes for the sending queue storage in OTLP destinations.
Fix
Fixed the inclusion of the destination secret for Service Graph instance.
3.6.1
Type
Description
Feature
Added the ability to override the security context for the waitForAlloyRemoval Helm Hook.
Feature
Added the ability to define the sending queue for OTLP destinations.
Feature
Added the ability to define the remote timeout for Loki destinations.
3.6.0
Type
Description
Feature
Added Pod Init container metrics to the kube-state-metrics allow list.
Feature
Added Database Observability to the MySQL integration.
Feature
Added the ability to disable sending traces from Beyla to the Application Observability feature.
Feature
Added the ability to use ScrapeConfig objects with the Prometheus Operator Objects feature.
Fix
Updated OpenCost to include an emptydir volume mount for its config path.
3.5.7
Type
Description
Feature
Added the ability to set scrape_native_histograms for Prometheus scrape configs.
Enhancement
Updated kube-state-metrics and Prometheus Operator CRDs.
Fix
Fixed port assignment on the etcd integration and several cluster metrics control plane services.
Fix
Fixed MySQL integration.
3.5.6
Type
Description
Feature
Added the ability to set resources for the Helm hooks.
Feature
Added the ability to set K8s Attributes Processor filters.
Enhancement
Updated Alloy Operator and Node Exporter.
3.5.5
Type
Description
Enhancement
Updated Beyla chart to 1.9.9.
Fix
Set node label for Tempo service integration.
3.5.4
Type
Description
Enhancement
Updated Alloy Operator, Prometheus Operator CRDs, Beyla, and OpenCost.
Enhancement
Updated default Service Graph dimensions.
Fix
Ensured Jaeger Compact and Jaeger Thrift protocols use UDP.
Fix
Ensured Service Graph utilizes destination defaults.
Fix
Fixed integration scrape intervals and timeouts to check properly inside the metrics object.
Fix
Set node label for certain service integrations.
3.5.3
Type
Description
Feature
Made Alloy Profiles deployable on all nodes.
Feature
Added label selectors for the Pod Logs feature.
Feature
Added labels and annotations to Alloy CR instances.
Enhancement
Updated Alloy Operator, Beyla, OpenCost, and Prometheus Operator CRDs.
3.5.2
Type
Description
Feature
Added the ability to change pod associations.
3.5.1
Type
Description
Feature
Added the ability to set the MySQL protocol.
Enhancement
Updated prometheus-node-exporter from 4.47.3 to 4.48.0.
Fix
Fixed OpenCost validation when using destinationsMap.
3.5.0
Type
Description
Feature
Created a separate feature for Pod Logs via Kubernetes API.
Feature
Added Custom type destinations.
Feature
Added the ability to set scrape timeout everywhere.
Enhancement
Updated default resource attributes remove list and labels set for profiling feature.
Enhancement
Span metrics: Added spans prefilter for internal.
3.4.1
Type
Description
Feature
Added the ability to set labels and annotations on the hook pod.
Enhancement
Updated kube-state-metrics and OpenCost.
Enhancement
Updated Node Exporter filesystem exclusion list.
Fix
Fixed the excludeNamespaces option in the Prometheus Operator Object feature.
3.4.0
Type
Description
Feature
Added default attribute remove list support for OTLP destinations.
Feature
Added the ability to override attributes and set TLS settings for remote config.
Fix
Properly truncated tail sampling and service grapher Alloy instances.
Fix
Removed the “wait for Alloy operator” hook.
3.3.2
Type
Description
Feature
Added the ability to set the default scrape timeout for Prometheus Operator Objects feature.
Feature
Added sys.env functions to remoteConfig collector.
Enhancement
Updated OpenCost to 2.2.2 and Prometheus Operator CRDs to 23.0.0.
Fix
Properly truncated tail sampling and service grapher Alloy instances.
3.3.1
Type
Description
Feature
Added image pull secrets and pull policy to Helm hooks.
Enhancement
Updated OpenCost to 2.2.
Enhancement
Added transform support for span metrics.
Fix
Fixed scrapeProtocols typo.
Fix
Fixed hook arm64 compatibility and prevented failure if Alloy instance already deleted.
3.3.0
Type
Description
Feature
Added passthrough for k8sattributes processor.
Feature
Added hooks to add finalizers and Alloy instance cleanup to prevent orphaned resources.
Feature
Added pre-install hook to wait for Alloy Operator readiness before creating Alloy resources.
Feature
Added alloy-operator.waitForReadiness configuration option to control the timing behavior.
Feature
Allowed passing destinations as a map.
Feature
Allowed modifying Alloy settings in one place.
Feature
Added span attribute support for skip metrics generation.
Enhancement
Improved Windows Exporter discovery rules.
Enhancement
Added kube_cronjob.* to kube-state-metrics default allow list.
3.2.6
Type
Description
Feature
Added the ability to set error mode for filter and transform processors.
Enhancement
Updated Beyla Helm chart to 1.9.2 in auto-instrumentation feature chart.
Fix
Skipped Beyla-generated traces in span metrics.
3.2.5
Type
Description
Enhancement
Updated kube-state-metrics to 6.1.4.
Fix
Allowed destination and remote config usernames to be numbers.
3.2.4
Type
Description
Enhancement
Updated Beyla, Alloy, Alloy Operator, kube-state-metrics, and Prometheus Operator CRDs.
3.2.3
Type
Description
Enhancement
Updated Windows Exporter to 0.12.1.
Enhancement
Updated annotation handling for profiling targets.
Enhancement
Included collector.id in span metrics transform.
Fix
Fixed bad double quote in CRD validation.
3.2.2
Type
Description
Enhancement
Updated Alloy Operator to 0.3.7.
Enhancement
Improved how integration logs and metrics are enabled so they can be properly disabled.
3.2.1
Type
Description
Enhancement
Updated Node Exporter to 4.47.3.
Fix
Fixed newline handling at the end of the Prometheus file.
Fix
Removed check preventing deploying if /var/log or /var/lib/docker/containers are set when they may not be needed.
3.2.0
Type
Description
Feature
New feature: Profiles Receiver.
Enhancement
Prevented Node Exporter from generating metrics about ramfs and tmpfs.
Enhancement
Set the job label on sources from the Annotation Autodiscovery feature to more reasonable values.
Enhancement
Set service.namespace and service.instance.id labels from typical sources when using the Pod Logs feature.