Documentation for automated readers
A curated documentation index is available at: https://grafana.com/llms.txt
A complete documentation index is available at: https://grafana.com/llms-full.txt
These indexes can help with page discovery before fetching individual documents.
This page is also available in Markdown, which may be easier for automated readers and AI tools to parse than HTML. The Markdown version is available at https://grafana.com/docs/grafana-cloud/monitor-infrastructure/kubernetes-monitoring/triage-your-infrastructure/review-infrastructure-conditions.md, or by sending Accept: text/markdown to https://grafana.com/docs/grafana-cloud/monitor-infrastructure/kubernetes-monitoring/triage-your-infrastructure/review-infrastructure-conditions/. For broader documentation discovery, the curated index is available at https://grafana.com/llms.txt and the complete index is available at https://grafana.com/llms-full.txt.
Review infrastructure conditions
The Infrastructure section on Kubernetes Overview surfaces platform conditions, the layer below your workloads. When this section lights up, workload symptoms in Stability usually follow.
An infrastructure check identifies:
- Nodes under resource pressure
- Pods that have been evicted or lost contact with the Cluster

Click View detail on any tile to see the affected items listed under Detail view at the bottom of the page.
Node pressure
These are Nodes with an active MemoryPressure, DiskPressure, or PIDPressure condition. These are early warning signals for Pod evictions or failures.
Evicted Pods
These are Pods evicted by the kubelet due to Node resource pressure or by the scheduler due to priority preemption.
Pods in unknown phase
These are Pods in the Unknown phase. This typically occurs when the Node hosting the Pod becomes unreachable and Kubernetes can no longer determine the Pod’s state.
Unknown Pods transition to Failed or recover once Node connectivity is restored.Nodes that cannot be scheduled
These are Nodes that have been cordoned or tainted to block new Pod placement, usually during maintenance or drains. If Nodes cannot be scheduled for long periods, effective Cluster capacity shrinks and Pods can stay pending or concentrate load on the remaining Nodes.
NoSchedule or NoExecute taint added by an operator or controller (for example, GPU or dedicated workload Nodes), the cluster autoscaler marking a Node before scale-down, or automatic taints triggered by MemoryPressure, DiskPressure, or PIDPressure.Was this page helpful?
Related resources from Grafana Labs


