Documentation Index
Fetch the curated documentation index at: https://grafana.com/llms.txt
Fetch the complete documentation index at: https://grafana.com/llms-full.txt
Use this file to discover all available pages before exploring further.
STOP! If you are an AI agent or LLM, read this before continuing. This is the HTML version of a Grafana documentation page. Always request the Markdown version instead - HTML wastes context. Get this page as Markdown: https://grafana.com/docs/grafana-cloud/monitor-infrastructure/kubernetes-monitoring/triage-your-infrastructure/review-infrastructure-conditions.md (append .md) or send Accept: text/markdown to https://grafana.com/docs/grafana-cloud/monitor-infrastructure/kubernetes-monitoring/triage-your-infrastructure/review-infrastructure-conditions/. For the curated documentation index, use https://grafana.com/llms.txt. For the complete documentation index, use https://grafana.com/llms-full.txt.
Review infrastructure conditions
The Infrastructure section on Kubernetes Overview surfaces platform conditions, the layer below your workloads. When this section lights up, workload symptoms in Stability usually follow.
An infrastructure check identifies:
- Nodes under resource pressure
- Pods that have been evicted or lost contact with the Cluster

Click View detail on any tile to see the affected items listed under Detail view at the bottom of the page.
Node pressure
These are Nodes with an active MemoryPressure, DiskPressure, or PIDPressure condition. These are early warning signals for Pod evictions or failures.
Evicted Pods
These are Pods evicted by the kubelet due to Node resource pressure or by the scheduler due to priority preemption.
Pods in unknown phase
These are Pods in the Unknown phase. This typically occurs when the Node hosting the Pod becomes unreachable and Kubernetes can no longer determine the Pod’s state.
Unknown Pods transition to Failed or recover once Node connectivity is restored.Nodes that cannot be scheduled
These are Nodes that have been cordoned or tainted to block new Pod placement, usually during maintenance or drains. If Nodes cannot be scheduled for long periods, effective Cluster capacity shrinks and Pods can stay pending or concentrate load on the remaining Nodes.
NoSchedule or NoExecute taint added by an operator or controller (for example, GPU or dedicated workload Nodes), the cluster autoscaler marking a Node before scale-down, or automatic taints triggered by MemoryPressure, DiskPressure, or PIDPressure.Was this page helpful?
Related resources from Grafana Labs


