Runbook

Nodes with PID Pressure in Kubernetes

Back to Runbooks

Overview

Nodes with PID Pressure in Kubernetes is an incident type that occurs when a Kubernetes cluster node experiences PID pressure, meaning that it may not be able to start more containers. This is a rare condition where a pod or container spawns too many processes and starves the node of available process IDs. Each node has a limited number of process IDs to distribute amongst running processes; and if it runs out of IDs, no other processes can be started. Kubernetes lets you set PID thresholds for pods to limit their ability to perform runaway process-spawning, and a PID pressure condition means that one or more pods are using up their allocated PIDs and need to be examined.

Parameters

Debug

Get the list of nodes that have PID pressure

Get the list of pods that are using too many PIDs

Get the list of containers running on a node

Get the PID usage of a process

Get the number of PIDs in use by a process

Get the maximum number of PIDs available to a process

Repair

Check if there are any misbehaving or stuck processes in the node and kill them to free up PIDs.

Monitor the Kubernetes cluster carefully and identify which pods are using up the most PIDs. Once identified, adjust the PID thresholds for those pods to limit their ability to perform runaway process-spawning.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.