This incident fires when the percentage of memory still available for pod requests in a Kubernetes cluster is low. The memory requests of the pods already scheduled have consumed most of the allocatable memory on the nodes, so new pods may stay in the Pending state, and nodes that come under memory pressure may evict pods or OOM-kill containers. This can degrade performance and cause applications to crash. Resolve the incident promptly to keep the affected system stable and available.
Parameters
Debug
Check if any nodes are in the NotReady state
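A minimal sketch of this check, assuming `kubectl` is configured for the affected cluster. The helper name `not_ready_nodes` is hypothetical and only wraps `kubectl get nodes`.

```shell
# Print the name and status of every node whose STATUS column
# does not start with "Ready" (for example NotReady).
# not_ready_nodes is an illustrative helper name, not a kubectl command.
not_ready_nodes() {
  kubectl get nodes --no-headers | awk '$2 !~ /^Ready/ { print $1, $2 }'
}
```

Run `not_ready_nodes`; empty output means every node reports Ready. Cordoned nodes that show `Ready,SchedulingDisabled` are not listed.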
Check if any pods are in the Pending state
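One way to run this check is a field selector on the pod phase. The sketch below looks across all namespaces, which is an assumption; the helper name `pending_pods` is hypothetical.

```shell
# List pods whose phase is Pending in every namespace.
# pending_pods is an illustrative helper name.
pending_pods() {
  kubectl get pods --all-namespaces --field-selector=status.phase=Pending
}
```

For any pod it lists, `kubectl describe pod` shows the scheduling events, which usually state whether insufficient memory is the reason.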
Check the memory usage of all pods in the default namespace
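A possible command for this step, assuming the metrics-server add-on is installed (without it, `kubectl top` returns an error). The helper name `pods_by_memory` is hypothetical, and sorting by memory is an added convenience rather than part of the original step.

```shell
# Show CPU and memory usage for all pods in a namespace
# (default: "default"), sorted by memory usage.
# pods_by_memory is an illustrative helper name.
pods_by_memory() {
  kubectl top pods -n "${1:-default}" --sort-by=memory
}
```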
Check the memory usage of a specific pod in the default namespace
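For a single pod, the same `kubectl top` subcommand accepts a pod name. This sketch assumes metrics-server is available; `pod_memory` is a hypothetical helper that takes the pod name as its first argument.

```shell
# Show current CPU and memory usage for one pod.
# $1 = pod name (required), $2 = namespace (default: "default").
# pod_memory is an illustrative helper name.
pod_memory() {
  kubectl top pod "${1:?usage: pod_memory <pod-name> [namespace]}" -n "${2:-default}"
}
```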
Check the memory usage of all containers in a specific pod in the default namespace
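The `--containers` flag of `kubectl top pod` breaks usage down per container. As above, this is a sketch that assumes metrics-server is installed; `pod_container_memory` is a hypothetical helper name.

```shell
# Show CPU and memory usage for each container in one pod.
# $1 = pod name (required), $2 = namespace (default: "default").
# pod_container_memory is an illustrative helper name.
pod_container_memory() {
  kubectl top pod "${1:?usage: pod_container_memory <pod-name> [namespace]}" \
    -n "${2:-default}" --containers
}
```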
Check the resource requests and limits for all pods in the default namespace
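Requests and limits live in the pod spec, so they can be listed with a custom-columns view. The column names and the helper name `pod_memory_requests_limits` below are illustrative choices; a pod with several containers shows comma-separated values.

```shell
# List the memory request and limit of every container in every pod
# of a namespace (default: "default").
# pod_memory_requests_limits is an illustrative helper name.
pod_memory_requests_limits() {
  kubectl get pods -n "${1:-default}" \
    -o custom-columns='POD:.metadata.name,MEM_REQUEST:.spec.containers[*].resources.requests.memory,MEM_LIMIT:.spec.containers[*].resources.limits.memory'
}
```

A `<none>` entry means the container sets no value for that field.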
Check the resource requests and limits for a specific pod in the default namespace
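For one pod, a JSONPath query can print each container alongside its memory request and limit. This is one possible form; `container_memory_settings` is a hypothetical helper name, and `kubectl describe pod` shows the same fields in a longer report.

```shell
# Print "container<TAB>memory request<TAB>memory limit" for one pod.
# $1 = pod name (required), $2 = namespace (default: "default").
# container_memory_settings is an illustrative helper name.
container_memory_settings() {
  kubectl get pod "${1:?usage: container_memory_settings <pod-name> [namespace]}" \
    -n "${2:-default}" \
    -o jsonpath='{range .spec.containers[*]}{.name}{"\t"}{.resources.requests.memory}{"\t"}{.resources.limits.memory}{"\n"}{end}'
}
```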
Check the current memory usage and available memory for each node in the cluster
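A sketch that pairs live usage with the memory each node can hand out to pods. The first command assumes metrics-server is installed; the column names and the helper name `node_memory` are illustrative.

```shell
# Show live CPU/memory usage per node, then each node's allocatable
# and total memory as reported in its status.
# node_memory is an illustrative helper name.
node_memory() {
  kubectl top nodes
  kubectl get nodes \
    -o custom-columns='NODE:.metadata.name,ALLOCATABLE_MEM:.status.allocatable.memory,CAPACITY_MEM:.status.capacity.memory'
}
```

To see how much of a node's allocatable memory is already claimed by requests, the "Allocated resources" section of `kubectl describe node` reports the totals and percentages.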
Repair
Optimize memory usage of applications running on the cluster by tuning their memory requests and limits.
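As an illustration of tuning, `kubectl set resources` can update the memory request and limit of a Deployment in place. The sketch assumes the workload is a Deployment; the helper name `set_memory_resources` and any values passed to it are hypothetical and should come from the usage observed in the Debug steps.

```shell
# Set the memory request and limit on all containers of a Deployment.
# $1 = deployment name, $2 = memory request (e.g. 256Mi),
# $3 = memory limit (e.g. 512Mi), $4 = namespace (default: "default").
# set_memory_resources is an illustrative helper name.
set_memory_resources() {
  kubectl set resources deployment "${1:?deployment name required}" \
    -n "${4:-default}" \
    --requests="memory=${2:?memory request required}" \
    --limits="memory=${3:?memory limit required}"
}
```

For example, `set_memory_resources my-app 256Mi 512Mi` changes the pod template, which triggers a rolling update of the Deployment. If the resources are managed in version-controlled manifests, make the same change there so it is not overwritten on the next deploy.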
Increase available memory on the Kubernetes cluster by adding more nodes or increasing the memory allocated to existing nodes.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.