This incident type relates to a high urgency issue regarding host context switching. The incident is triggered when the context switching grows on the node beyond a certain level, typically over 10000 per second. This issue can cause performance degradation and impact the stability of the system. The incident requires immediate attention from a software engineer to identify the root cause and take the necessary steps to resolve the issue.
Parameters
Debug
Check the CPU usage of the affected instance
Check the memory usage of the affected instance
Check the number of context switches per second on the affected instance
Check the network connection status of the affected instance
Check the disk usage of the affected instance
Check the logs related to the affected service
Check the status of the affected service
The system may be overloaded with too many requests, causing the CPU to switch between different processes frequently, leading to high context switching rates.
Repair
Consider restarting the affected service or host.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.