This incident type refers to an alert triggered due to high request duration on CoreDNS, which is a DNS server that translates domain names to IP addresses. The alert indicates that the request duration process for CoreDNS is high, meaning that it is taking longer than expected to process DNS requests. This can cause delays or failures in the resolution of domain names, leading to potential service disruptions or outages.
Parameters
Debug
Get the name(s) of the pod(s) running CoreDNS
Check the logs of the CoreDNS pod(s) for errors
Check the CPU and memory usage of the CoreDNS pod(s)
Check the Kubernetes events related to the CoreDNS pod(s)
Check the status of the container(s) in the CoreDNS pod(s)
Check the network latency between the CoreDNS pod(s) and other pods/services
Check the Kubernetes services and endpoints related to CoreDNS
Repair
Scale the CoreDNS deployment to handle the increased load.
Update resources limits for CoreDNS deployment to handle the increased load.
Increase the resources allocated to the affected system to handle the increased load.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.