Runbook

Kubernetes - DNS Resolution Failure

Back to Runbooks

Overview

DNS resolution failure is a common incident type that occurs when services are unable to resolve domain names into IP addresses, leading to communication failure. DNS (Domain Name System) is responsible for translating human-readable domain names into machine-readable IP addresses. When this translation fails, services are unable to communicate with each other, resulting in outages or performance degradation. This incident type can be caused by various factors such as misconfigured DNS settings, DNS server failures, network connectivity issues, or cyber attacks.

Parameters

Debug

Check if the kube-dns service is running

View the logs of the kube-dns pod

Check if the DNS service IP is correctly set

Check if the DNS pod is running and ready

Check if the DNS configuration is correct

Check if the DNS server is reachable from the cluster

Check if the DNS resolution is working for a specific pod

Check the DNS server configuration and ensure that it is configured correctly. Verify that the DNS server is operational and responding to queries.

Repair

Restart the DNS server to clear any cache or temporary issues.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.