Runbook

Jenkins Master Server Failure

Overview

Jenkins is a popular automation server used in software development. The master server (called the controller in current Jenkins releases) is the main node that schedules tasks and distributes them to worker nodes. When the master fails, the automation pipeline stops, which delays software development and deployment. Treat this incident as urgent: restore the Jenkins master server first so that normal operations can resume.

Parameters

Debug

Get a list of all pods in the Jenkins namespace
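
This step can be sketched with `kubectl`, assuming Jenkins runs on Kubernetes in a namespace named `jenkins`. The namespace name and the helper `list_jenkins_pods` are illustrative, not taken from this runbook; set `NAMESPACE` to match your cluster.

```shell
# Assumption: Jenkins lives in a namespace called "jenkins"; override NAMESPACE if not.
NAMESPACE="${NAMESPACE:-jenkins}"

# List every pod in the namespace, including the node and pod IP columns.
list_jenkins_pods() {
  kubectl get pods --namespace "$NAMESPACE" -o wide
}
```

Run `list_jenkins_pods` and look at the STATUS and RESTARTS columns; anything other than `Running` with a stable restart count deserves a closer look.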

Check the status of the Jenkins pod(s)
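
One way to inspect a single pod is `kubectl describe`, which prints the container states, restart counts, and recent events. The sketch below assumes a `jenkins` namespace; the helper name and the pod name in the usage note are hypothetical.

```shell
# Assumption: Jenkins lives in a namespace called "jenkins"; override NAMESPACE if not.
NAMESPACE="${NAMESPACE:-jenkins}"

# Show the detailed state and recent events of one pod.
# $1: pod name, as printed by "kubectl get pods".
describe_jenkins_pod() {
  kubectl describe pod "$1" --namespace "$NAMESPACE"
}
```

For example, `describe_jenkins_pod jenkins-0`. The Events section at the bottom of the output usually shows scheduling, image pull, or probe failures.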

Check the logs of the Jenkins pod(s)
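
A minimal sketch for reading the logs, again assuming a `jenkins` namespace. The helper `jenkins_pod_logs` is illustrative, and the 200-line tail is an arbitrary default. Extra arguments are passed straight through to `kubectl logs`.

```shell
# Assumption: Jenkins lives in a namespace called "jenkins"; override NAMESPACE if not.
NAMESPACE="${NAMESPACE:-jenkins}"

# Print the last 200 log lines of one pod.
# $1: pod name; any further arguments are forwarded to "kubectl logs".
jenkins_pod_logs() {
  pod="$1"
  shift
  kubectl logs "$pod" --namespace "$NAMESPACE" --tail=200 "$@"
}
```

If the container has crashed and restarted, `jenkins_pod_logs jenkins-0 --previous` shows the logs of the terminated instance rather than the fresh one (the pod name here is hypothetical).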

Check the status of the Jenkins deployment
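
The deployment check could look like the sketch below. It assumes the deployment is named `jenkins` in a `jenkins` namespace; both names, the helper, and the 60-second timeout are assumptions to adjust.

```shell
# Assumptions: namespace and deployment are both called "jenkins"; override as needed.
NAMESPACE="${NAMESPACE:-jenkins}"
DEPLOYMENT="${DEPLOYMENT:-jenkins}"

# Show desired vs. ready replicas, then wait up to 60s for the rollout to settle.
check_jenkins_deployment() {
  kubectl get deployment "$DEPLOYMENT" --namespace "$NAMESPACE"
  kubectl rollout status "deployment/$DEPLOYMENT" --namespace "$NAMESPACE" --timeout=60s
}
```

`kubectl rollout status` exits non-zero if the rollout has not completed within the timeout, so the function's exit code can be used in a script.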

Check the replica set(s) of the Jenkins deployment
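
A sketch for listing the replica sets, assuming a `jenkins` namespace that contains little besides Jenkins. The helper name is illustrative.

```shell
# Assumption: Jenkins lives in a namespace called "jenkins"; override NAMESPACE if not.
NAMESPACE="${NAMESPACE:-jenkins}"

# List all replica sets in the namespace with desired, current, and ready counts.
list_jenkins_replicasets() {
  kubectl get replicasets --namespace "$NAMESPACE" -o wide
}
```

A replica set whose READY count stays below DESIRED points to pods that cannot start; older replica sets scaled to zero are normal leftovers of earlier rollouts.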

Check the status of the Jenkins service(s)
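
The service check can be sketched as follows, assuming a `jenkins` namespace. The helper name is illustrative. Listing the endpoints alongside the services shows whether each service actually has ready pods behind it.

```shell
# Assumption: Jenkins lives in a namespace called "jenkins"; override NAMESPACE if not.
NAMESPACE="${NAMESPACE:-jenkins}"

# List the services, then the pod addresses each service currently routes to.
check_jenkins_services() {
  kubectl get services --namespace "$NAMESPACE" -o wide
  kubectl get endpoints --namespace "$NAMESPACE"
}
```

A service with no addresses in the endpoints output has no ready pods matching its selector, which explains connection failures even when the service object itself looks healthy.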

Check the network policies affecting the Jenkins pod(s)
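
A minimal sketch for reviewing network policies, assuming they are defined in the same `jenkins` namespace as the pods. The helper name is illustrative.

```shell
# Assumption: Jenkins lives in a namespace called "jenkins"; override NAMESPACE if not.
NAMESPACE="${NAMESPACE:-jenkins}"

# List the network policies, then print the rules and pod selectors of each one.
check_jenkins_network_policies() {
  kubectl get networkpolicies --namespace "$NAMESPACE"
  kubectl describe networkpolicies --namespace "$NAMESPACE"
}
```

Compare each policy's pod selector with the labels on the Jenkins pod(s) to see which ingress and egress rules apply to them.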

Check the storage volume(s) used by the Jenkins pod(s)
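
The sketch below assumes the Jenkins home directory is backed by a PersistentVolumeClaim in a `jenkins` namespace, which this runbook does not state. The helper name is illustrative.

```shell
# Assumptions: namespace "jenkins", and storage provided through PersistentVolumeClaims.
NAMESPACE="${NAMESPACE:-jenkins}"

# List the claims with their bound volume and capacity, then show their events.
check_jenkins_storage() {
  kubectl get pvc --namespace "$NAMESPACE"
  kubectl describe pvc --namespace "$NAMESPACE"
}
```

A claim stuck in `Pending` will keep the Jenkins pod from starting; the events in the `describe` output normally explain why the volume could not be provisioned or bound.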

Check the resource usage of the Jenkins pod(s)
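
Resource usage can be read with `kubectl top`, which only works when the cluster exposes the metrics API (for example through metrics-server). The sketch assumes a `jenkins` namespace; the helper name is illustrative.

```shell
# Assumptions: namespace "jenkins", and a metrics API available in the cluster.
NAMESPACE="${NAMESPACE:-jenkins}"

# Show current CPU and memory usage for each container in each pod.
jenkins_resource_usage() {
  kubectl top pods --namespace "$NAMESPACE" --containers
}
```

Compare the reported memory with the container's limit from `kubectl describe pod`; a Jenkins container running close to its memory limit is a common cause of repeated restarts.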

Check the node(s) where the Jenkins pod(s) are running
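
A sketch for finding and inspecting the node behind a pod, assuming a `jenkins` namespace. The helper name and the pod name in the usage note are hypothetical.

```shell
# Assumption: Jenkins lives in a namespace called "jenkins"; override NAMESPACE if not.
NAMESPACE="${NAMESPACE:-jenkins}"

# Look up the node a pod is scheduled on, then describe that node.
# $1: pod name.
describe_jenkins_node() {
  node="$(kubectl get pod "$1" --namespace "$NAMESPACE" -o jsonpath='{.spec.nodeName}')"
  if [ -z "$node" ]; then
    echo "pod $1 is not scheduled on any node" >&2
    return 1
  fi
  kubectl describe node "$node"
}
```

For example, `describe_jenkins_node jenkins-0`. In the output, check the Conditions block for memory, disk, or PID pressure and for a `Ready` status other than `True`.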

Repair

Restart the Jenkins server to check whether the failure is transient.
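
On Kubernetes, one way to restart is a rolling restart of the deployment. This is a sketch under assumptions: the deployment and namespace are both named `jenkins`, and the helper name and 5-minute timeout are illustrative.

```shell
# Assumptions: namespace and deployment are both called "jenkins"; override as needed.
NAMESPACE="${NAMESPACE:-jenkins}"
DEPLOYMENT="${DEPLOYMENT:-jenkins}"

# Trigger a rolling restart, then wait up to 5 minutes for the new pod to become ready.
restart_jenkins() {
  kubectl rollout restart "deployment/$DEPLOYMENT" --namespace "$NAMESPACE" &&
    kubectl rollout status "deployment/$DEPLOYMENT" --namespace "$NAMESPACE" --timeout=300s
}
```

A restart aborts any builds that are running on the master at that moment, so capture the pod logs first if the cause of the failure is still unknown.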
