Runbook

Elasticsearch Cluster Health Showing Red

Overview

This incident occurs when an Elasticsearch cluster's health status changes to red, which means at least one primary shard is unassigned and some data is unavailable for search and indexing. Common causes include hardware failures, network issues, and software bugs. A red cluster needs immediate attention to prevent data loss or service disruption. Resolving it requires finding the underlying cause and applying the appropriate fix so that the cluster returns to green.

Debug

Check if the Elasticsearch service is running
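
One way to script this check is to ask the service manager directly. The sketch below assumes a systemd host and a unit named `elasticsearch`; both are assumptions, so adjust them for container or archive installs.

```python
import subprocess


def is_service_active(unit="elasticsearch", run=subprocess.run):
    """Return True if systemd reports the unit as active.

    Assumption: the host uses systemd and the unit is called "elasticsearch".
    The `run` parameter exists only so the call can be stubbed in tests.
    """
    # `systemctl is-active` prints "active" and exits 0 for a running unit.
    result = run(["systemctl", "is-active", unit], capture_output=True, text=True)
    return result.returncode == 0 and result.stdout.strip() == "active"
```

Calling `is_service_active()` on each node shows quickly whether the process itself is down or whether the problem lies inside a running cluster.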

Check Elasticsearch logs for any errors
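
A small filter can pull the error lines out of a node's log. This is a minimal sketch that assumes the plain-text log format, where the level appears in square brackets (for example `[ERROR]`). The path in `LOG_PATH` is hypothetical and depends on the install method and cluster name.

```python
import re

# Hypothetical path: package installs usually log under /var/log/elasticsearch/,
# in a file named after the cluster.
LOG_PATH = "/var/log/elasticsearch/my-cluster.log"

# Matches the bracketed level field; levels may be padded, e.g. "[WARN ]".
LEVEL = re.compile(r"\[(ERROR|FATAL)\s*\]")


def error_lines(lines):
    """Return the lines whose log level is ERROR or FATAL."""
    return [line.rstrip("\n") for line in lines if LEVEL.search(line)]


def scan_log(path=LOG_PATH):
    """Read a log file and return only its ERROR/FATAL lines."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        return error_lines(fh)
```

Stack traces span several lines, so treat the output of `scan_log()` as a list of places to start reading in the full log rather than as the full story.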

Check Elasticsearch cluster health status
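
The cluster health API (`GET /_cluster/health`) reports the status along with node and shard counts. The sketch below assumes the cluster is reachable at `http://localhost:9200` without authentication or TLS, which is an assumption. Secured clusters need credentials and an HTTPS URL.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:9200"  # assumption: local, unauthenticated endpoint


def get_json(path, base_url=BASE_URL, timeout=10):
    """GET a path on the cluster and decode the JSON response."""
    with urlopen(base_url + path, timeout=timeout) as resp:
        return json.load(resp)


def summarize_health(health):
    """Condense a /_cluster/health response into a single line."""
    return (
        f"status={health['status']} "
        f"nodes={health['number_of_nodes']} "
        f"unassigned_shards={health['unassigned_shards']}"
    )
```

Against a live cluster, `print(summarize_health(get_json("/_cluster/health")))` gives a one-line view of how far the cluster is from green.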

Check Elasticsearch cluster nodes
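
A node that has dropped out of the cluster is a common reason for unassigned primaries. As a sketch, the cat nodes API can be compared against the nodes you expect to see. The endpoint URL and the node names passed to `missing_nodes` are assumptions for illustration.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:9200"  # assumption: local, unauthenticated endpoint


def get_cat_nodes(base_url=BASE_URL, timeout=10):
    """Fetch the node list as JSON rows with a few useful columns."""
    url = base_url + "/_cat/nodes?format=json&h=name,ip,node.role,master"
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def missing_nodes(expected_names, cat_nodes):
    """Return expected node names that are absent from the cat nodes output."""
    present = {row["name"] for row in cat_nodes}
    return sorted(set(expected_names) - present)
```

For example, `missing_nodes(["es-1", "es-2", "es-3"], get_cat_nodes())` uses hypothetical node names and lists any of them that are no longer part of the cluster.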

Check Elasticsearch cluster indices
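
To narrow the problem down to specific indices, the cat indices API reports a health value per index. The following sketch lists the red ones, again assuming an unauthenticated endpoint at `http://localhost:9200`.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:9200"  # assumption: local, unauthenticated endpoint


def get_cat_indices(base_url=BASE_URL, timeout=10):
    """Fetch index name, health, and open/close status as JSON rows."""
    url = base_url + "/_cat/indices?format=json&h=index,health,status"
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def red_indices(rows):
    """Return the names of indices whose health is red, sorted."""
    return sorted(row["index"] for row in rows if row.get("health") == "red")
```

`red_indices(get_cat_indices())` returns the indices that have at least one unassigned primary shard, which is where the rest of the investigation should focus.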

Check Elasticsearch cluster shard allocation
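
The cat shards API shows the state of every shard and, for unassigned ones, a reason code. The sketch below filters for unassigned shards, assuming the same local, unauthenticated endpoint as a placeholder. For a detailed explanation of a single shard, `GET /_cluster/allocation/explain` is the usual follow-up.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:9200"  # assumption: local, unauthenticated endpoint


def get_cat_shards(base_url=BASE_URL, timeout=10):
    """Fetch per-shard state, including the unassigned reason column."""
    url = (base_url + "/_cat/shards?format=json"
           "&h=index,shard,prirep,state,unassigned.reason")
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def unassigned_shards(rows):
    """Return (index, shard, prirep, reason) for each UNASSIGNED shard."""
    return [
        (r["index"], r["shard"], r["prirep"], r.get("unassigned.reason"))
        for r in rows
        if r["state"] == "UNASSIGNED"
    ]
```

In the result, a `prirep` value of `p` marks a primary shard. Unassigned primaries are what turn the cluster red, while unassigned replicas alone only turn it yellow.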

Check Elasticsearch cluster disk usage
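
Full disks stop shards from being allocated, so per-node disk usage is worth checking early. This sketch reads the cat allocation API and flags nodes at or above a threshold. The default of 85 mirrors Elasticsearch's default low disk watermark. The endpoint URL is an assumption.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:9200"  # assumption: local, unauthenticated endpoint


def get_cat_allocation(base_url=BASE_URL, timeout=10):
    """Fetch per-node disk usage as JSON rows."""
    url = base_url + "/_cat/allocation?format=json&h=node,disk.percent"
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def nodes_over(rows, threshold=85):
    """Return (node, percent) for nodes whose disk usage is >= threshold."""
    hot = []
    for row in rows:
        pct = row.get("disk.percent")
        # Rows without disk data (such as the UNASSIGNED row) have no percent.
        if pct is not None and int(pct) >= threshold:
            hot.append((row["node"], int(pct)))
    return hot
```

If `nodes_over(get_cat_allocation())` returns any nodes, freeing disk space or adding capacity is a likely prerequisite for shards to be assigned again.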

Check Elasticsearch cluster stats
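
The cluster stats API (`GET /_cluster/stats`) aggregates resource usage across nodes. As one illustrative use, the sketch below computes cluster-wide JVM heap usage from that response. The endpoint URL is an assumption, and heap is only one of several figures worth reviewing.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:9200"  # assumption: local, unauthenticated endpoint


def get_cluster_stats(base_url=BASE_URL, timeout=10):
    """Fetch the aggregated cluster statistics."""
    with urlopen(base_url + "/_cluster/stats", timeout=timeout) as resp:
        return json.load(resp)


def heap_used_percent(stats):
    """Return JVM heap usage across all nodes as a percentage."""
    mem = stats["nodes"]["jvm"]["mem"]
    return round(100 * mem["heap_used_in_bytes"] / mem["heap_max_in_bytes"], 1)
```

A consistently high value from `heap_used_percent(get_cluster_stats())` suggests memory pressure, which can cause nodes to become unresponsive and drop out of the cluster.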

Check Elasticsearch cluster settings
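
Cluster settings can block allocation on purpose, for example when `cluster.routing.allocation.enable` was set to `none` during maintenance and never reverted. The sketch below lists explicit allocation-related overrides from `GET /_cluster/settings`. The endpoint URL is an assumption.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:9200"  # assumption: local, unauthenticated endpoint


def get_cluster_settings(base_url=BASE_URL, timeout=10):
    """Fetch explicitly set cluster settings with flattened keys."""
    url = base_url + "/_cluster/settings?flat_settings=true"
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def allocation_overrides(settings):
    """Return allocation-related settings from both setting scopes."""
    found = {}
    for scope in ("persistent", "transient"):
        for key, value in settings.get(scope, {}).items():
            if key.startswith("cluster.routing.allocation."):
                found[f"{scope}:{key}"] = value
    return found
```

An empty result from `allocation_overrides(get_cluster_settings())` means no allocation setting has been overridden at the cluster level, so the cause is more likely a node, disk, or index-level issue.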

Review the Elasticsearch logs for errors that could explain the red health status. Once the cause is identified, fix it or escalate to the appropriate team member.
