Runbook

High Replication Factor Issues in Cassandra

Back to Runbooks

Overview

This incident type refers to issues that occur in Cassandra due to a high replication factor. Replication factor is a setting that determines how many copies of data are stored in a cluster. A high replication factor means that more copies of data are stored in the cluster, which can cause performance issues and potentially lead to data inconsistencies. When this incident occurs, it may affect the availability and stability of the Cassandra cluster, impacting the ability to read and write data.

Parameters

Debug

Connect to a Cassandra node in the cluster

Check the replication factor for a given keyspace and table

Check the number of replicas for a given keyspace and table

Check the status of the nodes in the cluster

Check the replication factor for a given keyspace and table on all nodes in the cluster

Check the status of the nodes in the cluster to identify any nodes that are down or experiencing issues

Check the replication factor for a given keyspace and table on all nodes in the cluster to identify any inconsistencies

Check the read/write latency for the cluster

Repair

Reduce the replication factor: One possible remediation for this incident is to reduce the replication factor. This can be done by adjusting the configuration settings in Cassandra to store fewer copies of data. However, this may impact data availability and durability, so it's important to carefully consider the trade-offs before making changes.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.