This incident type occurs when there are many pending compaction tasks in a Cassandra cluster. Compaction is the process of merging and removing data from SSTables (sorted string tables) in Cassandra. When compaction tasks are pending, it means that the process is not completing in a timely manner and is causing performance issues in the cluster. This incident requires investigation and resolution to ensure the cluster's health and performance.
Parameters
Debug
Check the status of the Cassandra service
Check the number of pending compaction tasks in Cassandra
Check the health of the Cassandra cluster
Check the Cassandra logs for errors or warnings
Check the system load and CPU usage
Check the disk usage and available space
Check the network connectivity and latency
Check the firewall rules and open ports
Insufficient resources such as CPU, memory, or disk space, causing compaction to slow down or stop.
Repair
Set variables
Check if cassandra config file exists
Backup cassandra config file
Update concurrent_compactors property
Restart cassandra service
Next Step
Reduce the size of SSTables in the cluster by increasing the frequency of compaction or by manually triggering it. This will help reduce the number of pending tasks and improve performance.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.