Runbook

Host RAID Disk Failure

Back to Runbooks

Overview

Host RAID Disk Failure is an incident type that occurs when at least one device in a RAID array fails. This can result in a loss of data or service interruption. The incident requires attention and possibly a disk swap to restore the RAID array and prevent further failures.

Parameters

Debug

Check if the system recognizes the RAID array

Check the disks status in the RAID array

Check the SMART status of all disks in the RAID array

Check the logs for disk errors

Check the system logs for disk errors

Check the RAID events log

Check the disk health using S.M.A.R.T

Check the filesystem consistency on the RAID array

Check the RAID array integrity and rebuild status

Check the partitions on the RAID array

Repair

Replace the failed disk in the RAID array and rebuild the array.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.