Host RAID Disk Failure is an incident type that occurs when at least one device in a RAID array fails. This can result in a loss of data or service interruption. The incident requires attention and possibly a disk swap to restore the RAID array and prevent further failures.
Parameters
Debug
Check if the system recognizes the RAID array
Check the disks status in the RAID array
Check the SMART status of all disks in the RAID array
Check the logs for disk errors
Check the system logs for disk errors
Check the RAID events log
Check the disk health using S.M.A.R.T
Check the filesystem consistency on the RAID array
Check the RAID array integrity and rebuild status
Check the partitions on the RAID array
Repair
Replace the failed disk in the RAID array and rebuild the array.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.