Multiple drive failure after adding disk shelf
Applies to
- AFF A400
- DS224C disk shelf
Issue
[Node-01: disk_server_0: shm.threshold.consecutiveTimeouts:error]: shm: Disk 0d.13.18 has exceeded the threshold of 11 consecutive timeouts; the system will fail the disk if possible.
[Node-01: disk_server_1: shm.threshold.consecutiveTimeouts:error]: shm: Disk 0d.13.5 has exceeded the threshold of 11 consecutive timeouts; the system will fail the disk if possible.
[Node-01: disk_server_0: shm.threshold.consecutiveTimeouts:error]: shm: Disk 0d.13.19 has exceeded the threshold of 11 consecutive timeouts; the system will fail the disk if possible.
[Node-01: disk_server_0: shm.threshold.consecutiveTimeouts:error]: shm: Disk 0d.13.4 has exceeded the threshold of 11 consecutive timeouts; the system will fail the disk if possible.
[Node-01: disk_server_1: shm.threshold.consecutiveTimeouts:error]: shm: Disk 0d.13.23 has exceeded the threshold of 11 consecutive timeouts; the system will fail the disk if possible.
[Node-01: disk_server_1: shm.threshold.consecutiveTimeouts:error]: shm: Disk 0d.13.22 has exceeded the threshold of 11 consecutive timeouts; the system will fail the disk if possible.
Shelf name: 0d.shelf13
Shelf id: 13
Channel: 0d
Module: B
Shelf UID: 50:0a:09:80:0e:c9:e1:c2
Shelf S/N: XXXXXXXXXXXX
Term switch: N/A
Shelf state: ONLINE
Module state: OK
Partial Path Link Invalid Running Loss Phy CRC Phy
Disk Port Timeout Rate DWord Disparity Dword Reset Error Change
Id State Value (ms) (Gb/s) Count Count Count Problem Count Count
--------------------------------------------------------------------------------------------------
[ 0 ] OK 0 12.0 0 0 0 0 0 1
[ 1 ] OK 0 12.0 0 0 0 0 0 3
[ 2 ] OK 0 12.0 0 0 0 0 0 3
[ 3 ] OK 0 12.0 0 0 0 0 0 1
[ 4 ] OK 0 12.0 0 0 0 0 0 3
[ 5 ] OK 0 12.0 0 0 0 0 0 3
[...]
[ 18 ] OK 0 12.0 0 0 0 0 0 3
[ 19 ] OK 0 12.0 0 0 0 0 0 3
[ 20 ] OK 0 12.0 0 0 0 0 0 3
[ 21 ] OK 0 12.0 0 0 0 0 0 3
[ 22 ] OK 0 12.0 0 0 0 0 0 5
[ 23 ] OK 0 12.0 0 0 0 0 0 3