NS224 Shelf disks missing from Metrocluster IP
Applies to
- 4 Node MCC-IP
- NS224 Shelf
Issue
- Both clusters in MCC-IP trigger autosupport alerts indicating:
- HA Group Notification (DISK REDUNDANCY FAILED) ERROR
- HA Group Notification (SYNCMIRROR PLEX FAILED) ALERT
- HA Group Notification (FILESYSTEM DISK NOT RESPONDING) ERROR
- HA Group Notification (Health Monitor process schm: RaidDegradedMirrorAggrAlert[4c95d23d-1cad-49e3-8b37-0531b7b4a6e6]) ALERT
- In EMS logs multiple error messages against multiple disks for the same shelf are observed:
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.18L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.6L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.23L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.7L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Disk device e3a.21.0.7L0: Check Condition: CDB 0xe2:01:0100000000000000:000000400000: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(8285).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.mcc.lunmgr.io.error:debug]: Disk device S/N XXXXXXXXXXXX - CDB 0xe2:01:0100000000000000:000000400000 - (scsi error: command aborted) - Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(DT 8285). (HA status 0x0) - (out_status_flags 0x24)
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.9L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.19L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.15L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command - (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.pastTimeToLive:error]: Disk device e3b.21.1.15L0: request failed after try #1: cdb 0xe2:01:0100000000000000:000000400000.
- System Storage Configuration transition from Quad-Path to either Mixed-Path or Multi-Path in all four nodes of the MCC
- Shelf containing the affected disks is not visible in SYSCONFIG-A
- Both sides of the MCC report missing disks:
Main Cluster
RAID group /aggr1_CLUSTERA01_75_TB/plex0/rg1 (partial)
RAID Disk Device HA SHELF BAY CHAN Pool Type RPM Used (MB/blks) Phys (MB/blks)
--------- ------ ------------- ---- ---- ---- ----- -------------- --------------
dparity FAILED N/A 1831170/ -
parity FAILED N/A 1831170/ -
data FAILED N/A 1831170/ -
data FAILED N/A 1831170/ -
data FAILED N/A 1831170/ -
Raid group is missing 5 disks.
DR Cluster
RAID group /aggr1_CLUSTERB02_75_TB/plex1/rg1 (partial)
RAID Disk Device HA SHELF BAY CHAN Pool Type RPM Used (MB/blks) Phys (MB/blks)
--------- ------ ------------- ---- ---- ---- ----- -------------- --------------
dparity FAILED N/A 1831170/ -
parity FAILED N/A 1831170/ -
data FAILED N/A 1831170/ -
data FAILED N/A 1831170/ -
data FAILED N/A 1831170/ -
Raid group is missing 5 disks.