NS224 Shelf disks missing from Metrocluster IP
Applies to
- 4 Node MCC-IP
 - NS224 Shelf
 
Issue
- Both clusters in MCC-IP trigger autosupport alerts indicating:
    
- HA Group Notification (DISK REDUNDANCY FAILED) ERROR
 - HA Group Notification (SYNCMIRROR PLEX FAILED) ALERT
 - HA Group Notification (FILESYSTEM DISK NOT RESPONDING) ERROR
 - HA Group Notification (Health Monitor process schm: RaidDegradedMirrorAggrAlert[4c95d23d-1cad-49e3-8b37-0531b7b4a6e6]) ALERT
 
 - In EMS logs multiple error messages against multiple disks for the same shelf are observed:
 
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.18L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.6L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.23L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.7L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Disk device e3a.21.0.7L0: Check Condition: CDB 0xe2:01:0100000000000000:000000400000: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8285).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.mcc.lunmgr.io.error:debug]: Disk device S/N XXXXXXXXXXXX - CDB 0xe2:01:0100000000000000:000000400000 - (scsi error: command aborted) - Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(DT 8285). (HA status 0x0) - (out_status_flags 0x24)
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.9L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.19L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.15L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.pastTimeToLive:error]: Disk device e3b.21.1.15L0: request failed after try #1: cdb 0xe2:01:0100000000000000:000000400000.- System Storage Configuration transition from Quad-Path to either Mixed-Path or Multi-Path in all four nodes of the MCC
 - Shelf containing the affected disks is not visible in SYSCONFIG-A
 - Both sides of the MCC report missing disks:
 
Main Cluster
    RAID group /aggr1_CLUSTERA01_75_TB/plex0/rg1 (partial)
      RAID Disk    Device         HA  SHELF BAY CHAN Pool Type  RPM  Used (MB/blks)    Phys (MB/blks)
      ---------    ------         ------------- ---- ---- ---- ----- --------------    --------------
      dparity    FAILED             N/A                        1831170/ -
      parity    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      Raid group is missing 5 disks.
DR Cluster
    RAID group /aggr1_CLUSTERB02_75_TB/plex1/rg1 (partial)
      RAID Disk    Device         HA  SHELF BAY CHAN Pool Type  RPM  Used (MB/blks)    Phys (MB/blks)
      ---------    ------         ------------- ---- ---- ---- ----- --------------    --------------
      dparity    FAILED             N/A                        1831170/ -
      parity    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      Raid group is missing 5 disks.
