NS224 shelf disks missing from MetroCluster IP

Applies to

4-node MetroCluster IP (MCC-IP)
NS224 Shelf

Issue

Both clusters in the MCC-IP configuration trigger AutoSupport alerts indicating:
- HA Group Notification (DISK REDUNDANCY FAILED) ERROR
- HA Group Notification (SYNCMIRROR PLEX FAILED) ALERT
- HA Group Notification (FILESYSTEM DISK NOT RESPONDING) ERROR
- HA Group Notification (Health Monitor process schm: RaidDegradedMirrorAggrAlert[4c95d23d-1cad-49e3-8b37-0531b7b4a6e6]) ALERT
The EMS logs show multiple errors against multiple disks in the same shelf:

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.18L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.6L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.23L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.7L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Disk device e3a.21.0.7L0: Check Condition: CDB 0xe2:01:0100000000000000:000000400000: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8285).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.mcc.lunmgr.io.error:debug]: Disk device S/N XXXXXXXXXXXX - CDB 0xe2:01:0100000000000000:000000400000 - (scsi error: command aborted) - Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(DT 8285). (HA status 0x0) - (out_status_flags 0x24)

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.9L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.19L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.15L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).

[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.pastTimeToLive:error]: Disk device e3b.21.1.15L0: request failed after try #1: cdb 0xe2:01:0100000000000000:000000400000.
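
When many of these events arrive at once, it can help to reduce them to the set of distinct devices involved. The following is a minimal Python sketch, not a NetApp tool: it assumes the EMS lines have been saved as plain text in the format shown above, and the function name summarize_devices is hypothetical. It groups the device paths by their first dot-separated field (e3a or e3b in the excerpt above); what that field represents on a given system should be confirmed separately.

```python
import re
from collections import Counter

# Matches the device token in scsi.cmd.* EMS lines, for example
# "Unknown device e3b.21.1.18L9998:" or "Disk device e3a.21.0.7L0:".
# Group 1 is the path before the trailing "L<number>" suffix.
DEVICE_RE = re.compile(r"\b(?:Unknown|Disk) device (\S+?)L(\d+):")


def summarize_devices(ems_lines):
    """Count distinct device paths per first dot-separated field.

    Lines that do not carry a device path in this form (for example the
    "Disk device S/N ..." debug line) are ignored.
    """
    seen = set()
    for line in ems_lines:
        match = DEVICE_RE.search(line)
        if match:
            seen.add(match.group(1))
    # Grouping key is an assumption made for illustration only.
    return Counter(path.split(".", 1)[0] for path in seen)


if __name__ == "__main__":
    sample = [
        "[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: "
        "Unknown device e3b.21.1.18L9998: Check Condition: CDB 0x12",
        "[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: "
        "Unknown device e3a.21.0.7L9998: Check Condition: CDB 0x12",
        "[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: "
        "Disk device e3a.21.0.7L0: Check Condition: CDB 0xe2",
    ]
    for prefix, count in sorted(summarize_devices(sample).items()):
        print(prefix, count)
```

A device that appears under both the L9998 and L0 suffixes is counted once, so the totals reflect distinct paths rather than the number of log lines.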

The System Storage Configuration transitions from Quad-Path to either Mixed-Path or Multi-Path on all four nodes of the MCC.
The shelf containing the affected disks is not visible in SYSCONFIG-A.
Both sides of the MCC report missing disks:

Main Cluster

RAID group /aggr1_CLUSTERA01_75_TB/plex0/rg1 (partial)

RAID Disk Device HA  SHELF BAY CHAN Pool Type RPM   Used (MB/blks) Phys (MB/blks)
--------- ------ ------------- ---- ---- ---- ----- -------------- --------------
dparity   FAILED N/A                                1831170/ -
parity    FAILED N/A                                1831170/ -
data      FAILED N/A                                1831170/ -
data      FAILED N/A                                1831170/ -
data      FAILED N/A                                1831170/ -

Raid group is missing 5 disks.
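
To compare the two sides of the MCC quickly, the number of FAILED members can be checked against the "missing" count that the RAID group reports. The following is a minimal Python sketch under the assumption that the RAID group status has been captured as plain text like the excerpt above; the function name count_failed_members is hypothetical, and the patterns only cover the row types shown here (dparity, parity, data).

```python
import re


def count_failed_members(raid_text):
    """Return (failed_rows, reported_missing) for one RAID group excerpt.

    failed_rows counts dparity/parity/data entries marked FAILED.
    reported_missing is the number from the "Raid group is missing N disks."
    line, or None if that line is absent.
    """
    failed_rows = len(
        re.findall(r"\b(?:dparity|parity|data)\s+FAILED\b", raid_text)
    )
    match = re.search(r"Raid group is missing (\d+) disks?\.", raid_text)
    reported_missing = int(match.group(1)) if match else None
    return failed_rows, reported_missing


if __name__ == "__main__":
    excerpt = (
        "dparity FAILED N/A 1831170/ -\n"
        "parity FAILED N/A 1831170/ -\n"
        "data FAILED N/A 1831170/ -\n"
        "Raid group is missing 3 disks.\n"
    )
    failed, reported = count_failed_members(excerpt)
    print("failed rows:", failed, "reported missing:", reported)
```

If the two numbers disagree, the captured text is probably truncated or covers more than one RAID group, so the excerpt should be rechecked before drawing conclusions.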

DR Cluster

RAID group /aggr1_CLUSTERB02_75_TB/plex1/rg1 (partial)