Skip to main content
NetApp Knowledge Base

NS224 Shelf disks missing from Metrocluster IP

Views:
41
Visibility:
Public
Votes:
0
Category:
disk-shelves
Specialty:
HW
Last Updated:

Applies to

  • 4 Node MCC-IP
  • NS224 Shelf

Issue

  • Both clusters in MCC-IP trigger autosupport alerts indicating:
    • HA Group Notification (DISK REDUNDANCY FAILED) ERROR
    • HA Group Notification (SYNCMIRROR PLEX FAILED) ALERT
    • HA Group Notification (FILESYSTEM DISK NOT RESPONDING) ERROR
    • HA Group Notification (Health Monitor process schm: RaidDegradedMirrorAggrAlert[4c95d23d-1cad-49e3-8b37-0531b7b4a6e6]) ALERT
  • In EMS logs multiple error messages against multiple disks for the same shelf are observed:
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.18L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.6L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.23L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.7L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Disk device e3a.21.0.7L0: Check Condition: CDB 0xe2:01:0100000000000000:000000400000: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8285).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.mcc.lunmgr.io.error:debug]: Disk device S/N XXXXXXXXXXXX - CDB 0xe2:01:0100000000000000:000000400000 - (scsi error: command aborted) - Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(DT 8285). (HA status 0x0) - (out_status_flags 0x24)
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.9L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3a.21.0.19L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(9000).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Unknown device e3b.21.1.15L9998: Check Condition: CDB 0x12: Sense Data SCSI:aborted command -  (0xb - 0x90 0x2 0xfc)(8999).
[CLUSTER-A01: scsi_cmdblk_strthr_admin: scsi.cmd.pastTimeToLive:error]: Disk device e3b.21.1.15L0: request failed after try #1: cdb 0xe2:01:0100000000000000:000000400000.
  • System Storage Configuration transition from Quad-Path to either Mixed-Path or Multi-Path in all four nodes of the MCC
  • Shelf containing the affected disks is not visible in SYSCONFIG-A
  • Both sides of the MCC report missing disks:

Main Cluster

    RAID group /aggr1_CLUSTERA01_75_TB/plex0/rg1 (partial)

      RAID Disk    Device         HA  SHELF BAY CHAN Pool Type  RPM  Used (MB/blks)    Phys (MB/blks)
      ---------    ------         ------------- ---- ---- ---- ----- --------------    --------------
      dparity    FAILED             N/A                        1831170/ -
      parity    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      Raid group is missing 5 disks.

DR Cluster

    RAID group /aggr1_CLUSTERB02_75_TB/plex1/rg1 (partial)

      RAID Disk    Device         HA  SHELF BAY CHAN Pool Type  RPM  Used (MB/blks)    Phys (MB/blks)
      ---------    ------         ------------- ---- ---- ---- ----- --------------    --------------
      dparity    FAILED             N/A                        1831170/ -
      parity    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      data    FAILED             N/A                        1831170/ -
      Raid group is missing 5 disks.

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.