Aggregate offline after takeover due to missing disk drives X357_SLNGE3T8ATE
Applies to
- Disks X357_SLNGE3T8ATE running firmware NA54 and below
- ONTAP 9
Issue
Aggregate offline following a takeover event
EMS
log reports multiple disks failing with the following error:
scsi.cmd.pastTimeToLive:error]: Disk device 0c.11.17: request failed after try #1: cdb 0x5e:01.
disk.readReservationFailed:error]: Disk read reservation failed on 0c.11.17 CDB 0x5e:01 - SCSI:aborted command (b 2f 10)
diskown.errorDuringIO:error]: error 20 (disk operation timed out) on disk 0c.11.17 (S/N XXXXXXXX) while reading reservation state
raid.notify.on.non.persistent.failure:debug]: Received SDM_NOTIFY_ON_NON_PERSISTENT_FAILURE for disk uid, originating sysid XXXXXXXXX.
raid.notify.on.non.persistent.failure.fatal:debug]: Failing disk due to SDM_NOTIFY_ON_NON_PERSISTENT_FAILURE for disk 0c.11.17, error 22, reason 21, originating sysid XXXXXXXXX, owner 3.
raid.config.disk.not.responding:notice]: Disk 0c.11.15 Shelf 11 Bay 15 [NETAPP X357_SLNGE3T8ATE NA54] S/N [XXXXXXXX] is not responding.
- Aggregate fails to come online
raid.stripe.replay.volume.offline:notice]: Aggregate partner: aggr_2 is not online.
raid.assim.rg.missingChild:debug]: Aggregate partner:aggr_2, rgobj_verify: RAID object 0 has only 16 valid children, expected 23.
raid.assim.plex.missingChild:debug]: Aggregate partner:aggr_2, plexobj_verify: Plex 0 only has 0 working RAID groups (1 total) and is being taken offline
raid.assim.mirror.noChild:debug]: Aggregate partner:aggr_2, mirrorobj_verify: No operable plexes found.
SYSCONFIG-R
section
*** This system has failed.
Any adapters shown below are those of the live partner
Aggregate aggr_2 (failed, raid_dp, partial, fast zeroed) (block checksums)
Plex /aggr_2/plex0 (offline, failed, inactive)
RAID group /aggr_2/plex0/rg0 (partial, block checksums)
RAID Disk Device HA SHELF BAY CHAN Pool Type RPM Used (MB/blks) Phys (MB/blks)
--------- ------ ------------- ---- ---- ---- ----- -------------- --------------
dparity 0c.11.12 0c 11 12 SA:B 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
parity 0b.12.12 0b 12 12 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data FAILED N/A 3662580/ -
data FAILED N/A 3662580/ -
data FAILED N/A 3662580/ -
data FAILED N/A 3662580/ -
data FAILED N/A 3662580/ -
data 0b.12.15 0b 12 15 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0c.11.16 0c 11 16 SA:B 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0b.12.16 0b 12 16 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data FAILED N/A 3662580/ -
data 0b.12.17 0b 12 17 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0c.11.18 0c 11 18 SA:B 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0b.12.18 0b 12 18 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0c.11.19 0c 11 19 SA:B 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0b.12.19 0b 12 19 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0c.11.20 0c 11 20 SA:B 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0b.12.20 0b 12 20 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0c.11.21 0c 11 21 SA:B 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0b.12.21 0b 12 21 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0c.11.22 0c 11 22 SA:B 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data 0b.12.22 0b 12 22 SA:A 0 SSD N/A 3662580/7500964352 3662830/7501476528 (fast zeroed)
data FAILED N/A 3662580/ -
Raid group is missing 7 disks.