Stalled aggregate mount and possible software panic during ONTAP update
Applies to
- ONTAP 9
- Panic
Issue
- Stalled aggregate mount and possible software panic during takeover as part of ONTAP update
Cluster1::storage failover> aggr show -state !online
Aggregate Size Available Used% State #Vols Nodes RAID Status
--------- -------- --------- ----- ------- ------ ---------------- ------------
aggr1 336.1TB 325.1TB 3% mounting 0 Cluster1_A raid_tec,normal
- ONTAP update encounters an error with
takeover failed:
Cluster1::> cluster image show-update-progress
Estimated Elapsed
Update Phase Status Duration Duration
-------------------- ----------------- ------------------------------
Pre-update checks completed 00:10:00 00:01:14
ONTAP updates paused-on-error 01:32:00 00:10:05
Details:
Node name Status Status Description
-------------------- ----------------- --------------------------------------
Cluster1_A waiting
Cluster1_B failed Error: Takeover failed.
Action: Use the "storage failover show-takeover" command to view the
cause of takeover failure and the suggested corrective actions.
When all issues are resolved, use the "cluster image resume-update" command
- A panic may be observed:
Mon May 15 05:01:39 -0700 [Cluster1_B: cf_main: sk.panic:alert]: Panic String: Failover Monitor: unable to transit - takeover process is hung (wafl) in SK process cf_main on release 9.10.1P8 (C)