METROCLUSTER HEAL_ROOT FAILED - AutoSupport message
Applies to
- MetroCluster
- ONTAP 9
- Heal root aggregates failed
-
callhome.mcc.heal_root.failed:alert
Event Summary
This message occurs when metrocluster heal root-aggregates operation has failed. The failure could be partial or complete.
Validate
Event Log
event log show -event *heal_root*
Command Line:
Run the following command on the cluster that attempted to perform the healing operation.
metrocluster operation show
Operation: heal-root-aggregates
State: completed-with-warnings
Errors: ClusterA (warning): Some aggregates are still resyncing. Run "storage aggregate show-resync-status -in-progress true" to verify that the resync is complete for all aggregates, and then run "metrocluster switchback".
Node: ClusterA-02
Start Time: mm/dd/yyyy hh:mm:ss
End Time: mm/dd/yyyy hh:mm:ss
Job ID: 6135
Operation UUID: e202288d-859b-11ec-88f1-00a098cbca9e
Command Line: metrocluster heal -phase root-aggregates
Additional Info: -
Resolution
- Review the output of metrocluster operation show and follow the correction actions mentioned in the error section before trying the heal command again.
- If the instructions provided by the command do not resolve the issue, contact NetApp Technical Support.
Additional Information
Note: The metrocluster operation show output will only report on the last metrocluster operation. If a subsequent metrocluster operation has run since the root-aggregate healing failure the output will not be relevant to the failure.
To get further details on a previous metrocluster operation:
- Run the metrocluster operation history show command to see all previous metrocluster operations:
metrocluster operation history show -fields operation,start-time,state,operation-uuid
- From the output, identify the operation-uuid of the failed or partially failed operation:
ClusterA::> metrocluster operation history show -fields operation,start-time,state,operation-uuid
operation-uuid operation start-time state
------------------------------------ ------------------- ------------------ ----------
b6fe95a4-dc12-11ec-881e-00a0985bd305 switchover-simulate mm/dd/yyyy hh:mm:ss successful
bffaf8b3-b9a4-11ec-881e-00a0985bd305 switchover-simulate mm/dd/yyyy hh:mm:ss successful
e202288d-859b-11ec-88f1-00a098cbca9e heal-root-aggregates mm/dd/yyyy hh:mm:ss failed
- Run the metrocluster operation history show command with the identified operation uuid:
metrocluster operation history show -operation-uuid <operation-uuid>
ClusterA::> met op hist show -operation-uuid e202288d-859b-11ec-88f1-00a098cbca9e
(metrocluster operation history show)
Operation: heal-root-aggregates
State: completed-with-warnings
Errors: ClusterA (warning): Some aggregates are still resyncing. Run "storage aggregate show-resync-status -in-progress true" to verify that the resync is complete for all aggregates, and then run "metrocluster switchback".
Node: ClusterA-02
Start Time: mm/dd/yyyy hh:mm:ss
End Time: mm/dd/yyyy hh:mm:ss
Job ID: 6135
Operation UUID: e202288d-859b-11ec-88f1-00a098cbca9e
Command Line: metrocluster heal -phase root-aggregates
Additional Info: -