METROCLUSTER HEAL_AGGR FAILED - AutoSupport Message
Applies to
- MetroCluster
- ONTAP 9
- Heal Aggregates failed
callhome.mcc.heal_aggr.failed:alert
Event Summary
This message occurs when a MetroCluster heal-aggregates operation fails. The failure can be partial or complete.
Validate
Event Log:
event log show -event *heal*
Command Line:
Run the following command on the cluster that attempted to perform the healing operation.
metrocluster operation show
Operation: heal-aggregates
State: failed
Errors: Failed to validate the node and cluster components before the heal-aggregates operation.
ClusterA::ClusterA-01 (non-overridable veto): Remote site storage is not accessible. Verify connectivity to the remote site -- make sure that Storage and RDMA Interconnect links are up.
Node: ClusterA-02
Start Time: mm/dd/yyyy hh:mm:ss
End Time: mm/dd/yyyy hh:mm:ss
Job ID: 26153
Operation UUID: 00de12c1-ee15-11eb-a80f-00a0985bd745
Command Line: metrocluster heal -phase aggregates
Additional Info: -
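When this check is scripted, the saved command output can be parsed before deciding what to do next. The following is a minimal sketch only, assuming the output has already been captured as plain text (for example, from an SSH session log). The function names, the KNOWN_FIELDS list, and the SAMPLE text are illustrative assumptions based on the example output above; they are not part of ONTAP.

```python
# Hypothetical helper: parse captured "metrocluster operation show" text.
# Field names are taken from the example output in this article.
KNOWN_FIELDS = (
    "Operation", "State", "Errors", "Node", "Start Time", "End Time",
    "Job ID", "Operation UUID", "Command Line", "Additional Info",
)

SAMPLE = """\
Operation: heal-aggregates
State: failed
Errors: Failed to validate the node and cluster components before the heal-aggregates operation.
ClusterA::ClusterA-01 (non-overridable veto): Remote site storage is not accessible.
Node: ClusterA-02
Job ID: 26153
"""


def parse_operation_show(text):
    """Return a dict of field name -> value.

    Lines that do not start with a known field name are treated as a
    continuation of the previous field (the Errors text can span lines
    and can itself contain colons).
    """
    fields = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        name, sep, value = line.partition(":")
        if sep and name.strip() in KNOWN_FIELDS:
            current = name.strip()
            fields[current] = value.strip()
        elif current is not None:
            fields[current] = (fields[current] + " " + line).strip()
    return fields


def heal_aggregates_failed(fields):
    """True when the parsed output describes a failed heal-aggregates run."""
    return (fields.get("Operation") == "heal-aggregates"
            and fields.get("State") == "failed")


fields = parse_operation_show(SAMPLE)
if heal_aggregates_failed(fields):
    print("Heal failed on node", fields.get("Node"))
    print("Reported error:", fields.get("Errors"))
```

Matching against a fixed list of field names, rather than splitting every line on the first colon, keeps the veto line (which begins with the cluster name and contains colons) attached to the Errors field.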
Resolution
- Review the output of metrocluster operation show and follow the corrective actions listed in the Errors field before running the heal command again.
- If the instructions provided by the command do not resolve the issue, contact NetApp Technical Support.
Additional Information
Note: The metrocluster operation show output reports only the most recent metrocluster operation. If another metrocluster operation has run since the aggregate healing failure, the output no longer describes that failure.
To get further details on a previous metrocluster operation:
- Run the metrocluster operation history show command to see all previous metrocluster operations:
metrocluster operation history show -fields operation,start-time,state,operation-uuid
- From the output, identify the operation-uuid of the failed or partially failed operation:
ClusterA::> metrocluster operation history show -fields operation,start-time,state,operation-uuid
operation-uuid                       operation           start-time         state
------------------------------------ ------------------- ------------------ ----------
b6fe95a4-dc12-11ec-881e-00a0985bd305 switchover-simulate mm/dd/yyyy hh:mm:ss successful
bffaf8b3-b9a4-11ec-881e-00a0985bd305 switchover-simulate mm/dd/yyyy hh:mm:ss successful
00de12c1-ee15-11eb-a80f-00a0985bd745 heal-aggregates     mm/dd/yyyy hh:mm:ss failed
- Run the metrocluster operation history show command with the identified operation-uuid:
metrocluster operation history show -operation-uuid <operation-uuid>
ClusterA::> metrocluster operation history show -operation-uuid 00de12c1-ee15-11eb-a80f-00a0985bd745
Operation: heal-aggregates
State: failed
Errors: Failed to validate the node and cluster components before the heal-aggregates operation.
ClusterA::ClusterA-01 (non-overridable veto): Remote site storage is not accessible. Verify connectivity to the remote site -- make sure that Storage and RDMA Interconnect links are up.
Node: ClusterA-02
Start Time: mm/dd/yyyy hh:mm:ss
End Time: mm/dd/yyyy hh:mm:ss
Job ID: 26153
Operation UUID: 00de12c1-ee15-11eb-a80f-00a0985bd745
Command Line: metrocluster heal -phase aggregates
Additional Info: -
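The lookup steps above can also be sketched as a small script that scans saved history output for operations that did not succeed and prints the follow-up command for each one. This is an illustrative sketch only: it assumes the history was captured with the -fields order shown above (operation-uuid, operation, start-time, state) and that start-time is printed as two whitespace-separated tokens (date and time). The function names and the SAMPLE text (including its dates) are hypothetical.

```python
import re

# A row is recognised by a leading UUID; header, separator, and prompt
# lines do not match and are skipped.
UUID_RE = re.compile(
    r"^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$"
)

SAMPLE = """\
operation-uuid                       operation           start-time         state
------------------------------------ ------------------- ------------------ ----------
b6fe95a4-dc12-11ec-881e-00a0985bd305 switchover-simulate 01/02/2022 10:00:00 successful
00de12c1-ee15-11eb-a80f-00a0985bd745 heal-aggregates     01/01/2022 09:00:00 failed
"""


def parse_history(text):
    """Parse captured history output into a list of row dicts."""
    rows = []
    for line in text.splitlines():
        tokens = line.split()
        if len(tokens) < 5 or not UUID_RE.match(tokens[0]):
            continue
        rows.append({
            "operation-uuid": tokens[0],
            "operation": tokens[1],
            # Assumption: date and time are the next two tokens.
            "start-time": " ".join(tokens[2:4]),
            # Anything left is the state, which may be more than one word.
            "state": " ".join(tokens[4:]),
        })
    return rows


def not_successful(rows):
    """Rows whose state is anything other than 'successful'."""
    return [row for row in rows if row["state"] != "successful"]


def detail_command(row):
    """Build the detail command shown in this article for one row."""
    return ("metrocluster operation history show -operation-uuid "
            + row["operation-uuid"])


for row in not_successful(parse_history(SAMPLE)):
    print(row["operation"], row["state"])
    print(detail_command(row))
```

Filtering on "not successful" rather than on the literal word "failed" is a deliberate choice here so that partially failed operations are not missed; adjust the filter if other states should be ignored.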