Cluster ports down leads to unexpected reboot
Applies to
- AFF A400
- ONTAP 9
- Switchless cluster
Issue
- EMS log shows cluster ports down
[Node1: kernel: netif.linkDown:info]: Ethernet e3a: Link down, check cable.
[Node1: vifmgr: vifmgr.portdown:notice]: A link down event was received on node Node1, port e3a.
[Node1: vifmgr: vifmgr.clus.linkdown:EMERGENCY]: The cluster port e3a on node Node1 has gone down unexpectedly.
[Node2: kernel: netif.linkDown:info]: Ethernet e3b: Link down, check cable.
[Node2: vifmgr: vifmgr.portdown:notice]: A link down event was received on node Node2, port e3b.
[Node2: vifmgr: vifmgr.clus.linkdown:EMERGENCY]: The cluster port e3b on node Node2 has gone down unexpectedly.
- Takeover occurs due to missed heartbeat
[Node1: cf_main: cf.fsm.takeoverCountdown:info]: Failover monitor: takeover scheduled in 10 seconds
[Node1: cf_main: cf.fsm.takeover.noHeartbeat:alert]: Failover monitor: Takeover initiated after no heartbeat was detected from the partner node.
[Node1: cf_main: cf.fsm.stateTransit:info]: Failover monitor: UP --> TAKEOVER
[Node1: cf_takeover: ha.takeover.stateChng:debug]: params: {'old_state': 'NOT_IN_TAKEOVER', 'new_state': 'IN_CFO_TAKEOVER'}
[Node1: cf_takeover: cf.fm.takeoverStarted:notice]: Failover monitor: takeover started
- Motherboard does not boot after a physical reseat
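
When reviewing a saved copy of the EMS log, it can help to filter for the two events that mark this issue: the cluster port link-down event and the missed-heartbeat takeover. The following is a minimal sketch, not a NetApp tool. It assumes the log lines use the bracketed format shown in the excerpts above, and the names parse_ems_line and find_events are hypothetical helpers introduced only for illustration.

```python
import re

# Assumed line format, inferred from the excerpts in this article:
# [<node>: <process>: <event.name>:<severity>]: <message>
EMS_LINE = re.compile(
    r"^\[(?P<node>[^:\]]+):\s*(?P<process>[^:\]]+):\s*"
    r"(?P<event>[\w.]+):(?P<severity>\w+)\]:\s*(?P<message>.*)$"
)

# Event names copied from the excerpts in this article.
EVENTS_OF_INTEREST = {
    "vifmgr.clus.linkdown",
    "cf.fsm.takeover.noHeartbeat",
}


def parse_ems_line(line):
    """Split one EMS line into its fields, or return None if it does not match."""
    match = EMS_LINE.match(line.strip())
    return match.groupdict() if match else None


def find_events(lines, events=EVENTS_OF_INTEREST):
    """Return the parsed lines whose event name is in `events`, in input order."""
    hits = []
    for line in lines:
        parsed = parse_ems_line(line)
        if parsed and parsed["event"] in events:
            hits.append(parsed)
    return hits


if __name__ == "__main__":
    sample = [
        "[Node1: kernel: netif.linkDown:info]: Ethernet e3a: Link down, check cable.",
        "[Node1: vifmgr: vifmgr.clus.linkdown:EMERGENCY]: The cluster port e3a on node Node1 has gone down unexpectedly.",
        "[Node1: cf_main: cf.fsm.takeover.noHeartbeat:alert]: Failover monitor: Takeover initiated after no heartbeat was detected from the partner node.",
    ]
    for hit in find_events(sample):
        print(hit["node"], hit["event"], hit["severity"])
```

Lines that do not follow the assumed format are skipped rather than raising an error, so the helper can be run over a mixed log file.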