Skip to main content
NetApp Knowledge Base

Unexpected A700s reboot

Views:
137
Visibility:
Public
Votes:
0
Category:
aff-series
Specialty:
hw
Last Updated:

Applies to

  • A700s
  • BMC 1.81

Issue

  • Unexpected node reboot:

Tue Oct 27 17:15:35 +0200 [node_name_1: wafl_exempt08: wafl.vol.snap_create.done:info]: params: {'vol': 'vm_tfs_01', 'app': '', 'volident': '@vserver:34386606-fd18-11e6-aab3-00a098ae1e68', 'run_time': '504638', 'owner': '', 'type': 'Volume'}
Tue Oct 27 17:17:00 +0200 [node_name_1: ifconfig: netif.linkUp:info]: Ethernet e0M: Link up.

  • The Service Processor resets the suspicious node and partner takes over:.

Wed May 26 07:37:17 -0700 [node_name_2: cf_hwassist: cf.hwassist.takeoverTrapRecv:notice]: hw_assist: Received takeover hw_assist alert from partner(node_name_1), system_down because reset_via_sp.
Wed May 26 07:37:17 -0700 [node_name_2: cf_hwassist: cf.hwassist.takeoverTrapRecv:notice]: hw_assist: Received takeover hw_assist alert from partner(node_name_1), system_down because l2_watchdog_reset.

Tue Oct 27 17:15:50 +0200 [node_name_2: swi1: mri_ha: nvmm.mirror.aborting:debug]: mirror of sysid 1, partner_type HA Partner and mirror state MIRROR_ONLINE is aborted because of reason Abort Pending.
Tue Oct 27 17:15:50 +0200 [node_name_2: gop_eq_thread: ic.linkStatusChange:info]: HA interconnect: Port ic1a link is down.
Tue Oct 27 17:15:50 +0200 [node_name_2: cf_fastTimeout: cf.ic.heartBeatFailed:error]: HA interconnect: Heartbeat failed.
Tue Oct 27 17:15:50 +0200 [node_name_2: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of node_name_2 by node_name_1 disabled (unsynchronized log).
Tue Oct 27 17:15:50 +0200 [node_name_2: rastrace_dump: rastrace.dump.saved:debug]: A RAS trace dump for module IC instance 0 was stored in /etc/log/rastrace/IC_0_20201027_17:15:50:245981.dmp.
Tue Oct 27 17:15:50 +0200 [node_name_2: ctrl_hb_port_ic1a: ctrl.rdma.heartBeat:info]: HA interconnect: Missed heartbeat to 192.0.1.4.
Tue Oct 27 17:15:51 +0200 [node_name_2: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of node_name_2 by node_name_1 disabled (HA interconnect error. Verify that the partner node is running and that the HA interconnect cabling is correct, if applicable. For further assistance, contact technical support).

  • The suspicious node reboots and works fine after the giveback.

 

CUSTOMER EXCLUSIVE CONTENT

Registered NetApp customers get unlimited access to our dynamic Knowledge Base.

New authoritative content is published and updated each day by our team of experts.

Current Customer or Partner?

Sign In for unlimited access

New to NetApp?

Learn more about our award-winning Support