Repeated cf.hwassist.missedKeepAlive errors
Applies to
- AFF A220 / AFF A150 / AFF C190 / FAS2750 / FAS2720
- Cluster\Node managment network
Issue
- Continuous HW Assist keep alive errors
- Those cleared after a few minutes, in both nodes of an HA pair.
EMS.Log
Example:
14:03:59 +0100 [node_name01: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name02).
14:11:29 +0100 [node_name01: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name02).
14:24:59 +0100 [node_name01: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name02).
14:32:29 +0100 [node_name01: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name02).
15:28:00 +0100 [node_name01: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name02).
15:35:30 +0100 [node_name01: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name02).
13:34:02 +0100 [node_name02: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name01).
13:41:32 +0100 [node_name02: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name01).
13:55:02 +0100 [node_name02: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name01).
14:02:32 +0100 [node_name02: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name01).
14:16:02 +0100 [node_name02: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name01).
14:23:32 +0100 [node_name02: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name01).
- The errors follow a timing pattern.
Example: every 7:30 minutes.
- Command
storage failover hwassist show
outputs shows the following error:
cluster::> storage failover hwassist show
Node
-----------------
node-01
Partner: node-02
Hwassist Enabled: true
Hwassist IP: 10.XX.XX.X
Hwassist Port: 4444
Monitor Status: active
Inactive Reason: -
Corrective Action: -
Keep-Alive Status: Error: did not receive hwassist keep alive alerts from partner.
node-02
Partner: node-01
Hwassist Enabled: true
Hwassist IP: 10.XX.XX.X
Hwassist Port: 4444
Monitor Status: active
Inactive Reason: -
Corrective Action: -
Keep-Alive Status: Error: did not receive hwassist keep alive alerts from partner.
2 entries were displayed.
- Command
storage failover hwassist test
outputs shows timed out error:
cluster::> storage failover hwassist test -node *
Info: No response from partner(node-01).Timed out.
Info: No response from partner(node-02).Timed out.