ems.engine.event.undefinedEvent for ClusterIfInErrorsWarn_Alert and Threshold_crossed
Applies to
- ALL FAS/AFF platforms
- Nvidia MSN2100 cluster switch
Issue
- Receive the following error message ems logs
[node-01: notifyd: ems.engine.event.undefinedEvent:notice]: params: {'requestedEventName': 'hm.alert.raised', 'requestedEventParams': 'Param1:Alert Id = ClusterIfInErrorsWarn_Alert , Alerting Resource = switch-01/Intel Corporation Ethernet Connection I354, Param2:ethernet-switch, Param3:ClusterIfInErrorsWarn_Alert, Param4:switch-01/Intel Corporation Ethernet Connection I354, Param5:Threshold_crossed, Param6:1) Migrate any cluster LIF that uses this connection to another port connected to a cluster switch. For example, if cluster LIF "clus1" is on port e0a and the other LIF is on e0b, run the following command to move "clus1" to e0b:"network interface migrate -vserver vs1 -lif clus1 -destination-node node1 -destination-port e0b". 2) Replace the network cable with a known-good cable. If errors are corrected, stop. No further action is required. Otherwise, continue to Step 3. 3) Move the network cable to another port on the node (if available). Migrate the cluster LIF to the new port. If errors are corrected, contact technical support to troubleshoot the original node port. Otherwise, continue to Step 4. 4) Move the network cable to another available cluster switch port. Migrate the cluster LIF back to the original port. If errors are corrected, contact technical support to troubleshoot the original switch port. If errors persist, contact technical support for further assistance. , Param7:Communication between nodes in the cluster might be degraded., Param8:"Switch Name: switch-01" "Switch Show Model: X190006-PE" "Switch Serial Number: XXXXXXXXXXXXXX" "Switch Type: cluster-network" "Interface name: Intel Corporation Ethernet Connection I354" "Input Error Count: 17624" "Input Unicast Packet Count: 621902891" "Input Non-Unicast Packet Count: 5090500" "Input Error Percentage: 0" "Input Error % (Conv int): 18" "Input Error % (decimal): 0.1852587005425433" "Input Warning Count: 2" "Switch Analytics Resource Status: ok" , Param9:, Param10:0, Param11:, Param12:0'}
- The alert then gets cleared
[Node-01: cshmd: hm.alert.cleared:notice]: Alert Id = ClusterIfInErrorsWarn_Alert , Alerting Resource = Switch-01/Intel Corporation Ethernet Connection I354 cleared by monitor ethernet-switch