ClusterIfInErrorsWarn_Alert seen on system health alert
Applies to
- ONTAP 9
- FAS/AFF system
- Nexus 9k switch
Issue
- System health alert reports "ClusterIfInErrorsWarn_Alert" on one of the switch port
Cluster::> system health alert show
Node: node1
Alert ID: ClusterIfInErrorsWarn_Alert
Resource: Ethernet1/10
Severity: Major
Indication Time: Fri Apr 19 10:31:56 2024
Suppress: false
Acknowledge: false
Probable Cause: The percentage of inbound packet errors of switch
interface "switch1(Fxxxxxxxx7)/Ethernet1/10"
is above the warning threshold.
Possible Effect: Communication between nodes in the cluster might be
degraded.
- On storage we could see the network port flap from ems
Thu May 02 12:16:08 -0500 [node1: kernel: netif.linkDown:info]: Ethernet e0c: Link down, check cable.
Thu May 02 12:16:08 -0500 [node1: vifmgr: vifmgr.portdown:notice]: A link down event was received on node node1 , port e0c.
Thu May 02 12:16:08 -0500 [node1: vifmgr: vifmgr.clus.linkdown:EMERGENCY]: The cluster port e0c on node node1 has gone down unexpectedly.
Thu May 02 12:16:41 -0500 [node1: kernel: netif.linkUp:info]: Ethernet e0c: Link up.
Thu May 02 12:16:41 -0500 [node1: vifmgr: vifmgr.portup:notice]: A link up event was received on node node1, port e0c.
- Respective switch port log shows multiple CRC alerts
Ethernet1/10 is up
admin state is up, Dedicated Interface
Hardware: 1000/10000/25000/40000/50000/100000 Ethernet, address: 9yyy.xxxx.yyyc (bia 9yyy.xxxx.yyyc)
MTU 9216 bytes, BW 40000000 Kbit , DLY 10 usec
RX
12047766326 unicast packets 810877 multicast packets 619482 broadcast packets
977845000102 input packets 1977749702303210 bytes
7187530114 jumbo packets 0 storm suppression bytes
0 runts 15622 giants 10223919 CRC 0 no buffer
10239541 input error 0 short frame 0 overrun 0 underrun 0 ignored
0 watchdog 0 bad etype drop 0 bad proto drop 0 if down drop
0 input with dribble 0 input discard
0 Rx pause