Storage node suddenly stops sending LACP packets causing intermittent volumesDegraded and block service alerts
Applies to
- NetApp HCI H610S
- SolidFire AFA SF4805
- Network Switch configured in LACP slow
Issue
-
Following intermittent alerts seen in NetApp SolidFire Active IQ:
volumesDegraded
Details : Local replication and synchronization of the following volumes is in progress. Progress of the synchronization can be seen in the Running Tasks window. [VolumeID]
A bulk volume service is not responding.
- No signs of faulty hardware or issues with the nic drivers
- Happens on different models of storage nodes with different nic vendors
- From switch logs, it shows the ports gut suspended due to no LACP received
2023 Nov 13 01:09:36 NODE1-CUST-vDC %ETH_PORT_CHANNEL-5-PORT_SUSPENDED: Ethernet3/7: Ethernet3/7 is suspended by protocol, no LACP PDUs received
2023 Nov 13 01:11:17 NODE1-CUST-vDC %ETH_PORT_CHANNEL-5-PORT_SUSPENDED: Ethernet3/7: Ethernet3/7 is suspended by protocol, no LACP PDUs received
2023 Nov 13 01:14:37 NODE1-CUST-vDC %ETH_PORT_CHANNEL-5-PORT_SUSPENDED: Ethernet3/7: Ethernet3/7 is suspended by protocol, no LACP PDUs received
2023 Nov 13 01:15:07 NODE1-CUST-vDC %ETH_PORT_CHANNEL-5-PORT_SUSPENDED: Ethernet3/7: Ethernet3/7 is suspended by protocol, no LACP PDUs received