Errors Observed on Interface Reported by Multiple Storage Nodes
- Views:
- 31
- Visibility:
- Public
- Votes:
- 0
- Category:
- element-software
- Specialty:
- solidfire
- Last Updated:
- 2/5/2025, 9:50:14 AM
Applies to
NetApp Element Software 12.5 and later.
Issue
Multiple storage nodes are showing below message in event log and not reporting errors in unresolved state.
Errors observed on interface ethx
If the reported nodes are receiving RX errors as shown below on the specified ethx interface
2021-06-09 04:05, RX errors 3529 dropped 9825690 overruns 0 frame 3529, RX errors 2980 dropped 9238332 overruns 0 frame 2980
2021-06-09 06:05, RX errors 3529 dropped 9825932 overruns 0 frame 3529, RX errors 2980 dropped 9238601 overruns 0 frame 2980
2021-06-09 08:05, RX errors 3529 dropped 9826174 overruns 0 frame 3529, RX errors 2980 dropped 9238859 overruns 0 frame 2980
2021-06-09 10:05, RX errors 3529 dropped 9826416 overruns 0 frame 3529, RX errors 2980 dropped 9239101 overruns 0 frame 2980
2021-06-09 12:05, RX errors 3529 dropped 9826665 overruns 0 frame 3529, RX errors 2980 dropped 9239343 overruns 0 frame 2980
2021-06-09 14:05, RX errors 3529 dropped 9826956 overruns 0 frame 3529, RX errors 2980 dropped 9239585 overruns 0 frame 2980
Below troubleshooting steps can be tried
- Note the total number of nodes connected to the same network.
- Identify which nodes are experiencing the issue and which are not. Validate this through Active IQ event messages.
- All storage nodes (eth0 and eth1 interfaces) are connected to two switches: all eth0 interfaces connect to switch A, and all eth1 interfaces connect to switch B.
- For all storage nodes not showing the 'Errors observed on interface ethx'(RX errors in the node log), check which switch port they are connected to. Review the switch logs for any CRC/TX/RX errors on the switch interface connected to the storage node that is not reporting an issue.
- If you see CRC/TX/RX errors in the switch log for the same interface, swap the eth0 interface cable with eth1 on the same storage node. Check if the switch log shows errors on the same switch or a different switch to isolate the faulty hardware (which could be a faulty SFP or cable).