snapmirror failing after Ontap upgrade as one of the port is not reachable
Applies to
- ONTAP 9
- SnapMirror
Issue
- Snapmirror failing for volumes residing on one nodes with below error.
Failed to create snapshot snapmirror.e180b5b4-896a-11e8-a2cf-00a098d93948_2162351280.2025-03-31_123500 on volume SVM:vol. (CSM: Connection aborted.)- Cluster peer is in partial state and unreachable for the problematic node.
source::> cluster peer showPeer Cluster Name Cluster Serial Number Availability Authentication------------------------- --------------------- -------------- --------------destination 1-80-0xxxx Partial ok- All the cluster ports hosting IC LIF are admin/ operationally "UP"
- l2 ping on the port to confirm if the issue is at l2 or l3 network and found no response on the same.
- Capture the Packet trace to confirm there are no responses received from switch for the ARP request sent.
28 2025-04-01 10:35:59.261455 NetApp_d9:2d:1e Broadcast ARP Who has 20.xx.xx.53? Tell 20.xx.xx.5429 2025-04-01 10:36:00.262531 NetApp_d9:2d:1e Broadcast ARP Who has 20.xx.xx.53? Tell 20.xx.xx.5430 2025-04-01 10:36:01.263414 NetApp_d9:2d:1e Broadcast ARP Who has 20.xx.xx.53? Tell 20.xx.xx.5431 2025-04-01 10:36:02.264488 NetApp_d9:2d:1e Broadcast ARP Who has 20.xx.xx.53? Tell 20.xx.xx.5432 2025-04-01 10:36:03.265425 NetApp_d9:2d:1e Broadcast ARP Who has 20.xx.xx.53? Tell 20.xx.xx.5433 2025-04-01 10:36:04.266758 NetApp_d9:2d:1e Broadcast ARP Who has 20.xx.xx.53? Tell 20.xx.xx.54- The port status showed there are lot of discards reported on the port at transmit which indicates the switch is not accepting the packets.
- interface e0c (153 days, 0 hours, 40 minutes, 11 seconds) --RECEIVETotal discards: 19069 | Queue overflow: 19069 | Multi/broadcast: 661kCollisions: 0 | Xon: 0 | Xoff: 0Jumbo: 39271m | Cfg Up to Downs: 0 | TSO non-TCP drop: 0- Network team validated port Configuration and found no issues, however confirmed no packets are received at the switch port.
