Skip to main content
NetApp Knowledge Base

HA interconnect: Connection for 'cfo_rv' failed with high CPU utilization from HostOS

Views:
904
Visibility:
Public
Votes:
0
Category:
not set
Specialty:
Perf
Last Updated:

Applies to

  • FAS25xx
  • Cluster Peering Encryption
  • ONTAP 9.6 or later

Issue

  • HA link flapping with unsynchronized log in several times within one day:

Tue May 17 01:51:41  0900 [Nodename: raidio_thread: nvmm.mirror.aborting:debug]: mirror of sysid 1, partner_type HA Partner and mirror state MIRROR_ONLINE is aborted because of reason Abort Pending.
Tue May 17 01:51:41  0900 [Nodename: nvram_sync: nvmm.mirror.offlined:debug]: params: {'mirror': 'HA Partner Mirror Offlined'}
Tue May 17 01:51:41  0900 [Nodename: rendezvous_proc: cf.rv.notConnected:alert]: HA interconnect: Connection for 'cfo_rv' failed.
Tue May 17 01:51:41  0900 [Nodename: nic_mgr: cf.nm.nicViError:info]: HA interconnect: NIC 0 has an error on RAID VI (virtual interface #9): SEND_DESC_ERROR 12 2.
Tue May 17 01:51:41  0900 [Nodename: nic_mgr: cf.nm.nicReset:notice]: HA interconnect: Initiating soft reset on card 0 due to rendezvous reset.
Tue May 17 01:51:41  0900 [Nodename: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of Nodename by ae0000-vpnas1y disabled (HA interconnect error. Verify that the partner node is running and that the HA interconnect cabling is correct, if applicable. For further assistance, contact technical support).
Tue May 17 01:51:44  0900 [Nodename: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of Nodename by ae0000-vpnas1y disabled (NVRAM size mismatch).
Tue May 17 01:51:50  0900 [Nodename: cf_main: cf.fsm.takeoverOfPartnerDisabled:error]: Failover monitor: takeover of ae0000-vpnas1y disabled (unsynchronized log).

  • Observed high CPU util from HostOS is over 50%, which is highly coincident with the time of link flapping:

Cluster: Cluster_name (xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx)
    Node: node_name (yyyyyyyy-yyyy-yyyy-yyyy-yyyyyyyyyyyy)
    Time Range: 2022-05-16 16:00:01.000 00:00 - 22:04:01.473 00:00 GMT

time interval                         process        process
                                      instance       pct_cpu
                                                       (%)
------------------------------------  -------------  -------
2022-05-16 16:45:01 - 16:50:02 00:00  CSM BTLS, 282    55.44
2022-05-16 16:50:02 - 16:55:02 00:00  CSM BTLS, 282    52.50

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.