Skip to main content

Coming soon...New Support-Specific categorization of Knowledge Articles in the NetApp Knowledge Base site to improve navigation, searchability and your self-service journey.

NetApp Knowledge Base

HA interconnect: Connection for 'cfo_rv' failed with high CPU utilization from HostOS

Views:
320
Visibility:
Public
Votes:
0
Category:
not set
Specialty:
not set
Last Updated:

Applies to

  • FAS25xx
  • Cluster Peering Encryption
  • ONTAP 9.6 or later

Issue

  • HA link flapping with unsynchronized log in several times within one day:

Tue May 17 01:51:41  0900 [Nodename: raidio_thread: nvmm.mirror.aborting:debug]: mirror of sysid 1, partner_type HA Partner and mirror state MIRROR_ONLINE is aborted because of reason Abort Pending.
Tue May 17 01:51:41  0900 [Nodename: nvram_sync: nvmm.mirror.offlined:debug]: params: {'mirror': 'HA Partner Mirror Offlined'}
Tue May 17 01:51:41  0900 [Nodename: rendezvous_proc: cf.rv.notConnected:alert]: HA interconnect: Connection for 'cfo_rv' failed.
Tue May 17 01:51:41  0900 [Nodename: nic_mgr: cf.nm.nicViError:info]: HA interconnect: NIC 0 has an error on RAID VI (virtual interface #9): SEND_DESC_ERROR 12 2.
Tue May 17 01:51:41  0900 [Nodename: nic_mgr: cf.nm.nicReset:notice]: HA interconnect: Initiating soft reset on card 0 due to rendezvous reset.
Tue May 17 01:51:41  0900 [Nodename: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of Nodename by ae0000-vpnas1y disabled (HA interconnect error. Verify that the partner node is running and that the HA interconnect cabling is correct, if applicable. For further assistance, contact technical support).
Tue May 17 01:51:44  0900 [Nodename: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of Nodename by ae0000-vpnas1y disabled (NVRAM size mismatch).
Tue May 17 01:51:50  0900 [Nodename: cf_main: cf.fsm.takeoverOfPartnerDisabled:error]: Failover monitor: takeover of ae0000-vpnas1y disabled (unsynchronized log).

  • Observed high CPU util from HostOS is over 50%, which is highly coincident with the time of link flapping:

Cluster: Cluster_name (xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx)
    Node: node_name (yyyyyyyy-yyyy-yyyy-yyyy-yyyyyyyyyyyy)
    Time Range: 2022-05-16 16:00:01.000 00:00 - 22:04:01.473 00:00 GMT

time interval                         process        process
                                      instance       pct_cpu
                                                       (%)
------------------------------------  -------------  -------
2022-05-16 16:45:01 - 16:50:02 00:00  CSM BTLS, 282    55.44
2022-05-16 16:50:02 - 16:55:02 00:00  CSM BTLS, 282    52.50

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

Scan to view the article on your device

 

  • Was this article helpful?