NetApp Knowledge Base

CONTAP-647685: Node reboots unexpectedly due to zombie process accumulation exhausting the BSD process limit

Category: ontap-9
Specialty: core

Issue

  • Repeated invocations of "network ping", through either the CLI or the REST API, cause zombie processes to accumulate.
  • When the number of zombie processes reaches the BSD process limit, user space processes can no longer spawn new threads, which eventually triggers a node reboot.
  • Examples of the user space core events and the eventual reboot initiated by the node watchdog process:

Sat Feb 14 16:32:14 +0800 [cluster-02: vifmgr: ucore.panicString:error]: 'vifmgr: Call to pthread_create() failed with error: Cannot allocate memory, raising SIGABRT(6) at RIP 0x80936cd0a (pid 6922, uid 0, timestamp 1771057934)'
Sat Feb 14 17:25:00 +0800 [cluster-02: cphmd: ucore.panicString:error]: 'cphmd: Call to pthread_create() failed with error: Cannot allocate memory, raising SIGABRT(6) at RIP 0x807a74d0a (pid 55070, uid 0, timestamp 1771061101)'
...
Sat Feb 14 17:24:24 +0800 [cluster-02: nodewatchdog: nodewatchdog.node.panic:alert]: Data ONTAP has experienced a serious internal error: Process vifmgr unresponsive for 163 seconds. This might cause the node experiencing the problem to become unresponsive to data access. The node has been panicked to prevent this condition from continuing.
Sat Feb 14 17:24:24 +0800 [cluster-02: nodewatchdog: sk.panic:alert]: Panic String: Process vifmgr unresponsive for 163 seconds in process nodewatchdog on release 9.12.1P18 (C)
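
To check an exported EMS log for the same pattern, the events above can be matched with a short script. The following is a minimal sketch, not a NetApp tool: the function name summarize and the regular expression are assumptions written against the line format of the excerpts above, and the sample input is copied from those excerpts.

```python
import re
from collections import Counter

# Assumed line layout, inferred from the excerpts in this article:
#   <timestamp> [<node>: <process>: <event.name>:<severity>]: <message>
EMS_LINE = re.compile(
    r"\[(?P<node>[^:\]]+): (?P<process>[^:\]]+): "
    r"(?P<event>[\w.]+):(?P<severity>\w+)\]: (?P<message>.*)"
)


def summarize(lines):
    """Count pthread_create() failures per process and collect watchdog panics.

    Hypothetical helper for illustration only.
    """
    thread_failures = Counter()
    watchdog_panics = []
    for line in lines:
        match = EMS_LINE.search(line)
        if match is None:
            continue  # not an EMS event line in the assumed layout
        event = match["event"]
        message = match["message"]
        if event == "ucore.panicString" and "pthread_create() failed" in message:
            thread_failures[match["process"]] += 1
        elif event == "nodewatchdog.node.panic":
            watchdog_panics.append(message)
    return thread_failures, watchdog_panics


# Sample input taken from the log excerpts in this article.
sample = [
    "Sat Feb 14 16:32:14 +0800 [cluster-02: vifmgr: ucore.panicString:error]: "
    "'vifmgr: Call to pthread_create() failed with error: Cannot allocate memory, "
    "raising SIGABRT(6) at RIP 0x80936cd0a (pid 6922, uid 0, timestamp 1771057934)'",
    "Sat Feb 14 17:25:00 +0800 [cluster-02: cphmd: ucore.panicString:error]: "
    "'cphmd: Call to pthread_create() failed with error: Cannot allocate memory, "
    "raising SIGABRT(6) at RIP 0x807a74d0a (pid 55070, uid 0, timestamp 1771061101)'",
    "Sat Feb 14 17:24:24 +0800 [cluster-02: nodewatchdog: nodewatchdog.node.panic:alert]: "
    "Data ONTAP has experienced a serious internal error: "
    "Process vifmgr unresponsive for 163 seconds.",
]

failures, panics = summarize(sample)
for process, count in sorted(failures.items()):
    print(f"{process}: {count} pthread_create() failure(s)")
print(f"watchdog panics: {len(panics)}")
```

Several distinct user space processes reporting pthread_create() failures shortly before a nodewatchdog panic is consistent with the sequence described in this article, but it is not proof of this specific issue on its own.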
