Mgwd panic causing "vifmgr.reach.err" on all the ports - CONTROLLER TAKEOVER ON PANIC
Applies to
- ONTAP 9
- Userspace core (ucore) panic
Issue
- The HA partner takes over because a userspace application (a management component) crashes:
[node02: cf_main: callhome.reboot.takeover.p:notice]: Call home for PARTNER REBOOT (CONTROLLER TAKEOVER ON PANIC)
- mgwd restarts with the panic message below, causing vifmgr.reach.err events on all of the ports on the node:
Panic Message : mgwd: assertion (ret != FALSE) at src/libnetapp/na_zapi.c:1039 failed, raising SIGABRT(6) at RIP 0x81d6a5d9a (pid 2046, uid 0, timestamp 1677865827)
- EMS log entries from the affected node (mgwd, vifmgr, and spmd):
[node01: mgwd: ucore.panicString:error]: 'mgwd: assertion (ret != FALSE) at src/libnetapp/na_zapi.c:1039 failed, raising SIGABRT(6) at RIP 0x81d6a5d9a (pid 2046, uid 0, timestamp 1677865827)'
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0a on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0a-10 on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0a-3 on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0b on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0b-105 on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0b-107 on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
[node01: spmd: spm.mgwd.process.exit:EMERGENCY]: Management Gateway (mgwd) subsystem with ID 2046 exited as a result of signal Virtual size limit exceeded. The subsystem will attempt to restart.
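
When triaging a log bundle, it can help to confirm quickly which ports logged vifmgr.reach.err and which assertion mgwd hit. The following is a minimal Python sketch, not a NetApp tool: the function name summarize, the regular expressions, and the SAMPLE text are illustrative assumptions based only on the log lines shown above. It also assumes the "timestamp" field in the panic string is Unix epoch seconds.

```python
import re
from collections import defaultdict
from datetime import datetime, timezone

# Assumed line layout, inferred from the excerpt above:
#   [node: process: event.name:severity]: message
EMS_LINE = re.compile(
    r"^\[(?P<node>[^:\]]+): (?P<proc>[^:\]]+): "
    r"(?P<event>[^:\]]+):(?P<severity>[^:\]]+)\]: (?P<msg>.*)$"
)
PORT = re.compile(r"Network port (?P<port>\S+) on node")
PANIC = re.compile(
    r"assertion \((?P<expr>.+?)\) at (?P<file>\S+):(?P<line>\d+) failed, "
    r"raising (?P<signal>\w+)\(\d+\) at RIP \S+ "
    r"\(pid (?P<pid>\d+), uid \d+, timestamp (?P<ts>\d+)\)"
)


def summarize(log_text):
    """Return (ports_by_node, panics) extracted from EMS-style log text."""
    ports = defaultdict(list)
    panics = []
    for raw in log_text.splitlines():
        m = EMS_LINE.match(raw.strip())
        if not m:
            continue  # skip lines that are not in the assumed format
        if m["event"] == "vifmgr.reach.err":
            p = PORT.search(m["msg"])
            if p:
                ports[m["node"]].append(p["port"])
        elif m["event"] == "ucore.panicString":
            p = PANIC.search(m["msg"])
            if p:
                # Assumption: the timestamp is Unix epoch seconds.
                when = datetime.fromtimestamp(int(p["ts"]), tz=timezone.utc)
                panics.append({
                    "node": m["node"],
                    "assertion": p["expr"],
                    "file": p["file"],
                    "line": int(p["line"]),
                    "signal": p["signal"],
                    "pid": int(p["pid"]),
                    "time_utc": when.isoformat(),
                })
    return dict(ports), panics


# Three lines copied from the excerpt above, used as sample input.
SAMPLE = """
[node01: mgwd: ucore.panicString:error]: 'mgwd: assertion (ret != FALSE) at src/libnetapp/na_zapi.c:1039 failed, raising SIGABRT(6) at RIP 0x81d6a5d9a (pid 2046, uid 0, timestamp 1677865827)'
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0a on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
[node01: vifmgr: vifmgr.reach.err:error]: Network port a0b-105 on node node01 encountered errors while scanning for reachability. Reachability was detected to the following broadcast domains: -.
"""

if __name__ == "__main__":
    ports_by_node, panic_list = summarize(SAMPLE)
    print(ports_by_node)
    print(panic_list)
```

The pid extracted from the panic string (2046) can be compared with the subsystem ID in the spm.mgwd.process.exit event to confirm that both entries refer to the same mgwd process.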
