CHW-1433: AFF-A800 disruption caused by watchdog nmi on CPU events
Issue
AFF-A800 storage systems might experience an unresponsive CPU, followed by a watchdog non-maskable interrupt (NMI) system disruption.
The following is an example of the controller disruption message:
- PANIC: watchdog nmi on cpu 12, hang cpu is 22 in process idle: cpu12 on release 9.11.1P11 (C) on Mon Oct 30 19:52:57 CDT 2023
version: 9.11.1P11: Wed Aug 9 11:18:59 EDT 2023
compile flags: x86_64.optimize - PANIC: nested machine check exception detected on CPU 26, no coredump will be generated.
- Requesting SP to power cycle the filer to attempt to clear the Machine Check Event
