AFF A250, C250 or FAS500f panic: watchdog nmi because IPMI interface congested
Applies to
- AFF A250, AFF C250
- ASA A250, ASA C250
- FAS500f
- BMC 15.11 or earlier
Issue
- Node reboots with panic:
PANIC: watchdog nmi because IPMI interface congested. in process idle: cpu9
- BMC event logs or SP-LATEST-SYSTEM-EVENT-LOG indicate a watchdog interrupt followed by multiple bus correctable errors:
292 | 12/02/2022 | 17:14:14 | Watchdog 2 #0x0f | Timer interrupt | Asserted
293 | 12/02/2022 | 17:14:16 | Watchdog 2 #0x0f | Hard reset | Asserted
294 | 12/02/2022 | 17:14:17 | Unknown #0x51 | State Asserted
2a2 | 12/02/2022 | 17:15:00 | Critical Interrupt #0x31 | Bus Correctable error | Asserted
2a3 | 12/02/2022 | 17:15:00 | Critical Interrupt #0x31 | Bus Correctable error | Asserted
2a4 | 12/02/2022 | 17:15:00 | Critical Interrupt #0x31 | Bus Correctable error | Asserted
- SSRAM logs report
NMIsource(WdogBMCFail)
- After panic reboot the node may report temperature error at
Cluster::>event log show
[monitor.temp.unreadable:error]: The controller temperature (HIC2 Temp0) is not readable.
[monitor.temp.unreadable:error]: The controller temperature (HIC2 Temp1) is not readable.
[callhome.chassis.hitemp:error]: Call home for CHASSIS OVER TEMPERATURE
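
The event-log signature described above (a Watchdog 2 timer interrupt, then a hard reset, then bus correctable errors) can be checked against a saved copy of the event log. The following is a minimal sketch, not a NetApp tool. It assumes the pipe-delimited layout of the sample entries in this article, and the names parse_sel_line, matches_watchdog_signature and SAMPLE_SEL are hypothetical.

```python
# Sketch only: assumes each event-log line is pipe-delimited as in the
# sample above: id | date | time | sensor | event [| state]

SAMPLE_SEL = [
    "292 | 12/02/2022 | 17:14:14 | Watchdog 2 #0x0f | Timer interrupt | Asserted",
    "293 | 12/02/2022 | 17:14:16 | Watchdog 2 #0x0f | Hard reset | Asserted",
    "294 | 12/02/2022 | 17:14:17 | Unknown #0x51 | State Asserted",
    "2a2 | 12/02/2022 | 17:15:00 | Critical Interrupt #0x31 | Bus Correctable error | Asserted",
]

# Ordered (sensor substring, event substring) pairs that make up the signature.
SIGNATURE = [
    ("Watchdog 2", "Timer interrupt"),
    ("Watchdog 2", "Hard reset"),
    ("Critical Interrupt", "Bus Correctable error"),
]


def parse_sel_line(line):
    """Split one log line into fields; return None if it does not fit the layout."""
    fields = [f.strip() for f in line.split("|")]
    if len(fields) < 5:
        return None
    return {
        "id": fields[0],
        "date": fields[1],
        "time": fields[2],
        "sensor": fields[3],
        # Everything after the sensor column is kept together as the event text.
        "event": " | ".join(fields[4:]),
    }


def matches_watchdog_signature(lines):
    """Return True if the signature events appear in order (other entries may sit between them)."""
    stage = 0
    for line in lines:
        entry = parse_sel_line(line)
        if entry is None:
            continue
        sensor, event = SIGNATURE[stage]
        if sensor in entry["sensor"] and event in entry["event"]:
            stage += 1
            if stage == len(SIGNATURE):
                return True
    return False


if __name__ == "__main__":
    print(matches_watchdog_signature(SAMPLE_SEL))
```

A match only indicates that the log resembles the pattern in this article. The panic string and the SSRAM NMIsource(WdogBMCFail) entry should still be confirmed separately.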
