System Takeover Due to CPU Catastrophic Error on Node
Applies to
AFF-A90
Issue
- Node reboots without any panic string or error messages
- The BMC CLI command
bmc status -dshows theCPU Catastrophic Errorbeing asserted and de-asserted.
root: eventfifod 446.567: 123(0x007b) : CPU Catastrophic Error asserted
root: eventfifod 446.567: 123(0x807b) : CPU Catastrophic Error de-asserted
root: eventfifod 447.367: 126(0x007e) : CPU Error Level 2 asserted
root: eventfifod 470.514: 126(0x807e) : CPU Error Level 2 de-asserted
root: eventfifod 472.981: 123(0x007b) : CPU Catastrophic Error asserted
root: eventfifod 472.981: 123(0x807b) : CPU Catastrophic Error de-asserted
root: eventfifod 762.422: 95(0x005f) : NMI Trigger to PCH asserted
root: eventfifod 762.425: 97(0x8061) : LVC3 CPU0 NMI asserted
root: eventfifod 762.425: 98(0xc062) : LVC3 CPU1 NMI asserted
root: eventfifod 762.425: 131(0x4083) : PCH NMI Request from BMC asserted
