Node down due to CPU issues
Applies to
- NetApp SolidFire SF-Series
- Element Software versions 12.2 and 12.3
Issue
- The node goes down unexpectedly and does not reboot automatically
- iDRAC shows the error:
CPU x(id) has an internal error (IERR)
- The ipmitool command, run from a remote server (for example, the management node VM), confirms that a processor error is asserted:
- Command:
ipmitool -H BMC_IP_ADDRESS -U bmc_username -I lan sel list
- Example output:
ID | DATE | TIME | Processor #0x60 | IERR | Asserted
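On a node with a long event log, the processor entry can be hard to spot by eye. The following is a minimal sketch of one way to filter the short listing, not an official NetApp or ipmitool procedure. The function name `filter_ierr` is hypothetical, and the sketch assumes the six pipe-separated fields shown in the example output above.

```shell
# Hypothetical helper (not part of ipmitool): reads "sel list" output on
# stdin and prints only the records where a Processor sensor reports IERR
# as Asserted. Assumes six pipe-separated fields per record.
filter_ierr() {
  awk -F'|' '
    # Strip leading and trailing blanks from a copy of the field.
    function trim(s) { gsub(/^[ \t]+|[ \t]+$/, "", s); return s }
    trim($4) ~ /^Processor/ && trim($5) == "IERR" && trim($6) == "Asserted"
  '
}

# Usage, with the same placeholders as the command above:
#   ipmitool -H BMC_IP_ADDRESS -U bmc_username -I lan sel list | filter_ierr
```

If the filter prints nothing, the SEL holds no asserted processor IERR record in that format, and the full listing should be reviewed manually.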
- Command:
ipmitool -H BMC_IP_ADDRESS -U bmc_username -I lan sel list -v
- Example output:
SEL Record ID : ID
Record Type : 02
Timestamp : DATE TIME
Generator ID : 0020
EvM Revision : 04
Sensor Type : Processor
Sensor Number : 60
Event Type : Sensor-specific Discrete
Event Direction : Assertion Event
Event Data : 00ffff
Description : IERR
- Command: