Node down due to CPU issues
Applies to
- NetApp SolidFire SF-Series
- Element Software version 12.2 and 12.3
Issue
- Node is down unexpectedly and not rebooted automatically
- iDRAC shows error CPU x(id) has an internal error (IERR)
- ipmitoolcommand executed from a remote server (Ex: management node VM) confirms processor error asserted:- Command: ipmitool -H BMC_IP_ADDRESS -U bmc_username -I lan sel list
- Example output:
 ID | DATE | TIME | Processor #0x60 | IERR | Asserted
- Command: ipmitool -H BMC_IP_ADDRESS -U bmc_username -I lan sel list -v
- Example output:
 SEL Record ID : ID
 Record Type : 02
 Timestamp : DATE TIME
 Generator ID : 0020
 EvM Revision : 04
 Sensor Type : Processor
 Sensor Number : 60
 Event Type : Sensor-specific Discrete
 Event Direction : Assertion Event
 Event Data : 00ffff
 Description : IERR
 
- Command: 
