Skip to main content
NetApp Knowledge Base

NetApp H610S node reboots unexpectedly with uncorrectable machine check error

Views:
3,149
Visibility:
Public
Votes:
1
Category:
h-series
Specialty:
hci
Last Updated:

Applies to

  • NetApp H610S
  • NetApp Element software
  • All currently supported versions of BIOS

Issue

  • A node in an Element cluster logs a nodeOffline event for approximately 7 to 15 minutes
  • The metadata drive is in Available status and will not be accepted back into the array
  • Logs indicate that the node has rebooted unexpectedly
  • Entries for Uncorrectable Machine Check Exception are found in the BMC system event log around the time of the nodeOffline event
  • Example of BMC SEL events:
SEL Record ID          : 0053
 Record Type           : 02
 Timestamp             : 11/22/2020 13:18:25
 Generator ID          : 0020
 EvM Revision          : 04
 Sensor Type           : Processor
 Sensor Number         : 74
 Event Type            : Sensor-specific Discrete
 Event Direction       : Assertion Event
 Event Data            : 0bffff
 Description           : Uncorrectable machine check exception

 

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.