Skip to main content
NetApp Knowledge Base

AFF-A250 down unable to boot and indicating a hardware failure

Views:
332
Visibility:
Public
Votes:
1
Category:
aff-series
Specialty:
hw
Last Updated:

Applies to

  • AFF-A250
  • ONTAP 9

Issue

  • Node goes down with the below PANIC message:

PANIC: Uncorrectable Machine Check Error at CPU9. SKL_IIO Error: STATUS<0xbb80000000000e0b>(VALID,UC,EN,MISCV,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))MISC<0x0000000064000000>(UCR_BUS_LOG(100),UCR_DEVICE_LOG(0),UCR_FUNCTION_LOG(0),UCR_SEGMENT_LOG(0))IIO Machine Check from device(s):RPT(100,0,0):ErrSrcID(CorrSrc(0),UCorrSrc(0x6660)), PLX PCIE 9797 switch on Controller, Br[9797](102,12,0): Link down. . in process idle: cpu9 on release 9.8P3 (C) on Mon Aug 2 00:23:10 CEST 2021 version: 9.8P3: Sat Mar 27 04:59:49 EDT 2021

  • The diagnostic boot fails:

[   55.095414] mce: [Hardware Error]: CPU 0: Machine Check Exception: 5 Bank 6: bb80000000000e0b
[   55.104912] mce: [Hardware Error]: RIP !INEXACT! 10:<ffffffff81786e4f> 
[   55.112086] {mwait_idle+0x6f/0x160} mce: [Hardware Error]: TSC 49a9a022c2 MISC 64000000
[   55.121240] mce: [Hardware Error]: PROCESSOR 0:50654 TIME 1627869582 SOCKET 0 APIC 0 microcode 2006906
[   55.131605] mce: [Hardware Error]: Run the above through 'mcelog --ascii'
[   55.139166] mce: [Hardware Error]: Machine check: Processor context corrupt
[   55.146909] Kernel panic - not syncing: Fatal machine chec

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.
  • Was this article helpful?