Skip to main content
NetApp Response to Russia-Ukraine Cyber Threat
In response to the recent rise in cyber threat due to the Russian-Ukraine crisis, NetApp is actively monitoring the global security intelligence and updating our cybersecurity measures. We follow U.S. Federal Government guidance and remain on high alert. Customers are encouraged to monitor the Cybersecurity and Infrastructure Security (CISA) website for new information as it develops and remain on high alert.

NetApp KCS Award

NetApp Knowledge Base

Element Software may misreport memory errors and result in a cluster fault for memoryEccThreshold on MemCtlr0

Views:
1,359
Visibility:
Public
Votes:
0
Category:
element-software
Specialty:
solidfire
Last Updated:

Applies to

  • NetApp Element software 12.0 and 12.2
  • NetApp SolidFire SF-Series product line
  • NetApp H-series storage nodes

Issue

  • NetApp Element software may misreport correctable errors on DIMMs as being correctable errors on a node's memory controller
  • Default settings for ECC errors on a node's memory controller are overly aggressive, resulting in a persistent, error severity cluster fault after even a single error
  • The following is the cluster fault shown in NetApp SolidFire Active IQ and the cluster UI
    • Error Code: memoryEccThreshold
    • Details: Correctable ECC memory error count crossed threshold on Memory controller: MemCtlr0
  • Node's BMC system event log (SEL) actually reports error(s) on a DIMM at the same time as the cluster fault(s)
    • [Information]   [Memory Error]   [Memory]            Correctable ECC (CPU_A0) - Asserted

 

Scan to view the article on your device
CUSTOMER EXCLUSIVE CONTENT

Registered NetApp customers get unlimited access to our dynamic Knowledge Base.

New authoritative content is published and updated each day by our team of experts.

Current Customer or Partner?

Sign In for unlimited access

New to NetApp?

Learn more about our award-winning Support