NetApp Knowledge Base

How to troubleshoot correctable memory errors on FAS and AFF systems

Applies to

  • ONTAP 9
  • Data ONTAP 8
  • AFF / FAS platforms
  • DIMM Replacement Guide

Answer

Check Active IQ to see whether CECC memory errors affect your systems.

Choose the appropriate guide based on platform and ONTAP version.

Platform:
  • AFF A900 / FAS9500
  • ASA A900
  • AFF A800, C800
  • ASA A800, ASA C800
  • AFF A700s
  • AFF A700 / FAS9000
  • AFF A400, C400 / FAS8300 / FAS8700
  • ASA A400, ASA C400
  • AFF A300 / FAS8200
  • AFF A250, C250 / FAS500f
  • ASA A250, ASA C250
  • AFF A220 / AFF C190 / FAS27x0
  • AFF A200 / FAS26x0
  • ASA A150
  • AFF80x0 / FAS80x0

System DIMM, ONTAP versions:
  • 9.1P18 and later P releases
  • 9.3P11 and later P releases
  • 9.4P6 and later P releases
  • 9.5 and later major releases
Guide: Correctable memory errors in ONTAP with dynamic thresholds

System DIMM, ONTAP versions:
  • 9.1P17 and earlier P releases
  • 9.2 all P releases
  • 9.3P10 and earlier P releases
  • 9.4P5 and earlier P releases
Guide: Correctable memory errors reporting in ONTAP versions with static thresholds

NVRAM DIMM, ONTAP version 9.1 and higher
Guide: Correctable memory errors on NVRAM DIMMs in ONTAP
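
The System DIMM version rows above can be read as a simple lookup. The following is a minimal sketch, not a NetApp tool: the function name `system_dimm_guide` and the version-string format it accepts (for example `9.3P11` or `9.10.1P3`) are assumptions made for illustration, and release-candidate or other suffixes are not handled.

```python
import re

# Lowest P release in each family that uses dynamic thresholds,
# transcribed from the System DIMM rows of the guide table above.
DYNAMIC_MIN_PATCH = {(9, 1): 18, (9, 3): 11, (9, 4): 6}

# Hypothetical parser: accepts "9.5", "9.3P11", "9.10.1P3".
_VERSION_RE = re.compile(r"^(\d+)\.(\d+)(?:\.\d+)?(?:P(\d+))?$")


def system_dimm_guide(version: str) -> str:
    """Return 'dynamic' or 'static' for an ONTAP 9 version string."""
    match = _VERSION_RE.match(version.strip())
    if match is None:
        raise ValueError(f"unrecognized ONTAP version: {version!r}")
    major, minor = int(match.group(1)), int(match.group(2))
    patch = int(match.group(3) or 0)  # no P suffix is treated as P0
    if major != 9 or minor < 1:
        raise ValueError("the table covers ONTAP 9.1 and later only")
    if minor >= 5:
        return "dynamic"  # 9.5 and later major releases
    min_patch = DYNAMIC_MIN_PATCH.get((major, minor))
    if min_patch is None:
        return "static"  # 9.2: all P releases use static thresholds
    return "dynamic" if patch >= min_patch else "static"


for v in ("9.1P17", "9.1P18", "9.2P4", "9.3P11", "9.4P5", "9.5"):
    print(v, "->", system_dimm_guide(v))
```

The result only selects between the two System DIMM guides; NVRAM DIMMs on 9.1 and higher use their own guide regardless of patch level.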

EOS (End of support)

How to check the System availability and End of support date on my ONTAP platform?

Platform:
  • FAS25x0
  • FAS22x0
  • V / FAS32x0
  • V / FAS62x0
System or NVRAM, ONTAP version 9.1 and higher
Guide: Correctable memory errors on 62XX, 32XX, 25XX, and 22XX systems in ONTAP

Platform:
  • FAS80x0
  • FAS25x0
  • FAS22x0
  • V / FAS32x0
  • V / FAS62x0
System or NVRAM, Data ONTAP 8 7-Mode
Guide: Correctable memory errors on Data ONTAP 8

Additional Information

ONTAP storage systems use error-correcting code (ECC) memory modules (DIMMs) that can correct in-flight memory errors with little to no impact on performance. Correctable ECC (CECC) errors are not a reliable predictor of disruptive uncorrectable ECC (UECC) errors, especially with the latest memory controllers and DRAM.

  • Previously, ONTAP alerted about "excessive" CECC errors based on a threshold of 500 errors since the last reboot.
  • These alerts can be considered false positives and can lead to unnecessary hardware maintenance without significant benefits.
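
To see why a fixed count since the last reboot tends to produce false positives, consider how long a system must simply stay up before it crosses the old limit. The sketch below is illustrative arithmetic only: the constant error rates are hypothetical, and it assumes the alert fires once the cumulative count reaches 500.

```python
import math

STATIC_THRESHOLD = 500  # CECC errors since last reboot, per the text above


def days_until_static_alert(errors_per_day: float) -> int:
    """Days of uptime before a cumulative count reaches the static threshold.

    Assumes a constant (hypothetical) error rate and that the alert
    fires when the count reaches the threshold.
    """
    if errors_per_day <= 0:
        raise ValueError("errors_per_day must be positive")
    return math.ceil(STATIC_THRESHOLD / errors_per_day)


for rate in (0.5, 2, 50):
    days = days_until_static_alert(rate)
    print(f"{rate} errors/day -> alert after {days} days of uptime")
```

Under these assumptions, any nonzero error rate eventually triggers the alert if the controller is not rebooted, so the static count largely measures uptime and not DIMM health.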

NetApp has since moved ONTAP to a dynamic monitoring algorithm with a much higher threshold.

  • CECC errors are still logged, but on their own they do not indicate a need for DIMM replacement.
  • When CECC memory errors reach a critical state, ONTAP triggers the Health Monitor alert "CriticalCECCCountMemErrAlert" and a corresponding "Health Monitor" AutoSupport message.

We recommend updating your BIOS to the latest version to improve memory management and resilience to UECC errors. This also reduces scenarios where DIMMs can be mapped out during boot. Find the latest BIOS/LOADER version for your systems on the System Firmware & Diagnostics Download page.

Note: JEDEC-standard NVDIMM modules are used in AFF A800, AFF A400, AFF A320, FAS8700, and FAS8300 platforms.
