Correctable memory errors reporting in ONTAP versions with static thresholds
- Views:
- 1,207
- Visibility:
- Public
- Votes:
- 0
- Category:
- fas-systems
- Specialty:
- HW
- Last Updated:
- 4/25/2024, 2:56:19 PM
Applies to
ONTAP versions:
- 9.1P17 and earlier P releases
- 9.2 all P releases
- 9.3P10 and earlier P releases
- 9.4P5 and earlier P releases
Platforms:
- AFF A800
- AFF A700s
- AFF A700 / FAS9000
- AFF A300 / FAS8200
- AFF A220 / FAS27x0
- AFF A200 / FAS26x0
- AFF80x0 / FAS80x0
Note: For all other ONTAP platforms and ONTAP versions see: How to troubleshoot correctable memory errors on FAS and AFF systems
Issue
- Node is reporting correctable ECC errors:
event log show -event *cecc*
Sun Nov 11 08:00:52 GMT [ClusterA-01: idle_thread0: cecc_log_summary_1:warning]: params: {'total_num_ceccs': '56', 'num_ceccs': '3'}
Sun Nov 11 08:18:40 CST [cecc_log.summary:warning]: Total of 303 new correctable ECC errors just reported. You might want to check system memory. 12828 correctable ECC errors reported since booting.
- show-memory-errors is reporting multiple CECC errors on the same DIMM
ClusterA::*> set advanced
ClusterA::*> system node show-memory-errors
Correctable ECC Memory Errors:
Node: ClusterA-01
DIMM CECC Multiple Err
Name Count Same Address
------- ------ ------------
DIMM-1 0 false
DIMM-2 22 false
DIMM-3 0 false
Node: ClusterA-02
DIMM CECC Multiple Err
Name Count Same Address
------- ------ ------------
DIMM-1 0 false
DIMM-2 0 false
DIMM-3 0 false
6 entries were displayed.
- AutoSupport alert may be triggered
CriticalCECCCountMemErrAlert