Uncorrectable memory error on Data ONTAP 8 7-mode
Applies to
- FAS Platforms
- Data ONTAP 8 7-mode
Note: For all other ONTAP platforms and ONTAP versions see: How to troubleshoot uncorrectable memory errors on FAS and AFF systems
Issue
- Node panics with a DIMM error. The panic message can be observed in the messages file or on the system console:
filer> rdfile /etc/messages
- System DIMM
[filer:mgr.stack.string:notice]: Panic string: ECC error at DIMM-1: 94-03-1524-12A92E23,ADDR 0xdf50300,(Node(0), CH(0), DIMM(0), Rank(0), Bank(0x6), Row(0x944), Col(0x20), DQ-0-0-0-0-1-1-1-1 burst-01322310)
- NV DIMM
[filer:mgr.stack.string:notice] Panic string: ECC error at DIMM-NV1: 40-01-1246-01184277,ADDR 0x1eff51980,(Node(0), CH(1), DIMM(0), Rank(0), Bank(0x3),Row(0x2ff4), Col(0x130) Uncorrectable Machine Check E
- System console (connect to SP / system log)
- System DIMM
[filer:sk.panic:ALERT]: Panic String: ECC error at DIMM-1: 94-03-1524-12A92E23,ADDR 0xdf50300,(Node(0), CH(0), DIMM(0), Rank(0), Bank(0x6), Row(0x944), Col(0x20), DQ-0-0-0-0-1-1-1-1 burst-01322310) Uncorrectable Machine Check Error at CPU4. SNB_HA Error: STATUS<0xfe000e8000010090>(Val,OverF,UnCor,Enable,MiscV,AddrV,PCC,CorrSts(0),CorrCnt(0x3a),ExtErr(0x1),ErrCode(Channel 0, Read)ErrCode(0x90))MISC<0x0000002048028286>(HaDbBank(0),PE(0x1),ReqOpcode(0x2),RNID(0x4),RTID(0x1),HTID(0x
- NV DIMM
PANIC : ECC error at DIMM-NV1: 40-01-1246-01184277,ADDR 0x1eff51980,(Node(0), CH(1), DIMM(0), Rank(0), Bank(0x3),Row(0x2ff4), Col(0x130)
Uncorrectable Machine Check Error at CPU2. NB Error: STATUS(Val,OverF,UnCor,Enable,MiscV,AddrV,PCC,CErrCnt(0x424d),RdECC,ErrCode(Channel Unkn, Read)ErrCode(0x9f))MISC(Synd(0x8f886240),Chan(0x1),DIMM(0),RTID(0xc4)), ADDR(0x1eff51980).
version: 8.2.5P5: Thu Jan 7 04:22:14 PST 2021
conf : x86_64
cpuid = 2