CHW-778: NVDIMMBadHealthAlert Reason: FIRMWARE error, on NVDIMM-equipped systems
Issue
AFF A800, AFF A320, AFF A400, FAS8300, or FAS8700 storage systems
EMS:Sun Jun 06 17:17:15 -0700 [NetApp1: nphmd: hm.alert.raised:alert]: Alert Id = NVDIMMBadHealthAlert , Alerting Resource = /dev/nvdimm0:NetApp1 raised by monitor controller
AutoSupport Section: NVDIMM-STATUSTotal NVDIMM on this platform is 2
--------------------------------------------------
DIMM(/dev/nvdimm0) Page:0
DIMM(/dev/nvdimm0):
--------------------------------------------------
Controller Ready: Yes
Controller Busy: No
Energy Policy managed by: HOST
Save_N Low During CSAVE: Yes
Save_N Enabled(ARMED): Yes
Data on the Flash: NotValidModule is Health: No
Module Status(0x0040): NVDIMM FIRMWARE error
Flash Lifetime: 94%
Flash Lifetime Status: Normal
Example Health Alert:
::> system health alert showNode: node02
Alert ID: NVDIMMBadHealthAlert
Resource: /dev/nvdimm0
Severity: Major
Indication Time: Tue Dec 03 02:15:51 2024
Suppress: false
Acknowledge: false
Probable Cause: NVDIMM "NVDIMM-N 0 (DIMM-11)" on node "node02" is indicating a degraded status.
Reason: NVDIMM FIRMWARE error.
Possible Effect: Potential data loss as the NVDIMM becomes degraded.
Corrective Actions: Contact technical support for assistance with NVDIMM module replacement.