Bad SAS adapter causes checksum errors in ONTAP 9
Applies to
- ONTAP 9
- SAS adpater
Issue
- Checksum alert received:
CLCKSU:HA Group Notification from node-01 (CHECKSUM ERROR (multiple disks)) ALERT
PCI stealth errors seen through out the EMS logs
Mon Sep 25 02:41:28 -0700 [node-01: HSWL error: pcie.stealth.errors:debug]: params: {'pcie_errors': 'LVMR,1_0.PLX PCIE 8764 switch on Controller, PLX PCIE 8764 switch on Controller. IIO0:RPT(0,2,0): Br[8764](3,4,0): RcvErr(P4(104)); Br[8764](3,4,0): DevStatus(Corr), CorrErr(Rcvr). '}
- The SAS adapter is reset
Mon Sep 25 02:44:51 -0700 [node-01: pmcsas_timeout_0: sas.adapter.firmware.fault:debug]: Detected firmware fault 0xffffffff on SAS adapter 1.
Mon Sep 25 02:44:51 -0700 [node-01: pmcsas_asyncd_0: sas.adapter.debug:debug]: params: {'adapterName': '1', 'debug_string': 'Adapter debug dump is being collected'}
Mon Sep 25 02:44:53 -0700 [node-01: pmcsas_asyncd_0: sas.adapter.reset:debug]: Resetting SAS adapter 1a.
Mon Sep 25 02:45:04 -0700 [node-01: pmcsas_asyncd_0: sas.adapter.not.ready:debug]: SAS adapter 1 did not become ready.
Mon Sep 25 02:45:04 -0700 [node-01: pmcsas_asyncd_0: sas.adapter.hardreset:debug]: Hard resetting SAS adapter 1.