Excessive error messages issued while upgrading shelf firmware on NVMe shelf
Applies to
- NS224 disk shelves
- NSM100 modules
Issue
- Multiple components monitored by the NSM100 module report errors at the same time.
- Messages such as the following appear on the storage system:
Tue Jun 29 08:09:08 +0200 [node-01: dsa_worker0: ses.status.ModuleWarn:alert]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x PCI switch warning for PCI Switch 1: communication error. This element is on the rear of the shelf at the top, on module A.
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.ACPWarn:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x ACP Processor warning for shelf ACP processor 1: communication error. ; Alternate Control Path hardware failed This element is on the rear of the shelf at the top, on module A.
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x DIMM failure for Dimm Element 1: not installed or failed. This element is on the DIMM slot 1 in the top shelf module (A).
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x DIMM failure for Dimm Element 2: not installed or failed. This element is on the DIMM slot 2 in the top shelf module (A).
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x DIMM failure for Dimm Element 3: not installed or failed. This element is on the DIMM slot 3 in the top shelf module (A).
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x DIMM failure for Dimm Element 4: not installed or failed. This element is on the DIMM slot 4 in the top shelf module (A).
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.battery.error:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x battery failure error for Coin Battery 1: not installed or hardware failure. This element is on the rear of the shelf, in top module (A).
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.etherConn.warn:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x Ethernet connector warning for port e0a: cannot communicate with connector. This element is on the rear of the shelf at the top, on module A.
Tue Jun 29 08:09:08 +0200 [node-01:: dsa_worker0: ses.status.etherConn.warn:error]: NS224NSM100 (S/N XXXXXXXXXXXXXXX) shelf 10 on channel 0x Ethernet connector warning for port e0b: cannot communicate with connector. This element is on the rear of the shelf at the top, on module A.
- The logs also show that a shelf firmware upgrade was performed:
Tue Jun 29 08:32:49 +0200 [node-01: dsa_sfu: sfu.downloadSuccess:info]: [storage download shelf]: Firmware file NSM100.0141.SFW downloaded on 0x.shelf10.
Tue Jun 29 08:32:49 +0200 [node-01: dsa_sfu: sfu.downloadSummary:info]: Shelf firmware updated on 2 shelves.
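One way to check whether a burst of shelf errors coincides with a firmware upgrade is to parse the log lines and compare timestamps. The following is a minimal sketch, not a NetApp tool: the function names `parse_ems` and `errors_near_upgrade`, the one-hour window, and the regular expression are assumptions based only on the line layout of the samples above. The log lines carry no year, so the caller has to supply one.

```python
import re
from datetime import datetime, timedelta

# Layout assumed from the sample lines above:
# "<timestamp> [<node>: <process>: <event>:<severity>]: <message>"
# The node name may be followed by one or two colons.
EMS_LINE = re.compile(
    r"^(?P<ts>\w{3} \w{3} +\d+ \d{2}:\d{2}:\d{2} [+-]\d{4}) "
    r"\[(?P<node>[^:\]]+):+ *(?P<proc>[^:]+): "
    r"(?P<event>[\w.]+):(?P<sev>\w+)\]: (?P<msg>.*)$"
)


def parse_ems(line, year):
    """Parse one log line into a dict, or return None if it does not match."""
    m = EMS_LINE.match(line.strip())
    if m is None:
        return None
    # The timestamp has no year, so the caller-supplied year is prepended.
    ts = datetime.strptime(f"{year} {m.group('ts')}", "%Y %a %b %d %H:%M:%S %z")
    return {
        "time": ts,
        "node": m.group("node"),
        "event": m.group("event"),
        "severity": m.group("sev"),
        "message": m.group("msg"),
    }


def errors_near_upgrade(lines, year, window=timedelta(hours=1)):
    """Return ses.status.* events logged within `window` of an sfu.download* event.

    The one-hour default is an arbitrary illustration, not a documented threshold.
    """
    events = [e for e in (parse_ems(line, year) for line in lines) if e]
    upgrades = [e["time"] for e in events if e["event"].startswith("sfu.download")]
    return [
        e
        for e in events
        if e["event"].startswith("ses.status.")
        and any(abs(e["time"] - u) <= window for u in upgrades)
    ]
```

Given the lines of a saved log excerpt and the year it was collected, `errors_near_upgrade(lines, year)` returns the shelf status events that fall close to a firmware download. An empty result suggests the errors are unrelated to an upgrade and may need separate investigation.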