CHW-3131: NSM100 module fails during shelf module FW upgrade
Issue
- Single path to disks in the shelf are seen after a NSM100 shelf firmware upgrade on a NS224 shelf
- For example, an upgrade from FW 0240 to 0306
- The NSM100 module shows a "failed" status after the failed upgrade
EMSreport NSM100 module sensor unreadble after NSM100 module reboot following shelf fw upgrade:[node_name: dsa_worker1: ses.status.temperatureWarning:alert]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x temperature warning for Temperature sensor 6: not installed or failed. Current temperature: 36 C (96 F). This element is on the unknown location.
[node_name: dsa_worker1: ses.status.temperatureWarning:alert]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x temperature warning for Temperature sensor 7: not installed or failed. Current temperature: 69 C (156 F). This element is on the unknown location.
[node_name: dsa_worker1: ses.status.temperatureWarning:alert]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x temperature warning for Temperature sensor 8: not installed or failed. Current temperature: 63 C (145 F). This element is on the unknown location.
[node_name: dsa_worker1: ses.status.temperatureWarning:alert]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x temperature warning for Temperature sensor 9: not installed or failed. Current temperature: 66 C (150 F). This element is on the unknown location.
[node_name: dsa_worker1: ses.status.electronicsWarn:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x environmental monitoring warning for SES electronics 1: communication error. ; enclosure services hardware failed This element is on the rear of the shelf at the top, on module A.
[node_name: dsa_worker1: ses.status.ModuleWarn:alert]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x PCI switch warning for PCI Switch 1: communication error. This element is on the rear of the shelf at the top, on module A.
[node_name: dsa_worker1: ses.status.ACPWarn:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x ACP Processor warning for shelf ACP processor 1: communication error. ; Alternate Control Path hardware failed This element is on the rear of the shelf at the top, on module A.
[node_name: dsa_worker1: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x DIMM failure for Dimm Element 1: not installed or failed. This element is on the DIMM slot 1 in the top shelf module (A).
[node_name: dsa_worker1: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x DIMM failure for Dimm Element 2: not installed or failed. This element is on the DIMM slot 2 in the top shelf module (A).
[node_name: dsa_worker1: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x DIMM failure for Dimm Element 3: not installed or failed. This element is on the DIMM slot 3 in the top shelf module (A).
[node_name: dsa_worker1: ses.status.dimm.error:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x DIMM failure for Dimm Element 4: not installed or failed. This element is on the DIMM slot 4 in the top shelf module (A).
[node_name: dsa_worker1: ses.status.battery.error:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x battery failure error for Coin Battery 1: not installed or hardware failure. This element is on the rear of the shelf, in top module (A).
[node_name: dsa_worker1: ses.status.etherConn.warn:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x Ethernet connector warning for port e0a: cannot communicate with connector. This element is on the rear of the shelf at the top, on module A.
[node_name: dsa_worker1: ses.status.etherConn.warn:error]: NS224NSM100 (S/N XXXX) shelf 0 on channel 0x Ethernet connector warning for port e0b: cannot communicate with connector. This element is on the rear of the shelf at the top, on module A.
[node_name: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process schm: NoPathFromSwitchToNSM_Alert[1.0:A].
