Multiple sensors become "not readable" after BMC upgrade
Applies to
- AFF-A400
- ONTAP 9.7P17
- BMC 13.11P1
Issue
- After a BMC upgrade, multiple sensors on a node report "No Reading".
Example from the BMC debug log:
======================================
Self Test: (run for 1 sec)
    Selftest: passed
Sensor Reading: (run for 1 sec)
    PilotIV         | 00h | ok  |  0.1 | Dynamic MC @ 20h
    PVCCIN_CPU0      | 01h | ns  | 21.1 | No Reading
    PVCCIN_CPU1      | 02h | ns  | 21.1 | No Reading
    PVDDQ_ABC        | 03h | ns  | 21.1 | No Reading
    PVDDQ_DEF        | 04h | ns  | 21.1 | No Reading
    PVDDQ_GHI        | 05h | ns  | 21.1 | No Reading
    PVDDQ_KLM        | 06h | ns  | 21.1 | No Reading
    P1V05_PCH        | 07h | ns  | 21.1 | No Reading
    System_Inlet     | 10h | ok  |  7.1 | 23 degrees C
    CX5_Inlet        | 11h | ns  |  7.2 | No Reading
    System_Outlet    | 12h | ok  |  7.3 | 30 degrees C
    NVMe_Temp        | 13h | ok  |  7.4 | 24 degrees C
    CX5_Temp1        | 14h | ns  |  7.5 | No Reading
    CX5_Temp2        | 15h | ns  |  7.6 | No Reading
    LED1_Temp        | 16h | ok  | 12.1 | 22 degrees C
    LED2_Temp        | 17h | ok  | 12.1 | 22 degrees C
    MP_Temp1         | 18h | ok  | 15.1 | 23 degrees C
    MP_Temp3         | 19h | ok  | 15.1 | 23 degrees C
    RiserL_Temp1     | 1Ah | ns  | 11.1 | No Reading
    RiserL_Temp2     | 1Bh | ns  | 11.1 | No Reading
    RiserM_Temp1     | 1Ch | ns  | 11.2 | No Reading
    RiserM_Temp2     | 1Dh | ns  | 11.2 | No Reading
    RiserM_Temp3     | 1Eh | ns  | 11.2 | No Reading
    RiserM_Temp4     | 1Fh | ns  | 11.2 | No Reading
    RiserR_Temp1     | 20h | ns  | 11.3 | No Reading
    RiserR_Temp2     | 21h | ns  | 11.3 | No Reading
    RiserR_Temp3     | 22h | ns  | 11.3 | No Reading
    RiserR_Temp4     | 23h | ns  | 11.3 | No Reading
    CPU0_Temp        | 24h | ns  |  3.1 | No Reading
    CPU1_Temp        | 25h | ns  |  3.2 | No Reading
    Mezz_Temp1       | 3Ch | ns  | 11.4 | No Reading
    Mezz_Temp2       | 3Dh | ns  | 11.4 | No Reading
    Bat_Temp         | 40h | ok  | 40.1 | 23 degrees C
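To list the affected sensors quickly, the sensor table above can be filtered for rows whose status column is `ns`. The following is a minimal Python sketch, not a NetApp tool: the function names `parse_sensor_rows` and `unreadable` are hypothetical, and the column meanings (name, sensor ID, status, entity, reading) are assumed from the layout of the output shown above.

```python
import re

# Matches rows such as "PVCCIN_CPU0 | 01h | ns | 21.1 | No Reading".
# Column meanings are an assumption based on the layout of the log excerpt.
ROW = re.compile(
    r"^\s*(\S+)\s*\|\s*([0-9A-Fa-f]+h)\s*\|\s*(\w+)\s*\|\s*([\d.]+)\s*\|\s*(.+?)\s*$"
)

def parse_sensor_rows(text):
    """Return one dict per sensor row found in the pasted log text."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            name, sensor_id, status, entity, reading = m.groups()
            rows.append({
                "name": name,
                "id": sensor_id,
                "status": status,
                "entity": entity,
                "reading": reading,
            })
    return rows

def unreadable(rows):
    """Return the names of sensors whose status is 'ns' (no reading)."""
    return [r["name"] for r in rows if r["status"] == "ns"]

# A few rows copied from the excerpt above.
SAMPLE = """\
Sensor Reading: (run for 1 sec)
    PilotIV         | 00h | ok  |  0.1 | Dynamic MC @ 20h
    PVCCIN_CPU0      | 01h | ns  | 21.1 | No Reading
    System_Inlet     | 10h | ok  |  7.1 | 23 degrees C
    CX5_Inlet        | 11h | ns  |  7.2 | No Reading
    RiserL_Temp1     | 1Ah | ns  | 11.1 | No Reading
"""

if __name__ == "__main__":
    for name in unreadable(parse_sensor_rows(SAMPLE)):
        print(name)
```

Running the same filter on output captured before and after the upgrade would show which sensors changed state, assuming both captures use this layout.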
- PSU-related alerts are also reported, but they clear automatically.
[monitor.chassisPower.degraded:alert]: Chassis power is degraded: PVCCIN CPU0 is critical low (1 mV).
[env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 PIN is Unreadable
[env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 POUT is Unreadable
[env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 PIN is Unreadable
[env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 POUT is Unreadable
[power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU2, PSU1.
[power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU2, PSU1.
[monitor.chassisPower.ok:notice]: Chassis power is OK.
[monitor.globalStatus.critical:EMERGENCY]: Power Supply Status Critical: PSU2, PSU1.
[env_mgr: monitor.chassisPowerSupply.ok:info]: Chassis power supply 1 is OK.
[env_mgr: monitor.chassisPowerSupply.ok:info]: Chassis power supply 2 is OK.
[power_low_monitor: monitor.chassisPowerSupplies.ok:info]: Chassis power supplies OK.
[env_mgr: callhome.chassis.ps.ok:notice]: Call home for CHASSIS POWER SUPPLY OK: PS 1
[env_mgr: callhome.chassis.ps.ok:notice]: Call home for CHASSIS POWER SUPPLY OK: PS 2
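To confirm that each power supply alert was followed by a recovery message, the event lines above can be paired by PSU number. This is a rough Python sketch under stated assumptions: `parse_events` and `unrecovered_psus` are hypothetical helper names, and the line format (optional source, event name, severity, message) is inferred only from the messages shown above.

```python
import re

# "[source: event.name:severity]: message", where "source: " is optional.
# The format is inferred from the excerpt, not from an official grammar.
EVENT = re.compile(
    r"\[\s*(?:(?P<source>\w+):\s*)?(?P<event>[\w.]+):(?P<severity>\w+)\]:\s*(?P<message>.*)"
)
PSU_STATE = re.compile(r"Chassis power supply (\d+) is (degraded|OK)")

def parse_events(text):
    """Return a dict for every line that looks like an event message."""
    events = []
    for line in text.splitlines():
        m = EVENT.search(line)
        if m:
            events.append(m.groupdict())
    return events

def unrecovered_psus(events):
    """Return PSU numbers whose last reported state is 'degraded'."""
    degraded = set()
    for e in events:
        m = PSU_STATE.search(e["message"])
        if not m:
            continue
        psu, state = m.groups()
        if state == "degraded":
            degraded.add(psu)
        else:
            degraded.discard(psu)
    return sorted(degraded)

# Lines copied from the excerpt above.
SAMPLE_EVENTS = """\
[env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 PIN is Unreadable
[env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 PIN is Unreadable
[monitor.chassisPower.ok:notice]: Chassis power is OK.
[env_mgr: monitor.chassisPowerSupply.ok:info]: Chassis power supply 1 is OK.
[env_mgr: monitor.chassisPowerSupply.ok:info]: Chassis power supply 2 is OK.
"""

if __name__ == "__main__":
    print(unrecovered_psus(parse_events(SAMPLE_EVENTS)))
```

An empty result means every PSU that reported a degraded state later reported OK, which matches the automatic recovery described above.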