CHW-489: Tracking: AFF A400/FAS8300 (CHASSIS OVER TEMPERATURE) ERROR events
Issue
- System is reporting (CHASSIS OVER TEMPERATURE) ERROR events on an hourly basis
- A review of the logs shows that both nodes report the "LED1_Temp" sensor as not readable / failed.
- SP-LATEST-SYSLOG:
  [200519111706][TP][AMBIENT][WARN] One ambient is failed!
  [200519111710][TP][AMBIENT][WARN] One ambient is failed!
  [200519111713][TP][AMBIENT][WARN] One ambient is failed!
  [200519111716][TP][AMBIENT][WARN] One ambient is failed!
  [200519111719][TP][AMBIENT][WARN] One ambient is failed!
  [200519111722][TP][AMBIENT][WARN] One ambient is failed!
  [200519111725][TP][AMBIENT][WARN] One ambient is failed!
  [200519111728][TP][AMBIENT][WARN] One ambient is failed!
- PLATFORM-SENSORS:
  LED1 Temp thermal failed C
- SP-LATEST_IPMI:
  Sensor Reading: (run for 6 sec)
  argc=3 argv[0]=/usr/local/bin/eq argv[1]=dump_sdr
  PVCCIN_CPU0 | 1.78 Volts | ok
  PVCCIN_CPU1 | 1.78 Volts | ok
  PVDDQ_ABC | 1.21 Volts | ok
  PVDDQ_DEF | 1.21 Volts | ok
  PVDDQ_GHI | 1.21 Volts | ok
  PVDDQ_KLM | 1.21 Volts | ok
  P1V05_PCH | 1.06 Volts | ok
  System_Inlet | 25 degrees C | ok
  CX5_Inlet | 39 degrees C | ok
  System_Outlet | 36 degrees C | ok
  NVMe_Temp | 26 degrees C | ok
  CX5_Temp1 | 58 degrees C | ok
  CX5_Temp2 | 48 degrees C | ok
  LED1_Temp | no reading | ns <<<<<
  LED2_Temp | 25 degrees C | ok
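
When checking a saved copy of the SP-LATEST_IPMI sensor dump, the rows whose status is not "ok" (such as LED1_Temp above) can be picked out with a short script. The following is a minimal sketch, not a NetApp tool: the function name `find_unreadable_sensors` is hypothetical, and it assumes one sensor per line in the `name | reading | status` layout shown in the dump.

```python
def find_unreadable_sensors(sdr_text):
    """Return (name, reading, status) for every sensor row whose status is not 'ok'.

    Assumes the 'name | reading | status' layout seen in the sensor dump;
    this is an illustrative helper, not part of any vendor tooling.
    """
    flagged = []
    for line in sdr_text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) != 3:
            continue  # skip headers and any line that is not a sensor row
        name, reading, status = fields
        # Keep only the first token so annotations such as '<<<<<' are ignored.
        status = status.split()[0] if status else ""
        if status != "ok":
            flagged.append((name, reading, status))
    return flagged


# Three rows taken from the dump in this article.
sample = """\
System_Inlet | 25 degrees C | ok
LED1_Temp | no reading | ns <<<<<
LED2_Temp | 25 degrees C | ok
"""

print(find_unreadable_sensors(sample))
# → [('LED1_Temp', 'no reading', 'ns')]
```

Run against the full dump from either node, this would be expected to list only LED1_Temp, matching the "One ambient is failed!" warnings in the SP syslog.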