FAS8700/FAS8300/AFF A400/AFF C400 reports (CHASSIS OVER TEMPERATURE) ERROR warnings
Applies to
- ONTAP 9
- AFF A400, AFF C400, ASA A400, ASA C400
- FAS8300
- FAS8700
Issue
- Hourly error messages for chassis temperature, in both nodes. Example:
[node_name-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (LED1 Temp) is not readable.
[node_name-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (LED2 Temp) is not readable.
[node_name-01: env_mgr: callhome.chassis.hitemp:error]: Call home for CHASSIS OVER TEMPERATURE
[node_name-02: env_mgr: monitor.temp.unreadable:error]: The controller temperature (LED1 Temp) is not readable.
[node_name-02: env_mgr: monitor.temp.unreadable:error]: The controller temperature (LED2 Temp) is not readable.
[node_name-02: env_mgr: callhome.chassis.hitemp:error]: Call home for CHASSIS OVER TEMPERATURE
- AutoSupport generated with:
HA Group Notification (CHASSIS OVER TEMPERATURE) ERROR
- LED1 or LED2 Temp sensor failed. Example:
::> system node run -node node_name-01 -command environment chassis list-sensors
Sensor Name State Current Critical Warning Warning Critical
Reading Low Low High High
----------------------------------------------------------------------------------------
...
LED1 Temp failed -- C 0 C 3 C 43 C 46 C
or
::> system node run -node node_name-01 -command environment status chassis
Sensor Name State Current Critical Warning Warning Critical
Reading Low Low High High
-------------------------------------------------------------------------------------------------
LED1 Temp not_available -- C 0 C 3 C 43 C 46 C
LED2 Temp not_available -- C 0 C 3 C 43 C 46 C
- "
No reading
" outputs for the LED sensors in the BMC debug logs. Example:
LED1_Temp | 16h | ns | 12.1 | No Reading
LED2_Temp | 17h | ns | 12.1 | No Reading
LED1_Temp | na | degrees C | na | na | 0.000 | 3.000 | 43.000 | 46.000 | na
LED2_Temp | na | degrees C | na | na | 0.000 | 3.000 | 43.000 | 46.000 | na
- In certain scenario, node may Panic as well with below messages:
[node_name-02: env_mgr: monitor.shutdown.chassisUnderTemp:error]: Chassis temperature is too cold: Ambient temperature is critical low. System will be shutdown immediately
[node_name-02: monitor: monitor.globalStatus.critical:EMERGENCY]: Chassis temperature is too low..
[node_name-02: env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Temperature critical)
[node_name-02: mgwd: mgwd.notify.halt.result:info]: MGWD able to notify CLAM on its HA partner node that this node is undergoing a planned shutdown (reason: E). Error: -