AFF-A300/FAS8200 node shutdown due to a chassis over-temperature
- Category:
- fas-systems
- Specialty:
- HW
- Last Updated:
- 12/25/2023, 2:42:36 PM
Applies to
- FAS8200
- AFF-A300
- ONTAP 9
Issue
- The node shuts down due to chassis over-temperature while reporting only 88 °C:
[?] Mon Jul 26 09:32:44 -0400 [Node_1: env_mgr: monitor.chassisTemperature.warm:alert]: Chassis temperature is too warm: CPU0 Temp Margin is critical high (88 C).
[?] Mon Jul 26 09:32:44 -0400 [Node_1: env_mgr: monitor.shutdown.chassisOverTemp:EMERGENCY]: Chassis temperature is too hot: CPU0 Temp Margin is critical high. System will be shutdown in 2 minutes
- The output of the events all command from the BMC shows:
Record 710: Mon Jul 26 13:32:40 2021 [IPMI.notice]: 6301 | 02 | EVT: 015758f5 | CPU0_Temp_Margin | Assertion Event, "Upper Non-critical going high"
Record 711: Mon Jul 26 13:32:40 2021 [IPMI.notice]: 6401 | 02 | EVT: 015958ff | CPU0_Temp_Margin | Assertion Event, "Upper Critical going high"
Record 712: Mon Jul 26 13:32:44 2021 [IPMI.notice]: 6501 | 02 | EVT: 0301ffff | Attn_Sensor1 | Assertion Event, "State Asserted"
Record 713: Mon Jul 26 13:32:47 2021 [IPMI.notice]: 6601 | 02 | EVT: 8159c3ff | CPU0_Temp_Margin | Deassertion Event, "Upper Critical going high"
Record 714: Mon Jul 26 13:32:48 2021 [IPMI.notice]: 6701 | 02 | EVT: 8157c3f5 | CPU0_Temp_Margin | Deassertion Event, "Upper Non-critical going high"
Record 715: Mon Jul 26 13:33:02 2021 [IPMI.notice]: 6801 | 02 | EVT: 0300ffff | Attn_Sensor1 | Assertion Event, "State Deasserted"
Record 716: Mon Jul 26 13:34:40 2021 [IPMI.emergency]: triggered OS halt: Temperature critical
Record 717: Mon Jul 26 13:34:42 2021 [IPMI.notice]: 6901 | 02 | EVT: 0301ffff | Attn_Sensor1 | Assertion Event, "State Asserted"
Record 718: Mon Jul 26 13:34:56 2021 [IPMI.notice]: 6a01 | 02 | EVT: 0300ffff | Attn_Sensor1 | Assertion Event, "State Deasserted"
Record 719: Mon Jul 26 13:35:18 2021 [IPMI.notice]: 6b01 | 02 | EVT: 6f03ffff | Sensor 255 | Assertion Event, "Storage OS graceful shutdown"
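To make the timing in the BMC records above easier to read, the following is a minimal Python sketch that parses a few of those records and measures how long the Upper Critical threshold stayed asserted and how long after the assertion the OS halt was triggered. It is not a NetApp tool. The helper names (parse_records, is_event, first) are hypothetical, and the record layout is assumed from the sample output shown in this article only.

```python
import re
from datetime import datetime

# Three records copied from the BMC output in this article.
SEL_TEXT = """\
Record 711: Mon Jul 26 13:32:40 2021 [IPMI.notice]: 6401 | 02 | EVT: 015958ff | CPU0_Temp_Margin | Assertion Event, "Upper Critical going high"
Record 713: Mon Jul 26 13:32:47 2021 [IPMI.notice]: 6601 | 02 | EVT: 8159c3ff | CPU0_Temp_Margin | Deassertion Event, "Upper Critical going high"
Record 716: Mon Jul 26 13:34:40 2021 [IPMI.emergency]: triggered OS halt: Temperature critical
"""

# Assumed layout: "Record N: <timestamp> [IPMI.<level>]: <body>"
RECORD_RE = re.compile(
    r"Record (?P<num>\d+): (?P<ts>\w{3} \w{3} +\d+ \d\d:\d\d:\d\d \d{4}) "
    r"\[IPMI\.(?P<level>\w+)\]: (?P<body>.*)"
)


def parse_records(text):
    """Return one dict per line that matches the assumed record layout."""
    records = []
    for line in text.splitlines():
        m = RECORD_RE.match(line.strip())
        if not m:
            continue  # skip anything that is not a record line
        records.append({
            "num": int(m["num"]),
            "time": datetime.strptime(m["ts"], "%a %b %d %H:%M:%S %Y"),
            "level": m["level"],
            "body": m["body"],
        })
    return records


def is_event(rec, kind, state):
    """True if the last ' | ' field starts with kind and mentions state."""
    fields = rec["body"].split(" | ")
    return len(fields) == 5 and fields[4].startswith(kind) and state in fields[4]


def first(records, predicate):
    """Return the first record that satisfies predicate."""
    return next(r for r in records if predicate(r))


records = parse_records(SEL_TEXT)
asserted = first(records, lambda r: is_event(r, "Assertion Event", "Upper Critical"))
cleared = first(records, lambda r: is_event(r, "Deassertion Event", "Upper Critical"))
halt = first(records, lambda r: "triggered OS halt" in r["body"])

asserted_for = (cleared["time"] - asserted["time"]).total_seconds()
halt_delay = (halt["time"] - asserted["time"]).total_seconds()

print(f"Upper Critical asserted for {asserted_for:.0f} s")
print(f"OS halt triggered {halt_delay:.0f} s after the assertion")
```

For this sample, the threshold stays asserted for 7 seconds and the halt record follows the assertion by 120 seconds, which matches the "shutdown in 2 minutes" EMS message. The BMC timestamps (13:32) are 4 hours ahead of the EMS timestamps (09:32 -0400), so the two logs describe the same moment.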