An AFF A800 can trigger an Environmental Shutdown due to a critical temperature issue in the "LED_Board_Temp1" sensor
Applies to
- AFF A800
- BMC firmware 10.2P2 or earlier
Issue
- An AFF A800 can trigger an Environmental Shutdown due to a critical temperature reading from the "LED_Board_Temp1" sensor.
- The BMC firmware version is 10.2P2 or earlier.
The event logs contain several records pointing to the LED_Board_Temp1 sensor:
Record 1889: Sun Dec 01 23:34:31.600000 2019 [IPMI.notice]: 0033 | 02 | EVT: 01572d2a | LED_Board_Temp1 | Assertion Event, "Upper Non-critical going high" | Reading: 45.000 | Threshold: 42.000
Record 1890: Sun Dec 01 23:34:31.610000 2019 [IPMI.notice]: 0034 | 02 | EVT: 01592d2d | LED_Board_Temp1 | Assertion Event, "Upper Critical going high" | Reading: 45.000 | Threshold: 45.000
Record 1891: Sun Dec 01 23:34:34.640000 2019 [IPMI.notice]: 0035 | 02 | EVT: 8159182d | LED_Board_Temp1 | Deassertion Event, "Upper Critical going high" | Reading: 24.000 | Threshold: 45.000
Record 1892: Sun Dec 01 23:34:34.640000 2019 [IPMI.notice]: 0036 | 02 | EVT: 8157182a | LED_Board_Temp1 | Deassertion Event, "Upper Non-critical going high" | Reading: 24.000 | Threshold: 42.000
Record 1893: Sun Dec 01 23:34:54.710000 2019 [IPMI.emergency]: env_mgr trigger OS halt:Temperature critical
Record 1894: Sun Dec 01 23:35:34.000000 2019 [IPMI.notice]: 0037 | 02 | EVT: 6f406fff | Sensor 255 | Assertion Event, "Storage OS stop/shutdown"
Record 1895: Sun Dec 01 23:35:35.000000 2019 [Controller.notice]: Appliance user command halt.
Record 1896: Sun Dec 01 23:35:34.830000 2019 [IPMI Event.critical]: System power down
Record 1897: Sun Dec 01 23:35:34.840000 2019 [IPMI.emergency]: Data ONTAP initiated shutdown
Record 1898: Sun Dec 01 23:35:34.850000 2019 [IPMI.notice]: 0038 | 02 | EVT: 6f00ffff | Power_Event | Assertion Event, "Power off/down"
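In the records above, LED_Board_Temp1 crosses both upper thresholds at a reading of 45 and returns to 24 about three seconds later. The following is a minimal sketch, not a vendor tool, of how saved event log text could be scanned for this kind of short-lived assertion/deassertion pair. It assumes the record layout shown above; the function name find_transient_spikes and the 5-second window are hypothetical choices made only for illustration.

```python
import re
from datetime import datetime

# Matches threshold event records in the layout shown above (assumed format).
RECORD_RE = re.compile(
    r'Record \d+: \w{3} (?P<ts>\w{3} \d{2} \d{2}:\d{2}:\d{2}\.\d+ \d{4}) '
    r'\[IPMI\.notice\]: .*?\| (?P<sensor>\S+) \| '
    r'(?P<kind>Assertion|Deassertion) Event, "(?P<state>[^"]+)" '
    r'\| Reading: (?P<reading>[\d.]+) \| Threshold: (?P<threshold>[\d.]+)'
)

def find_transient_spikes(lines, max_seconds=5.0):
    """Pair each assertion with its deassertion and keep the short-lived ones.

    max_seconds is an illustrative cutoff, not a documented value.
    """
    pending = {}   # (sensor, state) -> (assert time, reading at assertion)
    spikes = []
    for line in lines:
        m = RECORD_RE.search(line)
        if not m:
            continue  # not a threshold event record
        ts = datetime.strptime(m['ts'], '%b %d %H:%M:%S.%f %Y')
        key = (m['sensor'], m['state'])
        reading = float(m['reading'])
        if m['kind'] == 'Assertion':
            pending[key] = (ts, reading)
        elif key in pending:
            start, asserted_reading = pending.pop(key)
            seconds = (ts - start).total_seconds()
            if seconds <= max_seconds:
                spikes.append({
                    'sensor': m['sensor'],
                    'state': m['state'],
                    'asserted_reading': asserted_reading,
                    'cleared_reading': reading,
                    'seconds': seconds,
                })
    return spikes

# Two records copied from the event log excerpt above.
SAMPLE = [
    'Record 1890: Sun Dec 01 23:34:31.610000 2019 [IPMI.notice]: 0034 | 02 | '
    'EVT: 01592d2d | LED_Board_Temp1 | Assertion Event, '
    '"Upper Critical going high" | Reading: 45.000 | Threshold: 45.000',
    'Record 1891: Sun Dec 01 23:34:34.640000 2019 [IPMI.notice]: 0035 | 02 | '
    'EVT: 8159182d | LED_Board_Temp1 | Deassertion Event, '
    '"Upper Critical going high" | Reading: 24.000 | Threshold: 45.000',
]

spikes = find_transient_spikes(SAMPLE)
for spike in spikes:
    print(spike)
```

A match only indicates that a threshold was crossed and cleared within the chosen window. It does not by itself establish the cause of the reading.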
Shortly afterward, the console logs show the environmental shutdown triggered by ONTAP:
Dec 02 00:34:45 [node_name1:monitor.shutdown.chassisOverTemp:EMERGENCY]: Chassis temperature is too hot: Ambient temperature is critical high. System will be shutdown immediately
Dec 02 00:35:05 [node_name1:monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Temperature critical)
Waiting for PIDS: 1987.
Terminated
.
Uptime: 111d13h47m10s
System powering down...