Rebooting BMC because one or more sensors are unreadable
Applies to
- ONTAP 9
- BMC
Issue
ONTAP reports multiple sensor unreadble condition and automatically recovers.
Example:
12/21/2024 20:38:14 node-01 NOTICE nvmem.battery.normalCharge: The NVMEM battery charging status is normal.12/21/2024 20:37:00 node-01 NOTICE monitor.globalStatus.ok: The system's global status is normal.12/21/2024 20:36:43 node-01 NOTICE monitor.chassisFan.ok: Chassis fan SysFan3 F2 is ok.12/21/2024 20:36:43 node-01 NOTICE monitor.chassisFan.ok: Chassis fan SysFan3 F1 is ok.12/21/2024 20:35:46 node-01 ERROR callhome.c.fan.fru.shut: Call home for MULTIPLE CHASSIS FAN FAILED: System will shut down in 2 minutes12/21/2024 20:35:22 node-01 ERROR callhome.c.fan.fru.fault: Call home for CHASSIS FAN FRU FAILED: Fan3_212/21/2024 20:35:22 node-01 ERROR callhome.c.fan.fru.fault: Call home for CHASSIS FAN FRU FAILED: Fan3_112/21/2024 20:35:01 node-01 NOTICE sp.reboot.sensor.unreadable: Rebooting BMC because one or more sensors are unreadable.12/21/2024 20:35:01 node-01 NOTICE monitor.shutdown.cancel: Automatic shutdown sequence canceled.12/21/2024 20:35:00 node-01 EMERGENCY monitor.globalStatus.critical: Multiple fans has failed: SysFan3 F2, SysFan3 F1.12/21/2024 20:34:51 node-01 EMERGENCY monitor.chassisFanFail.xMinShutdown: Multiple Chassis Fan failure: System will shut down in 2 minutes.12/21/2024 20:34:46 node-01 ERROR monitor.chassisFan.stop: Chassis fan contains at least one stopped fan: Fan3_2 (failed)12/21/2024 20:34:46 node-01 ERROR monitor.chassisFan.stop: Chassis fan contains at least one stopped fan: Fan3_1 (failed)[env_mgr: monitor.chassisTemperature.cool:alert]: Chassis temperature is too cool: Bat Ambient 2 is critical low (0 C).[env_mgr: monitor.chassisTemperature.cool:alert]: Chassis temperature is too cool: PM40068B Temp is critical low (0 C).[env_mgr: nvram.battery.capacity.low.critical:EMERGENCY]: The NVRAM battery capacity is critically low (0 cycles). To prevent data loss, the system will shut down in 5 minutes.[env_mgr: nvram.hw.degraded:error]: NVRAM hardware is degraded: NVS Power Good has fault.[env_mgr: callhome.battery.failure:EMERGENCY]: Call home for BATTERY (capacity low) CRITICAL.- SP-LATEST-SYSTEM-EVENT-LOG reports BMC reboot:
Record 2120: Wed Mar 11 19:22:17.239175 2026 [IPMI.warning]: Recovering BMC due to a non-readable sensor
Record 2121: Sun Jan 01 00:00:20.853442 2017 [IPMI.notice]: 052d | c0 | OEM: ffff70005100 | ManufId: 150300 | BMC Reset Internally
- sp.reboot.sensor.unreadable NetApp Support Site - Syslog Translator - Event Details
| Ems Identifier | sp.reboot.sensor.unreadable |
|---|---|
| Syslog Message | Rebooting %s because one or more sensors are unreadable. |
| Severity | NOTICE |
| Description | This message occurs when one or more sensors are unreadable from Service Processor (SP) or Baseboard Management Controller (BMC).SP or BMC is rebooted in attempt to recover the sensor reading. |
| Corrective Action | No user action is required. |
