Permanent errors on all HA mailbox disks PANIC upgrading to 9.7P10
Applies to
AFF A200
Issue
- PANIC upgrading ONTAP to 9.7P10 from 9.5P6
Panic_Message: Permanent errors on all HA mailbox disks (while marshalling header) in SK process fmmbx_instanceWorker on release 9.7P10 (C)
- Just one node reports:
node_name ERROR monitor.temp.unreadable: The controller temperature (Midplane 3 Temp) is not readable.
node_name ERROR monitor.temp.unreadable: The controller temperature (Midplane 4 Temp) is not readable.
node_name ERROR monitor.temp.unreadable: The controller temperature (Module A Expander Temp) is not readable.
node_name ERROR scsi.cmd.adapterHardwareErrorEMSOnly: Enclosure services device 0b.00.99: Adapter detected hardware error: HA status 0x6: cdb 0x1c.
- Both Power Supplies (PSUs) work fine but:
monitor.globalStatus.critical: Chassis temperature is too high..
monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1.
monitor.globalStatus.critical: Multiple fans has failed. Chassis temperature is too high..
monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1. Chassis temperature is too high..
monitor.globalStatus.critical: Multiple fans has failed. Chassis temperature is too high..
- PLATFORM-SENSOR output:
PSU2 FRU fru fault normal PSU_OFF
PSU1 FRU fru fault normal PSU_OFF
PSU2 Bad discrete fault normal TRUE
PSU1 Bad discrete fault normal TRUE
PSU2 discrete fault normal BAD
PSU1 discrete fault normal BAD
PSU2 ON discrete fault normal OFF
PSU1 ON discrete fault
- The issue remains with the Service Processor firmware and internal switch updated to the latest version.