NS224 partner module reboot during NSM100 replacement
Applies to
- ONTAP 9
- NS224
- Firmware rev 0231
Issue
- During an NSM100 replacement, both nodes reboot due to a fatal multidisk error:
Wed Jun 19 23:14:26 -0400 [cluster1-01: fmmbx_instanceWorker: cf.multidisk.fatalProblem:error]: Node encountered a multidisk error or other fatal error while waiting to be taken over. Permanent errors on all HA mailbox disks (while marshalling header).
Wed Jun 19 23:14:25 -0400 [cluster1-02: fmmbx_instanceWorker: cf.multidisk.fatalProblem:error]: Node encountered a multidisk error or other fatal error while waiting to be taken over. Permanent errors on all HA mailbox disks (while marshalling header).
- During the replacement, the partner module experiences multiple I2C bus errors and reboots at the same time the module being replaced is removed:
Thu Jun 20 03:12:51 2024 ( 1490+14:30:00.815); 0700000A; M0; HAL; hal; 04; HAL_I2CTarget_TollBooth: busRequest failed(44457543) kBus[4] addr:75 fd:30 retryCnt:1, errno:6 rc:-1 tgtAddr:0x46 offset:0x2 op:0x0
Thu Jun 20 03:12:51 2024 ( 1490+14:30:00.816); 0700000A; M0; HAL; hal; 04; HAL_I2CTarget_TollBooth: busRequest failed(44457544) kBus[4] addr:75 fd:30 retryCnt:1, errno:6 rc:-1 tgtAddr:0x46 offset:0x41 op:0x0
Thu Jun 20 03:12:51 2024 ( 1490+14:30:00.827); 0700000A; M0; HAL; hal; 04; HAL_I2CTarget_TollBooth: busRequest failed(44457545) kBus[4] addr:75 fd:30 retryCnt:1, errno:6 rc:-1 tgtAddr:0x46 offset:0x41 op:0x0
Thu Jun 20 03:12:51 2024 ( 1490+14:30:00.839); 0700000A; M0; HAL; hal; 04; HAL_I2CTarget_TollBooth: busRequest failed(44457546) kBus[4] addr:75 fd:30 retryCnt:1, errno:6 rc:-1 tgtAddr:0x46 offset:0x41 op:0x0
Thu Jun 20 03:22:57 2024 ( 0+00:00:48.519); 80000000; U?; HAL; hal; 04; +++ ++++++++++++++++++++++++++++++++++ +++
Thu Jun 20 03:22:57 2024 ( 0+00:00:48.519); 02000233; U?; HAL; hal; 04; +++ Application version 0230 launching +++
Thu Jun 20 03:22:57 2024 ( 0+00:00:48.519); 80000000; U?; HAL; hal; 04; +++ ++++++++++++++++++++++++++++++++++ +++