System repeatedly reboots with mailbox disks error
Applies to
- AFF-A400
- ONTAP 9.9.1P2
Issue
- System boots up to Waiting for givebackwithReservation conflicterror after a power cycle.
Reservation conflict found on this node's disks!
 Local System ID: 123456789
 Press Ctrl-C for Maintenance menu to release disks.
 [node2:cf.fmns.skipped.disk:notice]: While releasing the reservations in "Waiting For Giveback" state Failover Monitor Node State(fmns) module skipped the disk 0a.00.8 that is owned by 123456789 and reserved by 987654321.
 [node2:cf.disk.ResvFail:ALERT]: Disk 0d.00.9P3 has been reserved by the High Availability (HA) partner as part of a takeover operation.
 [node2:cf.disk.ResvTakeOver:notice]: This node will wait for giveback and the disk reservations to be released.
 Disk reservations have been released
 Waiting for giveback...(Press Ctrl-C to abort wait)
 Waiting for giveback...(Press Ctrl-C to abort wait)
 Waiting for giveback...(Press Ctrl-C to abort wait)
- After rebooting the node from Waiting for givebackstatus, system does not boot up and reboots repeatedly withAll Local mailbox disks are inaccessibleerror.
 Rebooting ...
 Uptime: 6m48s
 BIOS Version: 16.4
 PEI start.
 CPU PEI initialization.
 Wait BMC self-test result.
 BMC self-test: OK.
 UPI initialization.
 CPU initialization.
 Running quick memory initialization.
 SPI FLASH: Primary BIOS
 PEI end.
 DXE start.
 USB initialization.
 PCI host bridge initialization.
 CSM initialization.
 PCI Bus initialization start.
 BDS start.
 Console output devices connect.
 Ready to boot.
 
 Boot Loader version 6.4.9
 Copyright (C) 2000-2003 Broadcom Corporation.
 Portions Copyright (C) 2002-2021 NetApp, Inc. All Rights Reserved.
 
 ACPI RSDP Found at 0x6cc29000
 
 
 Starting AUTOBOOT press Ctrl-C to abort...
 Loading X86_64/freebsd/image2/kernel:0x200000/1147616 0x319000/10792232 0xf64000/3948368 0x1327f50/4387448 0x200240/1016 Entry at 0xffffffff80319000
 Loading X86_64/freebsd/image2/platform.ko:0x1758000/4068960 0x1b39660/628568
 Starting program at 0xffffffff80319000
 ---<<BOOT>>---
 NetApp Data ONTAP 9.9.1P2
 IPMI device unit 0 rev. 1, firmware rev. 13.04, version 2.0, device support mask 0xbf
 IPMI device unit 1 rev. 1, firmware rev. 13.04, version 2.0, device support mask 0xbf
 Copyright (C) 1992-2021 NetApp.
 All rights reserved.
 *******************************
 *                             *
 * Press Ctrl-C for Boot Menu. *
 *                             *
 *******************************
 [node2:fmmb.disk.notAccsble:notice]: All Local mailbox disks are inaccessible.
 [node2:cf.fsm.takeoverOfPartnerDisabled:error]: Failover monitor: takeover of node1 disabled (HA interconnect error. Verify that the partner node is running and that the HA interconnect cabling is correct, if applicable. For further assistance, contact technical support).
 [node2:kern.syslog.msg:notice]: domain xing mode: off, domain xing interrupt: false
 [node2:cf.fsm.takeoverOfPartnerDisabled:error]: Failover monitor: takeover of node1 disabled (unsynchronized log).
- HA interconnect ports are up on partner node during the impaired node's reboot.
SYSCONFIG-A
    slot 0: 10G/25G Ethernet Controller CX5
        e0a MAC Address:    xx:xx:xx:xx:xx:xx (auto-25g_cr-fd-up)
            SFP Vendor:         Amphenol
            SFP Part Number:    NDCCGF-N103
            SFP Serial Number:  APF1234567
        e0b MAC Address:    xx:xx:xx:xx:xx:xx (auto-25g_cr-fd-up)
            SFP Vendor:         Amphenol
            SFP Part Number:    NDCCGF-N103
            SFP Serial Number:  APF7654321
