System does not start after reboot with error "Unable to recover the local database of Data Replication Module"
Applies to
- All AFF and FAS platforms
- ONTAP 9
- ONTAP Select
- Power outage, controller relocation, upgrade, or other reboot
- ONTAP Select: a disk was improperly unfailed before the underlying virtual disk file was fixed
Issue
- Console logs for node show:
*******************************************************
* This is a serial console session. Output from this *
* session is mirrored on the SP console session. *
*******************************************************
************************* SYSTEM MESSAGES *************************
Internal error: Cannot open corrupt replicated database. Automatic recovery
attempt has failed or is disabled. Check the event logs for details. This node
is not fully operational. Contact support personnel for the root volume recovery
procedures.
- Node may panic during boot:
Warning: previous shutdown was dirty, there is a possible loss of data.
May 28 00:43:33 [node1:wafl.root.content.changed:error]: Contents of the root volume 'vol0' might have changed. Verify that all recent configuration changes are still in effect.
PANIC : NVRAM contents are invalid...
PANIC: NVRAM contents are invalid... in SK process rc on release 9.10.1P5 (C) on Wed May 28 00:43:33 GMT 2025
- EMS messages for node:
Mar 28 17:03:46 [NODE_B:rdb.recovery.failed:EMERGENCY]: Error: Unable to find a master. Unable to recover the local database of Data Replication Module: Management.
Mar 28 17:03:46 [NODE_B:spm.mgwd.process.exit:EMERGENCY]: Management Gateway (mgwd) subsystem with ID 1944 exited as a result of signal normal exit (0). The subsystem will attempt to restart.
- Node boots and it is possible to log in, but cluster commands do not return the correct output:
::> cluster show
Error: "show" is not a recognized command
::> set advanced
::*> cluster ring show
Error: "show" is not a recognized command
- The "ROOT VOLUME NOT WORKING PROPERLY: RECOVERY REQUIRED" error message is displayed during the boot/login phase.
- Some bootarg values related to corrupt and/or recovery are set in the LOADER.
Example:
LOADER-B> printenv bootarg.rdb_corrupt
Variable Name        Value
-------------------- --------------------------------------------------
bootarg.rdb_corrupt  0500000000
LOADER-B> printenv bootarg.init.boot_recovery
Variable Name        Value
-------------------- --------------------------------------------------
bootarg.init.boot_recovery 80
Note 1: If a bootarg is not set, it is displayed as undefined.
Note 2: The same value(s) can also be checked via AutoSupport, in the KENV section.
Example:
bootarg.rdb_corrupt="5550055000"
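
When reviewing saved LOADER output or the KENV section of an AutoSupport, the check above can be scripted. The following is a minimal sketch, not a NetApp tool: the function name find_bootargs is hypothetical, the input is assumed to be plain text in one of the two layouts shown in this article, and a literal value of "undefined" is assumed to mean the bootarg is not set. It only reports whether the two bootargs are set; it does not interpret what the values mean.

```python
import re

# Bootargs named in this article. Treating any defined value as "set" is an
# assumption; the article does not document the meaning of individual values.
WATCHED = ("bootarg.rdb_corrupt", "bootarg.init.boot_recovery")


def find_bootargs(text):
    """Return {name: value} for watched bootargs found in LOADER or KENV text."""
    found = {}
    for name in WATCHED:
        # Matches 'name value' (LOADER printenv) and 'name="value"' (KENV).
        # The name must start the line, so the 'printenv ...' prompt is skipped.
        pattern = re.compile(
            r'^\s*' + re.escape(name) + r'(?:\s+|=)"?([^"\s]+)"?\s*$',
            re.MULTILINE,
        )
        match = pattern.search(text)
        if match and match.group(1).lower() != "undefined":
            found[name] = match.group(1)
    return found


if __name__ == "__main__":
    sample = (
        "LOADER-B> printenv bootarg.rdb_corrupt\n"
        "Variable Name        Value\n"
        "bootarg.rdb_corrupt  0500000000\n"
    )
    for name, value in find_bootargs(sample).items():
        print(f"{name} is set to {value}")
```

An empty result means neither bootarg was found with a defined value in the supplied text; it does not by itself rule out the issue.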
