When recovering failed volumes an unexpected controller reboot might occur
- Views:
- 73
- Visibility:
- Public
- Votes:
- 0
- Category:
- e-series-systems
- Specialty:
- esg
- Last Updated:
- 7/11/2024, 5:30:02 AM
Applies to
- E-Series
- SANtricity OS version prior to 11.80.1
- Unreadable Sectors (URS)
Issue
Events leading to the failure:
- Controller A rebooted
- While Controller A rebooting, Controller B lost connection to all drives. Drives and volumes have failed.
vdmRecoverAllRAIDVols
was run on Controller B to recover the failed volumes.- While
vdmRecoverAllRAIDVols
was still running, between the time after volumes were degraded and before the recovery was completed, controller B rebooted due to transaction timeout. - Controller B's reboot is stuck at
recoverRAIDVolumes
while handling interrupted Writes (IW) entries in NVSRAM.