H610S stuck in kernel crash dump loop after driveFailed/DriveMissing alerts
Applies to
- NetApp H610S
- NetApp Element software 12.2
- The drive has already been replaced
Issue
- Error in NetApp SolidFire Active IQ:
driveFailed / DriveMissing
- The node cannot get past the kernel crash dump during boot:
A start job is running for Kernel crash dump (<elapsed time> / no limit)
- Eventually the dump fails because of space constraints, and it starts again after the reboot
- RTFI fails immediately with:
[UnhandledError pid=1 cmd=udevadm settle]
- sfscsiinfo_nvme does not show the failed drive in the RTFI CLI
- No logs can be exported
- Screenshots in Additional Information
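
Because no logs can be exported, the symptoms above usually have to be matched against console text captured by hand. The following is a minimal sketch of such a check, assuming the console output has been saved as plain text. The function name `match_symptoms` and the result keys are hypothetical and not part of any NetApp tool; only the two message strings are taken from this article.

```python
import re

# Signature strings copied from the Issue section of this article.
# Everything else (function name, result keys) is hypothetical.
CRASH_DUMP_RE = re.compile(r"A start job is running for Kernel crash dump")
RTFI_ERROR = "[UnhandledError pid=1 cmd=udevadm settle]"


def match_symptoms(console_text: str) -> dict:
    """Report which of the documented console signatures appear in the text."""
    return {
        "kernel_crash_dump_job": bool(CRASH_DUMP_RE.search(console_text)),
        "rtfi_udevadm_settle_error": RTFI_ERROR in console_text,
    }
```

A match on either key only indicates that the console text contains the corresponding message. It does not confirm the root cause.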