System panic in the disk_health_mon process
Applies to
ONTAP 9.5
Issue
The EMS log shows the following event:
[node_name-01: disk_health_mon: shm.threshold.ratedLifeMax:alert]: shm: There are 1 drives that have reached the end of their rated life: 10c.1.3 (108%);
After this event, the HA pair can enter a panic loop with the following panic string:
PANIC: integer divide fault: num 18 code 0 cs:rip 0x20:0xffffffff8f3cd6ad rflags 0x10246 in SK process disk_health_mon on release 9.5P14 (C) on Thu Aug 19 07:36:20 +03 2021
version: 9.5P14: Mon Aug 17 01:34:42 EDT 2020
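To check whether a saved EMS log contains the rated-life event shown above, a short script can pull out the affected drives and their percentages. This is only a sketch: the function name drives_past_rated_life is hypothetical, and the "drive (NN%)" pattern is inferred from the single sample message in this article, so other message layouts may need a different expression.

```python
import re

# Matches one "drive (NN%)" entry, e.g. "10c.1.3 (108%)".
# Assumption: the format is inferred from the single sample message above.
DRIVE_RE = re.compile(r"(\S+)\s+\((\d+)%\)")


def drives_past_rated_life(ems_text):
    """Return (drive, percent) pairs from shm.threshold.ratedLifeMax events."""
    results = []
    for line in ems_text.splitlines():
        if "shm.threshold.ratedLifeMax" not in line:
            continue
        # Only inspect the text after "rated life:" to avoid stray matches.
        _, sep, tail = line.partition("rated life:")
        if not sep:
            continue
        results.extend((drive, int(pct)) for drive, pct in DRIVE_RE.findall(tail))
    return results


sample = (
    "[node_name-01: disk_health_mon: shm.threshold.ratedLifeMax:alert]: "
    "shm: There are 1 drives that have reached the end of their rated life: "
    "10c.1.3 (108%);"
)
print(drives_past_rated_life(sample))  # [('10c.1.3', 108)]
```

Run against the sample message from this article, the script reports drive 10c.1.3 at 108%.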