PANIC: Invalid recovery point 172/8/0 in SK process cpr_thread on release 9.13.1P9
Applies to
- FAS 9000
- AFF A700
Issue
- Node panics with folllowing panic string:
PANIC: Invalid recovery point 172/8/0 in SK process cpr_thread on release 9.13.1P9 (C) on Wed May 22 11:51:18 UTC 2024 version: 9.13.1P9: Fri Apr 19 13:13:02 EDT 2024 compile flags: x86_64.optimize 0x000275ae734331b1: Add bad root port 128/3/0 (1) 0x000275ae73d5a4fc: Recovery pt 172/8/0 for 172/8/0 0x000275ae73d5ab82: Add recovery candidate 172/8/0 0x000275ae73d5b22e: Recovery pt 172/8/0 for 172/8/0 0x000275ae73d5f62e: Succeed in enqueuing 172/8/0 0x000275ae73d6092e: Succeed in waking up recovery thread 0x000275b9e4e3d7bf: Add bad root port 0/2/0 (1) 0x000275b9e5f839db: PCI NMI while recovery is in progress recursive PANIC: PCI Error NMI from device(s):ErrSrcID(CorrSrc(0x340),UCorrSrc(0)), RPT(0,2,0):PLX PCIE 8764 switch on Controller, Br[8764](3,8,0): Link down.
- In sp-latest-events we see the following:
Record 1860: Wed May 22 11:51:25.208700 2024 [IPMI.notice]: 9400 | 02 | EVT: 6f01ffff | IO11_Status | Assertion Event, "Absent"