Controller offline after replacement due to faulty BMC on NetApp EF300
- Views:
- 10
- Visibility:
- Public
- Votes:
- 0
- Category:
- e-series-systems
- Specialty:
- esg
- Last Updated:
- 4/17/2025, 3:18:13 PM
Applies to
Issue
- controller replaced due to
"LOCKDOWN_CAUSE_BMC_UNRECOVERABLE"
- short after insert/power on controller goes offline
- stale data seen in
STATE-CAPTURE-DATA
TRACE-BUFFERS
on surviving controller reportsAlt Ctl is dead/missing
:
08:39:37.093838 symTask2 mel C0022 MelE Major Event:0x400d Cat:0x4 Pri:0 Log:0 Action:0x0 Origin:0x0
ID:0x0 LUN:0x0 Dev:0x0 Data:0x00000000
Controller placed online
08:39:37.093842 symTask2 icon mffff Setting CheckIn Count=0
08:39:37.093843 symTask2 vkiCE ffff CE_NOTE 04/15/25-08:39:37.948 releasing alt ctl from reset
08:39:37.343873 symTask2 lem vffff CallBack: Alt inserted
08:39:37.344017 symTask2 dbm d0000 blkset Allocate FS Block: 172585 172586 172587 172588 172589
08:39:37.344338 symTask2 dbm d0000 blkset Free FS Block: 172583 172584 170124 170127 172582
08:39:37.391617 symTask2 mel C0022 MelE Major Event:0x5023 Cat:0x3 Pri:0 Log:0 Action:0x0 Origin:0x0
ID:0x8000 LUN:0x0 Dev:0x0 Data:0x00000000
Controller return status/function call for requested operation
08:39:37.708192 tHckReset hck hffff AltRunningState Change: DontCare->Running
08:39:38.708278 tHckReset hck hffff AltRunningState Change: Running->Off
08:39:38.708279 tHckReset hck hffff altCtlFailed(1) Reset_Failure, current state:5 Repoll
08:39:38.708287 tHckReset vkiCE ffff CE_NOTE 04/15/25-08:39:38.948 HealthCheck: Alt Ctl: 1 Reset_Failure, state: 5 Repoll
08:46:08.717821 tHckReset hck hffff altCtlFailed(4) Norun_Failure, current state:5 Repoll
08:46:08.717828 tHckReset vkiCE ffff CE_NOTE 04/15/25-08:46:08.948 HealthCheck: Alt Ctl: 4 Norun_Failure, state: 5 Repoll
08:46:08.717832 tHckReset vkiCE ffff CE_NOTE 04/15/25-08:46:08.948 HealthCheckManager: Notify Event 6 Ctl_Not_Running
08:46:08.718234 tHckReset hck hffff Notify IocfiManagerE of event Ctl_Not_Running, Response = Do Nothing
08:46:08.718673 tHckReset ioni cffff ResetDrvNet
08:46:08.718677 tHckReset ion cffff IonMgr Health Check, alt ctrl not responsive, event:Ctl_Not_Running
08:46:08.718678 tHckReset ion cffff IonMgr Alt check in:False Reason: (HealthCheck) Ctl_Not_Running
08:46:09.013638 ProcessHandlers txn ffff MasterTxn Master txn: 0x1805baf80 Aborting peer msg, Alt Ctl is dead/missing
08:46:09.013639 ProcessHandlers txn ffff MasterTxn Setting m_IsAltCtlAlive to false for txn:0x1805baf80
08:46:09.014169 ProcessHandlers txn ffff MasterTxn Master txn: 0x1805baf80 Aborting peer msg, Alt Ctl is dead/missing