Skip to main content
NetApp Knowledge Base

ONTAP upgrade failure on AFF-A900 node due to BMC communication issue

Views:
46
Visibility:
Public
Votes:
0
Category:
ontap-9
Specialty:
CORE
Last Updated:

Applies to

  • AFF A900 all-flash storage system
  • ONTAP 9.12.1P11 to 9.15.1P16 upgrade
  • MCC-IP (Multi-Cluster Consistency IP) environments
  • Baseboard Management Controller (BMC)

Issue

  • During an ONTAP upgrade from 9.12.1P11 to 9.15.1P16, the second node became unresponsive at boot and upgrade process stalled for over two hours..

console log:

---<<BOOT>>---
NetApp Data ONTAP 9.15.1P16
random: registering fast source Intel Secure Key RNG
nvme0: doorbell stride #2.
nvme0: 0% of timeout was used waiting for RDY.
nvme0: 0% of timeout was used waiting for RDY.
nvme0: Waiting on ctrlr at end of enable.
nvme0: 0% of timeout was used waiting for RDY.
nvd0: <0X331511900503A0SAM000PM9A30002T00025000> NVMe namespace sn:(S668NE0T301378)
nvd0: 1831420MB (3750748848 512 byte sectors)
IPMI device unit 0 rev. 1, firmware rev. 16.08, version 2.0, device support mask 0xbf
IPMI device unit 1 rev. 1, firmware rev. 16.08, version 2.0, device support mask 0xbf

  • From the BMC console, attempting a system power cycle, but the system failed to boot and displayed the same message as before.
  • After waiting for two and a half hours, the system eventually booted up to the Waiting for Giveback state without any errors.
  • After checking the EMS logs, splogs errors are occurring repeatedly, and SP-related logs are missing from both the Weekly and Full Autosupport reports.

EMS logs:

Mon Jan 26 05:30:00 +0900 [node-01: splog_main: splog_warnings_1:error]: params: {'sp_type': 'BMC', 'reason': 'splogd is running in degraded mode and having difficulty getting splogs from the SP FW'}

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.