H610S - BMC Self Test failed
Applies to
NetApp H610S
Issue
- In some cases, you will get persistent cluster faults:
BMC Self Test failed. This may impact IPMI based services and a BIOS/BMC update may be recommended.
- The following entries can be seen on
sf-master.info
2021-11-22T00:14:03.084577Z hci-stg-06 master-1[26236]: [EXPERR-4] [Util] 28031 GlobalPool-0 serviceshared/IpmiComponentMonitor.cpp:272:CheckHealth|BMC Self Test failed. Postponing Fault. mBmcSelfTestFailureCount=1 cNumFailedSelfTestsForFault=10 2021-11-23T13:24:28.274373Z hci-stg-06 master-1[26236]: [EXPERR-4] [Util] 28027 GlobalPool-0 serviceshared/IpmiComponentMonitor.cpp:272:CheckHealth|BMC Self Test failed. Postponing Fault. mBmcSelfTestFailureCount=1 cNumFailedSelfTestsForFault=10 2021-12-16T15:25:16.921931Z hci-stg-06 master-1[57297]: [EXPERR-4] [Util] 55937 GlobalPool-0 serviceshared/IpmiComponentMonitor.cpp:272:CheckHealth|BMC Self Test failed. Postponing Fault. mBmcSelfTestFailureCount=1 cNumFailedSelfTestsForFault=10 2021-12-16T21:18:54.078117Z hci-stg-06 master-1[57297]: [EXPERR-4] [Util] 55937 GlobalPool-0 serviceshared/IpmiComponentMonitor.cpp:272:CheckHealth|BMC Self Test failed. Postponing Fault. mBmcSelfTestFailureCount=1 cNumFailedSelfTestsForFault=10
- Along with the "BMC Self Test failed" cluster fault, any or all of the following conditions may also be present:
- BMC web GUI is inaccessible
- Node Offline events:
The SolidFire Application cannot communicate with node ID <#>
Node Offline nodeID=<#>
- Cannot ping or SSH to the BMC IP address
- Persistent cluster faults related to fans, power supplies, and system sensors. Examples:
Fan1A RPM is failed or missing.
Error checking sensor for Fan1B RPM
Error checking sensor for Inlet Temp
Error checking sensor for Exhaust Temp
- ipmitool commands fail with errors such as:
Could not open device at /dev/ipmi0 or /dev/ipmi/0 or /dev/ipmidev/0: No such file or directory
Get SEL Info command failed: Invalid command
Error sending Chassis Status command: Invalid command
Get Channel Info command failed: Invalid command