vCenter information in SolidFire AIQ has not been updated due to NOT_RESPONDING hosts
Applies to
- NetApp HCI
- NetApp SolidFire Active IQ (AIQ)
- There are hosts registered in vCenter that no longer exist
Example: output of dcli com vmware vcenter host list on vCenter
Command> dcli com vmware vcenter host list
|--------|------|----------------|-----------|
|host    |name  |connection_state|power_state|
|--------|------|----------------|-----------|
|host-xxx|host01|NOT_RESPONDING  |           |
|host-xxx|host02|NOT_RESPONDING  |           |
|host-xxx|host03|CONNECTED       |POWERED_ON |
|host-xxx|host04|CONNECTED       |POWERED_ON |
|--------|------|----------------|-----------|
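With many hosts registered, it can help to filter the table programmatically. The following is a minimal Python sketch, not a NetApp or VMware tool: it assumes the dcli output has been saved as text in the pipe-delimited layout shown above, and the function names parse_dcli_table and not_responding_hosts are hypothetical helpers introduced only for illustration.

```python
# Sketch: list hosts whose connection_state is NOT_RESPONDING, given the
# text of "dcli com vmware vcenter host list" in the table layout above.
# Assumption: every data row starts and ends with "|".

SAMPLE_OUTPUT = """\
Command> dcli com vmware vcenter host list
|--------|------|----------------|-----------|
|host    |name  |connection_state|power_state|
|--------|------|----------------|-----------|
|host-xxx|host01|NOT_RESPONDING  |           |
|host-xxx|host02|NOT_RESPONDING  |           |
|host-xxx|host03|CONNECTED       |POWERED_ON |
|host-xxx|host04|CONNECTED       |POWERED_ON |
|--------|------|----------------|-----------|
"""


def parse_dcli_table(text):
    """Return one dict per data row, keyed by the header cells."""
    header = None
    rows = []
    for raw in text.splitlines():
        line = raw.strip()
        # Skip the prompt line and anything that is not a table row.
        if not (line.startswith("|") and line.endswith("|")):
            continue
        # Skip separator rows made only of "|" and "-".
        if set(line) <= {"|", "-"}:
            continue
        # Drop the empty strings produced by the outer pipes.
        cells = [cell.strip() for cell in line.split("|")[1:-1]]
        if header is None:
            header = cells
            continue
        rows.append(dict(zip(header, cells)))
    return rows


def not_responding_hosts(rows):
    """Return the names of hosts reported as NOT_RESPONDING."""
    return [r["name"] for r in rows if r["connection_state"] == "NOT_RESPONDING"]


if __name__ == "__main__":
    for name in not_responding_hosts(parse_dcli_table(SAMPLE_OUTPUT)):
        print(name)
```

The hosts it prints are only candidates: confirm in vCenter that each one really no longer exists before acting on it.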
Issue
- Information for vCenter and compute nodes in SolidFire AIQ has not been updated
- mnode_hci-monitor.log on the management node (mNode) shows errors:
get_nma_and_mnode_stats-directive-monitor-<vCenter-UUID>:[sf.mon.aiq:post_data:245]DEBUG:Published data to AIQ. Response [400]
get_nma_and_mnode_stats-directive-monitor-<vCenter-UUID>:[sf.mon.aiq:post_data:247]ERROR:Failed to send support data to AIQ. HTTP response code [400]
SF-VCAlarm-Monitor:[sf.mon.mediator:get_monitor_pairs:104]ERROR:Exception while speaking to dispatcher: 503 Service Unavailable
- At the time of the 503 Service Unavailable error, vpxd.log on vCenter shows the following errors:
error vpxd[21325] [Originator@6876 sub=Vmomi opID=492a9a9] Caught exception while sending activation result; <<xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx, <TCP '127.0.0.1 : 8085'>, <TCP '127.0.0.1 : 48066'>>, storageSystem-xxx, vim.host.StorageSystem.GetFileSystemVolumeInfo>, N5Vmomi5Fault11SystemError9ExceptionE(Fault cause: vmodl.fault.SystemError
--> )
--> [context]<CONTEXT>[/context]
error vpxd[21325] [Originator@6876 sub=Http2Session #2 opID=492a9a9] [Stream #70042349] Transaction was destroyed before completing in state: 1; The handler probably needs to be fixed to always complete. Now resetting the stream...
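
To check whether a management node shows the same signature, the mnode_hci-monitor.log errors above can be counted with a short script. This is a rough Python sketch under stated assumptions: the regular expressions are derived only from the sample lines in this article, the log location is not given here and must be supplied by the reader, and summarize_monitor_log is a hypothetical helper name.

```python
import re
import sys
from collections import Counter

# Patterns derived from the sample log lines in this article (assumption:
# other releases may word these messages differently).
AIQ_POST_FAILED = re.compile(
    r"ERROR:Failed to send support data to AIQ\. HTTP response code \[(\d+)\]"
)
DISPATCHER_ERROR = re.compile(
    r"ERROR:Exception while speaking to dispatcher: (\d{3})\b"
)


def summarize_monitor_log(lines):
    """Count AIQ post failures and dispatcher errors per HTTP status code."""
    aiq_failures = Counter()
    dispatcher_errors = Counter()
    for line in lines:
        match = AIQ_POST_FAILED.search(line)
        if match:
            aiq_failures[match.group(1)] += 1
            continue
        match = DISPATCHER_ERROR.search(line)
        if match:
            dispatcher_errors[match.group(1)] += 1
    return aiq_failures, dispatcher_errors


if __name__ == "__main__":
    # Usage: python scan_monitor_log.py <path to mnode_hci-monitor.log>
    # The path is supplied by the reader; this article does not state it.
    with open(sys.argv[1], errors="replace") as handle:
        aiq, dispatcher = summarize_monitor_log(handle)
    print("AIQ post failures by status:", dict(aiq))
    print("Dispatcher errors by status:", dict(dispatcher))
```

Non-zero counts for status 400 and 503 together match the pattern described in this article, but they do not by themselves prove that stale hosts are the cause.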
