SwitchSNMPCommunication_Alert reported for NVIDIA Cluster Switch
Applies to
- NVIDIA Cluster Switch SN2100
- ONTAP 9
- 4-node MetroCluster IP (MCC-IP)
Issue
- ONTAP displays SwitchSNMPCommunication_Alert as follows:
Cluster::> system health alert show
Node: Cluster-01
Alert ID: SwitchSNMPCommunication_Alert
Resource: cluster-switch01
Severity: Major
Indication Time: Fri Aug 16 13:41:11 2024
Suppress: false
Acknowledge: false
Probable Cause: SNMP communication from the node to the ethernet
switch has failed repeatedly. Invalid SNMP settings
are configured with ONTAP Switch Health Monitoring or
on the Ethernet switch.
Possible Effect: Ethernet switch communication problems and
accessibility issues.
Corrective Actions: 1) Check the SNMPv2c community or SNMPv3 username on the Ethernet switch to verify
that the expected community string or username is configured.
To view the expected community string or username, run the "system switch ethernet show -snmp-config" command.
2) (SNMPv3) Verify that the SNMPv3 credentials are present within ONTAP.
To view the established SNMP logins, run the "security login show -application snmp" command.
If a custom engine-id was provided for the SNMPv3 user,
ensure it is same as that of the remote switch.
- The AutoSupport section CSHM-SWITCH-CONFIG.XML reports that the switch is not monitored because of invalid SNMP settings:
Device Name: cluster-switch01
Software Version Cumulus Linux version: 5.4.0
Reference Config File Version: NA
SNMP Version: SNMPv2c
Switch Monitoring Status: false
Reason For Not Monitoring: Invalid SNMP Settings
- The switches in both clusters report the same alert and monitoring status
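
When several switches need to be checked, the CSHM-SWITCH-CONFIG.XML fields shown above can be screened with a short script. The following is a minimal sketch only: it assumes the excerpt has already been copied out as plain "Key: Value" text exactly as displayed in this article (it does not parse the underlying XML), and the function names parse_fields and unmonitored_reason are hypothetical helpers, not part of ONTAP or any NetApp tool.

```python
# Sketch: screen a plain-text CSHM-SWITCH-CONFIG excerpt for unmonitored switches.
# Assumption: input is "Key: Value" text as shown in this article, not raw XML.

SAMPLE = """Device Name: cluster-switch01
Software Version Cumulus Linux version: 5.4.0
Reference Config File Version: NA
SNMP Version: SNMPv2c
Switch Monitoring Status: false
Reason For Not Monitoring: Invalid SNMP Settings"""


def parse_fields(text):
    """Split each line at its first colon and return a dict of key -> value."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and value.strip():
            fields[key.strip()] = value.strip()
    return fields


def unmonitored_reason(fields):
    """Return the stated reason if monitoring is disabled, otherwise None."""
    if fields.get("Switch Monitoring Status", "").lower() == "false":
        return fields.get("Reason For Not Monitoring", "unknown")
    return None


fields = parse_fields(SAMPLE)
reason = unmonitored_reason(fields)
if reason is not None:
    print(f"{fields['Device Name']}: not monitored ({reason})")
```

Running the same check against the excerpt from each switch in both clusters gives a quick list of the devices that report "Invalid SNMP Settings".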