Storage node reports CASA error and system log indicates a network issue at that time
Applies to
StorageGRID 11.4
Issue
- Storage node reports CASA error.
- Cassandra service is still running.
nodetool status
command from the target Storage node shows DN (down).Bycast.log
records the below message:
Example:
Jan 15 05:26:27 XXXX ADE: |21565833 0217996451 DDSM SQRT 2021-01-15T05:26:27.491683| WARNING 0206 DDSM: Possible error in Cassandra gossip detected. Node with IP x.x.x.x appears down.
system.log
records the below message:
Example:
WARN [InternalResponseStage:255] 2021-01-15 05:26:48,874 Gossiper.java (line 1168) Resetting connection pool to /x.x.x.x, too many failed echo attempts: 15. This usually indicates a network issue.
- The following dashboards (
Support
-->Metrics
-->Cassandra Network Overview
) shows nothing happened at that time range.
“Missed Gossip Messages: Sender ”
“Missed Gossip Messages: Recipient”