Cassandra repair progress slow alert in StorageGRID 11.5
Applies to
StorageGRID OS 11.5.0.9
Issue
The Cassandra repair progress slow alert does not resolve after 48 hours.
- The Reaper Repaired Percentage dashboard shows the effective repair percentage staying flat and never reaching 100%.
reaper_commands.txt shows the following results:
Command: spreaper --reaper-host=localhost --reaper-port=9403 status-cluster storagegrid
"last_event": "postponed repair segment xxxxxx-xxxx-xxxx-xxxx-xxxxxxx because segment xxxxxx-xxxx-xxxx-xxxx-xxxxxxx is running on host xxx.x.x.x",
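When the saved output is long, a small script can help confirm whether it contains the postponed-segment event shown above. The following is only a sketch: the function name `find_postponed_events` is hypothetical, and it assumes nothing about the output beyond the `"last_event": "postponed repair segment ..."` text quoted in this article.

```python
import re

# Matches a "last_event" value that starts with "postponed repair segment".
# Only this text pattern is assumed; the rest of the output is not parsed.
POSTPONED_RE = re.compile(r'"last_event":\s*"(postponed repair segment[^"]*)"')


def find_postponed_events(text):
    """Return every postponed-segment event message found in the given text."""
    # Collapse whitespace so a message wrapped across lines reads as one string.
    return [" ".join(m.split()) for m in POSTPONED_RE.findall(text)]


if __name__ == "__main__":
    sample = (
        '"last_event": "postponed repair segment xxxxxx-xxxx-xxxx-xxxx-xxxxxxx '
        "because segment xxxxxx-xxxx-xxxx-xxxx-xxxxxxx is\n"
        'running on host xxx.x.x.x",'
    )
    for event in find_postponed_events(sample):
        print(event)
```

To check a real file, read its contents and pass the text to `find_postponed_events`.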
servermanager.log shows the following results:
2023-02-16 09:43:41 +0000 | cassandra-reaper | restart initiated
2023-02-16 09:43:49 +0000 | cassandra-reaper | cassandra-reaper ended
2023-02-16 09:43:53 +0000 | cassandra-reaper | starting reaper
2023-02-28 05:08:03 +0000 | cassandra-reaper | restart initiated
2023-02-28 05:08:11 +0000 | cassandra-reaper | cassandra-reaper ended
2023-02-28 05:08:14 +0000 | cassandra-reaper | starting reaper
2023-03-01 17:18:41 +0000 | cassandra-reaper | restart initiated
2023-03-01 17:18:49 +0000 | cassandra-reaper | cassandra-reaper ended
2023-03-01 17:18:53 +0000 | cassandra-reaper | starting reaper
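To see how often cassandra-reaper has restarted, the entries above can be filtered with a short script. This is a minimal sketch that assumes each entry sits on its own line in the `timestamp | service | message` layout shown here; the helper names `parse_entry` and `reaper_restarts` are hypothetical and not part of StorageGRID.

```python
from datetime import datetime


def parse_entry(line):
    """Split one 'timestamp | service | message' line; return None if it does not fit."""
    parts = [p.strip() for p in line.split("|", 2)]
    if len(parts) != 3:
        return None
    try:
        timestamp = datetime.strptime(parts[0], "%Y-%m-%d %H:%M:%S %z")
    except ValueError:
        return None  # not a timestamped entry in the assumed layout
    return timestamp, parts[1], parts[2]


def reaper_restarts(lines, service="cassandra-reaper"):
    """Return the timestamps of every 'restart initiated' entry for the service."""
    restarts = []
    for line in lines:
        entry = parse_entry(line)
        if entry and entry[1] == service and entry[2] == "restart initiated":
            restarts.append(entry[0])
    return restarts


if __name__ == "__main__":
    sample = [
        "2023-02-16 09:43:41 +0000 | cassandra-reaper | restart initiated",
        "2023-02-16 09:43:49 +0000 | cassandra-reaper | cassandra-reaper ended",
        "2023-02-16 09:43:53 +0000 | cassandra-reaper | starting reaper",
        "2023-02-28 05:08:03 +0000 | cassandra-reaper | restart initiated",
    ]
    for ts in reaper_restarts(sample):
        print(ts.isoformat())
```

For a real log, pass the open file object (or a list of its lines) to `reaper_restarts`.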