Multiple StorageGRID nodes hosted on one Linux server hitting cgroup out of memory
Applies to
StorageGRID on a Linux server
Issue
- Multiple containerized nodes hosted on the same Linux server in unknown state
- LDR, Cassandra and possibly other services in error state
- Memory usage is increasing over time, but StorageGRID metrics do not show any services memory usage increasing over time
- Linux server's kern.log reporting
memory cgroup out of memory
:
Nov 7 06:44:18 sg-node0-zone1 kernel: [1491372.966233] Memory cgroup out of memory: Killed process 3364298 (java) total-vm:18302844kB, anon-rss:13080308kB, file-rss:0kB, shmem-rss:304kB, UID:1015 pgtables:28820kB oom_score_adj:0
Nov 7 07:39:34 sg-node0-zone1 kernel: [1494688.596115] Memory cgroup out of memory: Killed process 3963236 (ldr) total-vm:18208608kB, anon-rss:13018024kB, file-rss:0kB, shmem-rss:252kB, UID:1015 pgtables:29100kB oom_score_adj:0