BeeGFS metadata nodes down due to out of memory errors
Applies to
NetApp E-Series with BeeGFS (BeeGFS-SW)
Issue
- The BeeGFS file system is unavailable because multiple compute nodes have become inaccessible or are powered off.
- Pacemaker fencing events are observed in logs, for example:
Feb 23 08:16:35.774 hosname pacemaker-fenced[70525](handle_fence_request) notice: Client pacemaker-controld.70529 wants to fence (off) hosname using any device
Feb 23 09:13:17.424 hosname pacemaker-fenced[70525](handle_fence_request) notice: Client pacemaker-controld.70529 wants to fence (off) hosname using any device
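To confirm how often fencing was requested, the log can be scanned for entries of the form shown above. The following is a minimal sketch, not a NetApp or Pacemaker tool: the function name `find_fence_requests` and the regular expression are illustrative assumptions derived only from the sample entries in this article, so other Pacemaker versions may format the message differently.

```python
import re

# Pattern inferred from the sample log entries in this article (an assumption,
# not an official Pacemaker log grammar).
FENCE_RE = re.compile(
    r"(?P<timestamp>\w{3}\s+\d+\s+[\d:.]+)\s+(?P<host>\S+)\s+"
    r"pacemaker-fenced\[\d+\]\s*\(handle_fence_request\)\s+notice:\s+"
    r"Client\s+(?P<client>\S+)\s+wants to fence \((?P<action>\w+)\)\s+(?P<target>\S+)"
)


def find_fence_requests(lines):
    """Return one dict per fence request found in the given log lines."""
    events = []
    for line in lines:
        # finditer also copes with several entries fused onto one line.
        for match in FENCE_RE.finditer(line):
            events.append(match.groupdict())
    return events


# The two sample entries from this article.
sample = [
    "Feb 23 08:16:35.774 hosname pacemaker-fenced[70525](handle_fence_request) "
    "notice: Client pacemaker-controld.70529 wants to fence (off) hosname using any device",
    "Feb 23 09:13:17.424 hosname pacemaker-fenced[70525](handle_fence_request) "
    "notice: Client pacemaker-controld.70529 wants to fence (off) hosname using any device",
]

for event in find_fence_requests(sample):
    print(event["timestamp"], event["action"], event["target"])
```

In practice the lines would come from the Pacemaker log file on each node. Its location depends on the installation, so check the cluster configuration rather than assuming a path.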
