StorageGRID decommission stuck at erasure coding stage with permission issue
Applies to
StorageGRID 11.x
Issue
StorageGRID Decommission stuck at erasure coding, running and pausing the decommissioning task in the process.
Observed the below error messages in
bycast-err.log files:On node EC leader node:
Apr 24 17:36:40 <node_name> ADE: |<Node_details> ECJM CSSR 2025-04-24T17:36:40.916854| ERROR 0111 ECJM: 2938110359(decom 12724207,): Exception caught during decommissioning ENFORCE failed: !"VCS Size lookup failed during node decommissioning".Apr 24 17:36:41 <node_name> ADE: |21947927 0110368999 ECJM CSSR 2025-04-24T17:36:41.024222| ERROR 1081 PROC: Exception: /build/src/modules/ErasureCoding/EC_JobManager_Module/NodeDecommissionJob.cc(139): Throw in function findAffectedEcgs#012Dynamic exception type: boost::wrapexcept<std::runtime_error>#012std::exception::what: ENFORCE failed: !"VCS Size lookup failed during node decommissioning"#012On Node being Decommissioned:
Mar 24 05:04:31 <node_name> ADE: |<Node_details> CHUN VCSS 2025-03-24T05:04:31.884931| ERROR 0356 CHUN: getxattr failed for /var/local/rangedb/8/chunk/0FF221A8-9D00-46CD-B4B4-C41F6D40D5C2 attr user.size, error Permission deniedMar 24 06:24:18 <node_name> ADE: |12724207 0805568948 CHUN VCSS 2025-03-24T06:24:18.552685| ERROR 0356 CHUN: getxattr failed for /var/local/rangedb/8/chunk/0FF221A8-9D00-46CD-B4B4-C41F6D40D5C2 attr user.size, error Permission denied