StorageGRID Hotfix Installation Fails on Primary Admin Node with Authentication Errors
Applies to
- NetApp StorageGRID
- StorageGRID Primary Admin Node
Issue
After upgrading StorageGRID from version 11.9.0.8 to 12.0.0.1, the hotfix installation on the primary admin node fails. All services on the node remain stopped, and attempts to manually start services result in warnings about an unapplied hotfix and potential data loss. Log output indicates SSH authentication failures and errors processing the hotfix.
Sample errors from logs:
root@PA-ADMIN:/var/local/hotfix/historical-logs/hotfix-install-2025-12-09T09:20:21Z.log
[2025-12-09T09:20:42.161491 #4701] INFO -- install: from /usr/sbin/run-host-command:161:in `<main>'
[2025-12-09T09:20:42.164024 #4701] ERROR -- install: Error processing hotfix
[2025-12-09T09:20:42.164073 #4701] ERROR -- install: Failed to execute sh command with 'run-host-command /var/local/dkr/0/tmp/hotfix-stage/hotfix/base-os-install --apply --no-start 2>&1 2>&1' on localhost - 1, /var/lib/gems/3.1.0/gems/net-ssh-7.3.0/lib/net/ssh.rb:275:in `start': Authentication failed for user root@127.0.0.1 (Net::SSH::AuthenticationFailed)
    from /usr/local/lib/site_ruby/bycast/commands/remote-sh.rb:66:in `block in call'
    from /usr/local/lib/site_ruby/bycast/commands/remote-sh.rb:60:in `synchronize'
    from /usr/local/lib/site_ruby/bycast/commands/remote-sh.rb:60:in `call'
    from /usr/sbin/run-host-command:161:in `<main>'
...
Additionally, the files /tmp/hotfix-boot-error and /tmp/hotfix-execute-failure are present on the node, and attempts to SSH as root on port 8022 fail with “Permission denied”.
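The symptoms described above can be checked with a short script. The following is a minimal sketch in Python, not a NetApp-provided tool: the function names summarize_hotfix_log and present_marker_files are hypothetical, and the log pattern is inferred only from the sample entries shown in this article, so other log formats may not match. It assumes a Python 3 interpreter is available wherever the log file is examined.

```python
import re
from pathlib import Path

# Marker files named in the Issue section of this article.
MARKER_FILES = ("/tmp/hotfix-boot-error", "/tmp/hotfix-execute-failure")

# Pattern inferred from the sample entries above, for example:
# [2025-12-09T09:20:42.164024 #4701] ERROR -- install: Error processing hotfix
LOG_ENTRY = re.compile(
    r"\[(?P<timestamp>\d{4}-\d{2}-\d{2}T[\d:.]+) #(?P<pid>\d+)\]\s+"
    r"(?P<level>[A-Z]+) -- (?P<source>[^:]+): (?P<message>.*)"
)


def summarize_hotfix_log(log_text):
    """Collect ERROR messages and flag the SSH authentication failure."""
    errors = []
    for line in log_text.splitlines():
        match = LOG_ENTRY.search(line)
        if match and match.group("level") == "ERROR":
            errors.append(match.group("message"))
    return {
        "errors": errors,
        # Exception class name as it appears in the sample log above.
        "ssh_auth_failed": "Net::SSH::AuthenticationFailed" in log_text,
    }


def present_marker_files(paths=MARKER_FILES):
    """Return the marker files that currently exist on this host."""
    return [p for p in paths if Path(p).exists()]
```

To use it, read a log from /var/local/hotfix/historical-logs/ with Path(...).read_text() and pass the text to summarize_hotfix_log. If ssh_auth_failed is true and present_marker_files() returns both paths, the node matches the symptoms in this article.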