Active IQ Unified Manager web interface goes unresponsive due to 'Too many open files' error caused by file descriptor leak
Applies to
Active IQ Unified Manager (AIQUM) 9.9 or earlier
Issue
- AIQUM WebUI is unresponsive
- API calls triggered fail with
Error 500 - Internal Server Error - Grafana fails to create report of AIQUM
server.logshowsToo many open fileserror:
ERROR [org.jboss.as.server.deployment.scanner] (DeploymentScanner-threads - 1) WFLYDS0012: Scan of /opt/netapp/essentials/jboss/standalone/deployments threw Exception: java.lang.RuntimeException: WFLYDS0032: Failed to list files in directory /opt/netapp/essentials/jboss/standalone/deployments. Check that the contents of the directory are readable.
...
Caused by: java.nio.file.FileSystemException: /opt/netapp/essentials/jboss/standalone/deployments: Too many open files
Exception handling request to /apis/XMLrequest: java.lang.RuntimeException: java.io.IOException: Cannot run program "/opt/netapp/essentials/jboss/bin/native/lib64/authenticate": error=24, Too many open files
ocumserver.logindicatesGot IO exception while processing access_logwithToo many open filesduringAccess Log Task
INFO [oncommand] [Access Log Task] [com.netapp.ipc.util.AccessLogTask] <YEAR>-<MONTH>-<DATE> is older than 30 days
ERROR [oncommand] [Access Log Task] [com.netapp.ipc.util.AccessLogTask] Got IO exception while processing access_log
java.nio.file.FileSystemException: /var/log/ocie/<YEAR>-<MONTH>-<DATE>: Too many open files
- Command
lsof -p `cat /var/run/ocie.pid` | awk '{print $9}' | sort | grep "/var/log/ocie/20" | uniq -cshows large numbers of file descriptors for/var/log/ocie/<YEAR>-<MONTH>-<DATE>directories that are older than 30 days
174 /var/log/ocie/2019-08-26
172 /var/log/ocie/2019-08-27
:
34 /var/log/ocie/2021-07-13
98 /var/log/ocie/2021-07-14
98 /var/log/ocie/2021-07-15
40 /var/log/ocie/2021-07-16
