Both SAS ports offline after checksum errors on multiple disks
Applies to
- SAS shelf connectivity
- X357_S163A3T8ATE NA54 drives
- MetroCluster IP
Issue
- Checksum errors reported for 3 particular drives.
- Checksum errors repaired during scrub.
- Primary SAS adapter taken offline because of checksum verification failures on the 3 drives.
- Secondary SAS adapter subsequently taken offline for the same reason.
- Example:
[node_name: raidio_thread: raid_read_cksum_embed_1:notice]: params: {'owner': '', 'disk_info': 'Disk /aggregate/plex0/rg0/0a.01.1 Shelf 1 Bay 1 [NETAPP X357_S163A3T8ATE NA54]...
[node_name: raidio_thread: raid_rg_readerr_repair_cksum_error_1:notice]: params: {'disk_rpm': 'N/A', 'vendor': 'NETAPP ', 'firmware_revision': 'NA54', 'shelf': '1', 'disk_info': 'Disk /aggregate/plex0/rg0/0a.01.1 Shelf 1 Bay 1 [NETAPP X357_S163A3T8ATE NA54] ...
...
[node_name: disk_admin: disk.checksum.offlineAdapter:alert]: Adapter 0a taken offline due to checksum verification failure on multiple disks. Keep the adapter offline and contact NetApp technical support for assistance.
[node_name: disk_admin: callhome.hba.failed:EMERGENCY]: Call home for Write Verification Error: Port 0a Failed. Keep the adapter offline and contact NetApp technical support for assistance.
[node_name: asd_asyncd_0: sas.adapter.offline:info]: SAS adapter 0a is now offline.
[node_name: raidio_thread: raid_read_cksum_embed_1:notice]: params: {'owner': '', 'disk_info': 'Disk /aggregate/plex0/rg0/0b.01.1 Shelf 1 Bay 1 [NETAPP X357_S163A3T8ATE NA54]...
[node_name: raidio_thread: raid_rg_readerr_repair_cksum_error_1:notice]: params: {'disk_rpm': 'N/A', 'vendor': 'NETAPP ', 'firmware_revision': 'NA54', 'shelf': '1', 'disk_info': 'Disk /aggregate/plex0/rg0/0b.01.1 Shelf 1 Bay 1 [NETAPP X357_S163A3T8ATE NA54] ...
[node_name: disk_admin: disk.checksum.offlineAdapter:alert]: Adapter 0b taken offline due to checksum verification failure on multiple disks. Keep the adapter offline and contact NetApp technical support for assistance.
[node_name: disk_admin: callhome.hba.failed:EMERGENCY]: Call home for Write Verification Error: Port 0b Failed. Keep the adapter offline and contact NetApp technical support for assistance.
[node_name: asd_asyncd_0: sas.adapter.offline:info]: SAS adapter 0b is now offline.
[node_name: config_thread: callhome.checksum.multiple:alert]: Call home for CHECKSUM ERROR (multiple disks)
- Issue remains after controller replacement.
- Issue remains when using a SAS port on an external card.
- Issue remains after scrub completion.
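
To confirm that a saved log excerpt matches this pattern, the messages can be summarized with a short script. The following is only a minimal sketch in Python: the function name `summarize` and both regular expressions are illustrative assumptions based solely on the example messages above, and real EMS output may be formatted differently.

```python
import re
from collections import defaultdict

# Assumed patterns, derived only from the example messages in this article.
DISK_RE = re.compile(
    r"raid_read_cksum_embed_1:notice\]:.*?'disk_info': 'Disk (\S+)"
)
OFFLINE_RE = re.compile(
    r"sas\.adapter\.offline:info\]: SAS adapter (\w+) is now offline"
)


def summarize(lines):
    """Return (disks_by_adapter, offline_adapters) for an iterable of log lines.

    disks_by_adapter maps an adapter name (for example "0a") to the set of
    disk names that logged an embedded checksum read error through it.
    offline_adapters lists adapters reported offline, in order of appearance.
    """
    disks_by_adapter = defaultdict(set)
    offline_adapters = []
    for line in lines:
        match = DISK_RE.search(line)
        if match:
            # Path looks like /aggregate/plex0/rg0/0a.01.1; keep the last part.
            disk_name = match.group(1).rsplit("/", 1)[-1]
            # The adapter is the part of the disk name before the first dot.
            adapter = disk_name.split(".", 1)[0]
            disks_by_adapter[adapter].add(disk_name)
            continue
        match = OFFLINE_RE.search(line)
        if match and match.group(1) not in offline_adapters:
            offline_adapters.append(match.group(1))
    return dict(disks_by_adapter), offline_adapters
```

If the same shelf and bay positions appear under both adapters and both adapters are listed as offline, the log matches the symptom described in this article.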