E-Series reports unreadable sectors detected on multiple drives due to a faulty controller
- Views:
- 3,058
- Visibility:
- Public
- Votes:
- 1
- Category:
- e-series-systems
- Specialty:
- esg
- Last Updated:
- 6/4/2024, 10:03:09 PM
Applies to
- E-Series
- SANtricity OS
Issue
- VDD errors (VDD logged an error, VDD repair started, VDD repair completed) are seen on multiple drives (from parsed MEL):
A:11/30/20 5:51:28 AM (05:51:28) 102043 201f VDD repair completed - Shelf 1, Bay A - SSID: 3, Devnum: 0x010017 LBA: 0x436886a0
----> Flags: 0x40202085 = READ: Read Operation, ERROR: IO Compl. w. Err, PARITY: Parity data, NOLOCK: Prevent lock during read err., PI: Error coding in effect, NOCACHE: CDB DPO cache lowest retention - Error: 0x844 = UA_MISCORRECTED_DATA_ERROR
A:11/30/20 5:51:28 AM (05:51:28) 102042 201e VDD repair started - Shelf 1, Bay A - SSID: 3, Devnum: 0x01000f
A:11/30/20 5:51:28 AM (05:51:28) 102041 201e VDD repair started - Shelf 1, Bay A - SSID: 3, Devnum: 0x010017
B:11/30/20 4:36:26 AM (04:36:26) 101853 2014 VDD logged an error - Shelf 1, Bay B - SSID: 0, Devnum: 0x01010b LBA: 0x0e19b600, Blocks: 0xb8 - Recovered
----> Flags: 0x200801 = READ: Read Operation, CURRENT: Read current data from cache, PI: Error coding in effect
----> Recovery: 0x2 = Reconstruction used, ASC: 0x1f = IOP_FAST_TIMEOUT_ERROR, Detection: 0xf80b0181
B:11/30/20 4:36:26 AM (04:36:26) 101852 2014 VDD logged an error - Shelf 1, Bay B - SSID: 3, Devnum: 0x010113 LBA: 0x30431628, Blocks: 0x8 - Recovered
----> Flags: 0x200801 = READ: Read Operation, CURRENT: Read current data from cache, PI: Error coding in effect
----> Recovery: 0x2 = Reconstruction used, ASC: 0x1f = IOP_FAST_TIMEOUT_ERROR, Detection: 0xf80b0181
- Unreadable sector(s) detected errors occur, which may accompany with data assurance mismatch detected:
A:11/30/20 5:39:26 AM (05:39:26) 102031 6700 Unreadable sector(s) detected data loss occurred - Volume volume04 - LBA: 0x10da21aaa <--CRITICAL
----> Physical Drive in Tray 1 Slot 23, LBA: 0x28f443aa
A:11/30/20 5:39:26 AM (05:39:26) 102030 2061 Data assurance mismatch detected - probable cause is cached data - Volume volume04 - ioType: DST_OUT, hwPIStatus: GUARD_ERROR, swPIStatus: GUARD_ERROR, Host ID: 65535, LBA: 0x10da2bb00
----> Expected Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
----> Found Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
A:11/30/20 5:39:26 AM (05:39:26) 102029 2070 Data assurance mismatch detected -- cached data error on both controllers - Volume volume04
A:11/30/20 5:39:26 AM (05:39:26) 102028 2061 Data assurance mismatch detected - probable cause is cached data - Volume volume04 - ioType: DST_OUT, hwPIStatus: GUARD_ERROR, swPIStatus: GUARD_ERROR, Host ID: 65535, LBA: 0x10da21a00
----> Expected Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
----> Found Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
- Those unreadable sectors are spread across multiple drives:
Volume LUN Accessible By Date/Time Volume LBA Drive Location Drive LBA Failure Type
volume01 1 Host Cluster cluster1 11/30/20 12:04:40 PM 0x3c08c9ee Shelf 1 Bay 5 0x2f119ee DA Error
volume03 3 Host Cluster cluster1 11/30/20 6:22:09 PM 0x10241c1ed Shelf 1 Bay 4 0x261838ed DA Error
volume04 4 Host Cluster cluster1 11/30/20 5:39:28 AM 0x10da21aaa Shelf 1 Bay 23 0x28f443aa DA Error
volume04 4 Host Cluster cluster1 11/30/20 5:39:28 AM 0x10da2bbac Shelf 2 Bay 22 0x289457ac DA Error
volume04 4 Host Cluster cluster1 11/30/20 5:39:28 AM 0x10da2bbad Shelf 2 Bay 22 0x289457ad DA Error
volume05 5 Host Cluster cluster1 11/30/20 10:42:42 AM 0xa806930a Shelf 1 Bay 10 0x3ee0d20a DA Error