DS460C shelf temperature sensor unknown and DCM critical
Applies to
- ONTAP 9
- DS460C Disk Shelves
Issue
- The following alerts are reported in the event logs:
[Node-01: dsa_disc: ses.shelf.drwr.phy.rateunkwn:alert]: Drive shelf "10", module "B", drawer "2", PHY "0" is reporting an unknown link rate.
[Node-01: statd: monitor.shelf.fault:alert]: Critical fault reported on disk storage shelf attached to channel 10a. Check fans, power supplies, disks, and temperature sensors.
- The STORAGE-SHELF section of the AutoSupport shows an unknown link rate for the affected drawer through the affected module:
Shelf name:    9b.shelf10
Shelf id:      10
Channel:       9b
Module:        B
Shelf UID:     50:XX:XX:XX:XX:XX:XX:XX
Shelf S/N:     SHJGDXXXX00380
Term switch:   N/A
Shelf state:   ONLINE
Module state:  OK
                        Partial Path   Link     Invalid   Running     Loss    Phy       CRC     Phy
Disk         Port       Timeout        Rate     DWord     Disparity   Dword   Reset     Error   Change
Id           State      Value (ms)     (Gb/s)   Count     Count       Count   Problem   Count   Count
------------------------------------------------------------------------------------------------------
[DCM1.0]     OK         0              12.0     0         0           0       0         0       7
[DCM1.1]     OK         0              12.0     0         0           0       0         0       7
[DCM1.2]     OK         0              12.0     0         0           0       0         0       7
[DCM1.3]     OK         0              12.0     0         0           0       0         0       7
[DCM2.0]     UNKWN LNK  0              NA       0         0           0       0         0       8
[DCM2.1]     UNKWN LNK  0              NA       0         0           0       0         0       8
[DCM2.2]     UNKWN LNK  0              NA       0         0           0       0         0       8
[DCM2.3]     UNKWN LNK  0              NA       0         0           0       0         0       8
[DCM3.0]     OK         0              12.0     0         0           0       0         0       7
- The STORAGE-FAULT section shows that the temperature sensors mapped to the affected drawer report a status of UNKNOWN and that the Drawer Control Module (DCM) for that drawer reports CRITICAL:
Enclosure Status: information
Channel: 1d
Shelf: 10
Shelf Type: DS460-12
Product Serial Number: SHJGDXXXX00380
Module Type: IOM12B
Temperature Sensors:
Element Status         Status Bytes  Status Descriptions  
 20: OK                01,00,35,00   
 21: UNKNOWN           06,00,33,00   
 22: OK                01,00,36,00   
 23: OK                01,00,35,00   
 24: OK                01,00,35,00   
 25: OK                01,00,39,00   
 26: UNKNOWN           06,00,3A,00   
 27: OK                01,00,3B,00   
 28: OK                01,00,3A,00   
 29: OK                01,00,3A,00  
Drawers Control Module:
Element Status                      Status Bytes  Status Descriptions
  1 [IOM12B A]    : OK                01,00,01,01   REPORT, TYPE BIT0
  2 [IOM12B A]    : OK                01,00,01,01   REPORT, TYPE BIT0
  3 [IOM12B A]    : OK                01,00,01,01   REPORT, TYPE BIT0
  4 [IOM12B A]    : OK                01,00,01,01   REPORT, TYPE BIT0
  5 [IOM12B A]    : OK                01,00,01,01   REPORT, TYPE BIT0
  6 [IOM12B B]    : OK                01,00,01,01   REPORT, TYPE BIT0
  7 [IOM12B B]    : CRITICAL          02,40,00,00   FAIL
  8 [IOM12B B]    : OK                01,00,01,01   REPORT, TYPE BIT0
  9 [IOM12B B]    : OK                01,00,01,01   REPORT, TYPE BIT0
 10 [IOM12B B]    : OK                01,00,01,01   REPORT, TYPE BIT0
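
When reviewing a long AutoSupport, it can help to pull out only the rows that match the symptoms above: PHYs whose port state is not OK, temperature sensors whose status is not OK, and Drawer Control Module elements whose status is not OK. The following Python sketch is one possible way to do that. It is not a NetApp tool; the function name `find_anomalies`, the `SAMPLE` text, and the regular expressions are assumptions modeled only on the output shown in this article, so they may need adjusting for other shelf types or ONTAP releases.

```python
import re

# Assumption: these patterns are modeled only on the sample output in this
# article; they are not an official ONTAP or AutoSupport format definition.
PHY_ROW = re.compile(r"^\[(DCM\d+\.\d+)\]\s+(.+?)\s+\d+\s+(\S+)\s")
SENSOR_ROW = re.compile(r"^(\d+):\s+(\w+)\s+[0-9A-F]{2}(?:,[0-9A-F]{2}){3}")
DCM_ROW = re.compile(r"^(\d+)\s+\[\S+\s+([AB])\]\s*:\s+(\w+)\s+[0-9A-F]{2}")


def find_anomalies(text):
    """Return the PHY, temperature sensor, and DCM rows that are not OK."""
    result = {"phys": [], "sensors": [], "dcms": []}
    section = None
    for raw in text.splitlines():
        line = raw.strip()
        # A line such as "Temperature Sensors:" starts a new element section.
        if line.endswith(":") and not line[:1].isdigit():
            section = line[:-1]
            continue
        phy = PHY_ROW.match(line)
        if phy:
            name, state, rate = phy.groups()
            if state != "OK":
                result["phys"].append((name, state, rate))
            continue
        dcm = DCM_ROW.match(line)
        if dcm:
            element, module, status = dcm.groups()
            if status != "OK":
                result["dcms"].append((int(element), module, status))
            continue
        sensor = SENSOR_ROW.match(line)
        # Only count "NN: STATUS bytes" rows under the temperature heading.
        if sensor and section == "Temperature Sensors":
            element, status = sensor.groups()
            if status != "OK":
                result["sensors"].append((int(element), status))
    return result


# A few rows copied from the output above, used here only as a demonstration.
SAMPLE = """\
[DCM1.3] OK             0       12.0        0           0       0        0         0       7
[DCM2.0] UNKWN LNK      0         NA        0           0       0        0         0       8
Temperature Sensors:
Element Status         Status Bytes  Status Descriptions
 20: OK                01,00,35,00
 21: UNKNOWN           06,00,33,00
Drawers Control Module:
Element Status                      Status Bytes  Status Descriptions
  6 [IOM12B B]    : OK                01,00,01,01   REPORT, TYPE BIT0
  7 [IOM12B B]    : CRITICAL          02,40,00,00   FAIL
"""

if __name__ == "__main__":
    for kind, rows in find_anomalies(SAMPLE).items():
        print(kind, rows)
```

Run against the full STORAGE-SHELF and STORAGE-FAULT text from this article, the sketch should list the four DCM2.x PHYs, temperature sensors 21 and 26, and DCM element 7 on module B.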
