Skip to main content
NetApp Knowledge Base

Health Monitor process nchm: StorageFCAdapterFault_Alert

Views:
451
Visibility:
Public
Votes:
0
Category:
metrocluster
Specialty:
metrocluster
Last Updated:

Applies to

  • Fabric-attached MetroCluster
  • ONTAP 9

Issue

  • EMS reports Health Monitor process nchm: StorageFCAdapterFault_Alert.

Sun May 15 02:27:00 HKT [nodeA: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process nchm: StorageFCAdapterFault_Alert[100000109b42235f].

  • There are timeout errors in the EMS when accessing disks.

Sun May 15 02:09:19 HKT [nodeA: slifc_timeout_2: fci.device.quiesce:debug]: Adapter 2d encountered a command timeout on Disk device T1_Brocade6505B:9.126 (0x02080900) LUN 62 cdb 0x9a:000000002d562200:0001:0200 retry: 0 Quiescing the device.
Sun May 15 02:09:20 HKT [ndoeA: slifc_timeout_2: fci.device.timeout:debug]: HBA 2d encountered a device timeout on Disk device T1_Brocade6505B:9.126 (0x02080900) LUN 62 cdb 0x9a:000000002d562200:0001:0200 retry: 0

  • Large number of transport error on the effeted port.

hard_reset_count                29
Manual adapter dump count 0
Auto adapter dump count 0
firmware_fault_count            0
firmware_pause_count            0
device status:           60900  80900
  link_fail_count             0      0    total:  0
  lip_count                   0      0    total:  0
  underrun_count              0      0    total:  0
  overrrun_count              0      0    total:  0
  transport_error_count    2085   1407    total:  3492
  crc_error_count             0      0    total:  0
  victim_abort_io_count      24     23    total:  47
  timeout_io_count           36      1    total:  37
  logged_out_count           11      6    total:  17
  dma_error_count             0      0    total:  0
  resource_unavail_count      0      0    total:  0
  data_reassembly_count       0      0    total:  0
  device_quiesce_count      216    218    total:  434

  • porterrshow shows large number of crc err and disc c3.

hshshshshshs          frames      enc    crc    crc    too    too    bad    enc   disc   link   loss   loss   frjt   fbsy  c3timeout    pcs    uncor
       tx     rx      in    err    g_eof  shrt   long   eof     out   c3    fail    sync   sig                  tx    rx     err    err
  0:    2.0g   3.1g   0      0      0      0      0      0      0     26      0      0      0      0      0      0      0      0      0
  1:    0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  2:    4.2g   4.0g   0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  3:    1.8g   3.7g   0     43.0k  42.3k   0     28    738      0     43.8k  34      0      1.3k   0      0     43.7k   1     46.9m   0
  4:    4.1g   1.7g   0      0      0      0      0      0      0      2      0      0      0      0      0      0      0      0      0
  5:    0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  6:    2.6g   4.0g   0      0      0      0      0      0      0      4      0      0      0      0      0      0      0      0      0
  7:    4.2g   4.1g   0      0      0      0      0      0      0     28      0      0      0      0      0      0      0      0      0
  8:    1.2g   2.2g   0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  9:    1.8g   2.5g   0      0      0      0      0      0      0     24.5k   2      0      2      0      0      0     24.5k   0      0

  • Gather SFP stats from the switch connected to the impacted Adapter port and verify if the Tx/Rx power is fine.

> sfpshow 6
Current:     0.000   mAmps
Voltage:     3374.8  mVolts
RX Power:    -2.3    dBm (591.5uW)
TX Power:    -inf    dBm (0.0   uW)

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.