NSM100 modules fail to read I2C EEPROM
Applies to
- NS224 shelf
- NSM100 shelf module
- ONTAP 9
Issue
- HA Group Notification (SHELF COOLING UNIT FAILED) ERROR alerts continuously being reported
- No disk path loss or data access
- Both NSM100 modules sporadically go missing in sysconfig -a:
=-=-=-=-=-= Tue Jan 03 2023, 20:32:26 +0600 SYSCONFIG-A 2 lines Shelf 1: NS224NSM100 Firmware rev. NSM100 A: NSM100 B: 0163 Shelf 1: NS224NSM100 Firmware rev. NSM100 A: NSM100 B: 0163 =-=-=-=-=-= Tue Jan 03 2023, 20:47:44 +0600 SYSCONFIG-A 2 lines Shelf 1: NS224NSM100 Firmware rev. NSM100 A: 0163 NSM100 B: Shelf 1: NS224NSM100 Firmware rev. NSM100 A: 0163 NSM100 B:
- All NSM100 temperature and fan sensors show NOT AVAILABLE:
Fans:
Element Status Status Bytes Status Descriptions
1: NOT AVAILABLE 07,00,00,70 OFF, FAIL
2: NOT AVAILABLE 07,00,00,70 OFF, FAIL
3: NOT AVAILABLE 07,00,00,70 OFF, FAIL
4: NOT AVAILABLE 07,00,00,70 OFF, FAIL
5: NOT AVAILABLE 07,00,00,70 OFF, FAIL
6: NOT AVAILABLE 07,00,00,70 OFF, FAIL
7: NOT AVAILABLE 07,00,00,70 OFF, FAIL
8: NOT AVAILABLE 07,00,00,70 OFF, FAIL
9: NOT AVAILABLE 07,00,00,70 OFF, FAIL
10: NOT AVAILABLE 07,00,00,70 OFF, FAIL
11: NOT AVAILABLE 07,00,00,70 OFF, FAIL
12: NOT AVAILABLE 07,00,00,70 OFF, FAIL
13: NOT AVAILABLE 07,00,00,70 OFF, FAIL
14: NOT AVAILABLE 07,00,00,70 OFF, FAIL
15: NOT AVAILABLE 07,00,00,70 OFF, FAIL
16: NOT AVAILABLE 07,00,00,70 OFF, FAIL
17: NOT AVAILABLE 07,00,00,70 OFF, FAIL
18: NOT AVAILABLE 07,00,00,70 OFF, FAIL
19: NOT AVAILABLE 07,00,00,70 OFF, FAIL
20: NOT AVAILABLE 07,00,00,70 OFF, FAIL
Temperature Sensors:
Element Status Status Bytes Status Descriptions
1: NOT AVAILABLE 07,00,14,00
2: NOT AVAILABLE 07,00,14,00
3: NOT AVAILABLE 07,00,14,00
4: NOT AVAILABLE 07,00,14,00
5: NOT AVAILABLE 07,00,14,00
6: NOT AVAILABLE 07,00,14,00
7: NOT AVAILABLE 07,00,14,00
8: NOT AVAILABLE 07,00,14,00
9: NOT AVAILABLE 07,00,14,00
10: NOT AVAILABLE 07,00,14,00
11: NOT AVAILABLE 07,00,14,00
12: NOT AVAILABLE 07,00,14,00
13: NOT AVAILABLE 07,00,14,00
14: NOT AVAILABLE 07,00,14,00
15: NOT AVAILABLE 07,00,14,00
16: NOT AVAILABLE 07,00,14,00
17: NOT AVAILABLE 07,00,14,00
- NSM100 module logs report VPD and EEPROM read failures:
Mon Jan 905:44:04 2023 ( 12+20:47:08.976); 020002E8; I0; HAL; hal; 02;VPD_PCM_Read: start reading pcm[0] i2cAddr:0xa0 bitMsk:0x6 Mon Jan 9 05:44:09 2023 ( 12+20:47:14.086); 020000F5; I0;HAL; hal; 02; Failed to read I2C EEPROM. Status=7 Mon Jan 9 05:44:14 2023 ( 12+20:47:19.086); 020000F5; I0;HAL; hal; 02; Failed to read I2C EEPROM. Status=7 Mon Jan 9 05:44:19 2023 ( 12+20:47:24.086); 020000F5; I0;HAL; hal; 02; Failed to read I2C EEPROM. Status=7 Mon Jan 9 05:44:19 2023 ( 12+20:47:24.086); 0200010D; I0;HAL; hal; 02; PCM 1 VPD: Read failed (bus=1, addr=0xA0) Mon Jan 9 05:44:19 2023 ( 12+20:47:24.288); 020001B0; I0;HAL; hal; 02; PCM 1 VPD CRC: 0xFFE98F5C Mon Jan 9 05:44:19 2023 ( 12+20:47:24.288); 02000110; I0;HAL; hal; 02; PCM 1 VPD Structure: 0x01
storage fault -v
shows midplane VPD failures for both modules:
Element Status Status Bytes Status Descriptions
1 [NSM100 A] : NONCRITICAL 03,84,00,E0 MIDPLANE VPD FAULT, ADDITIONAL STATUS AVAILABLE, RESERVED, EXCEPTION DATA VALID, LOG DATA VALID
2 [NSM100 B] : NONCRITICAL 03,8C,00,E0 MIDPLANE VPD FAULT, MASTER, ADDITIONAL STATUS AVAILABLE, RESERVED, EXCEPTION DATA VALID, LOG DATA