NS224模块固件不匹配并出现多个环境错误
适用场景
- NS224 架
- NSM100 固件升级
问题描述
- 两个模块均正常工作、并且系统处于多路径中、可正常提供数据。
- 在NSM100模块上升级固件期间、自动升级过程从不完成: 示例:
Tue Jun 14 18:42:52 [node_name-02: dsa_disc: ses.mismatch.fw.version:error]: The disk shelf modules on disk shelf 0x.0 are running two different firmware versions. Disk shelf module A is running 0121, and disk shelf module B is running .
 Tue Jun 14 18:42:52 [node_name-02: dsa_disc: sfu.firmwareDownrev.shelf:error]: Shelf 0x.shelf0 has downrev firmware.
- 多个错误仅与一个NSM100 B相关示例:
::> storage shelf show -shelf 1.0 -instance
       Shelf Name: 1.0
        Stack ID: 1
        Shelf ID: 0
           ...
       Shelf State: Online
         Status: Normal
 Boot device "2" error detected.
 Temperature reported by temperature sensor "17" exceeds the specifications for the disk shelf or its components.
 Temperature reported by temperature sensor "16" exceeds the specifications for the disk shelf or its components.
 Temperature reported by temperature sensor "15" exceeds the specifications for the disk shelf or its components.
 Temperature reported by temperature sensor "14" exceeds the specifications for the disk shelf or its components.
 Temperature reported by temperature sensor "13" exceeds the specifications for the disk shelf or its components.
 Temperature reported by temperature sensor "12" exceeds the specifications for the disk shelf or its components.
 DIMM "8" error detected. DIMM is located in the DIMM slot 4 in the bottom shelf module (B).
 DIMM "7" error detected. DIMM is located in the DIMM slot 3 in the bottom shelf module (B).
 DIMM "6" error detected. DIMM is located in the DIMM slot 2 in the bottom shelf module (B).
 DIMM "5" error detected. DIMM is located in the DIMM slot 1 in the bottom shelf module (B).
 Critical error detected in module "2".
 Coin cell battery "2" error detected.
- "sysconfig -M"输出中未报告模块B。示例:
::> system node run -node node_name-01 -command sysconfig -M
 ...
 !NS224NSM100-MODULE!012345678910!111-04256+B3!1D!!
 !NS224NSM100-MODULE!!!!!
- 重新启动、重新安装和更换NSM100 B后、问题描述 仍会显示。
- 通过软重新启动NSM100 A、可以临时发现模块B、而不会出现任何问题。
- 硬重新启动NSM100 A会复制初始问题描述 、模块B会报告错误。