在所有 HA 邮箱磁盘崩溃升级到时出现永久错误 9.7P10
适用于
AFF A200
问题
- 将 ONTAP 从 9.5P6 崩溃升级到 9.7P10
Panic_Message: Permanent errors on all HA mailbox disks (while marshalling header) in SK process fmmbx_instanceWorker on release 9.7P10 (C)
- 只有一个节点报告:
node_name ERROR monitor.temp.unreadable: The controller temperature (Midplane 3 Temp) is not readable.
node_name ERROR monitor.temp.unreadable: The controller temperature (Midplane 4 Temp) is not readable.
node_name ERROR monitor.temp.unreadable: The controller temperature (Module A Expander Temp) is not readable.
node_name ERROR scsi.cmd.adapterHardwareErrorEMSOnly: Enclosure services device 0b.00.99: Adapter detected hardware error: HA status 0x6: cdb0x1c.
- 两个电源工作正常,但:
monitor.globalStatus.critical: Chassis temperature is too high..
monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1.
monitor.globalStatus.critical: Multiple fans has failed. Chassis temperature is too high..
monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1. Chassis temperature is too high..
monitor.globalStatus.critical: Multiple fans has failed. Chassis temperature is too high..
- 和平台传感器输出:
PSU2 FRU fru fault normal PSU_OFF
PSU1 FRU fru fault normal PSU_OFF
PSU2 Bad discrete fault normal TRUE
PSU1 Bad discrete fault normal TRUE
PSU2 discrete fault normal BAD
PSU1 discrete fault normal BAD
PSU2 ON discrete fault normal OFF
PSU1 ON discrete fault
- 服务处理器固件和内部交换机更新为最新版本时仍存在此问题。