报告"机箱配置无效(配对系统PCM不兼容)"
适用场景
- ONTAP 9
- 服务处理器(SP)
- 基板管理控制器(BMC)
问题描述
- EMS报告:
[?] Tue Jun 28 02:22:30 +0000 [node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 4 Temp) is not readable.
[?] Tue Jun 28 02:22:30 +0000 [node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 3 Temp) is not readable.
[?] Tue Jun 28 02:22:30 +0000 [node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 2 Temp) is not readable.
[?] Tue Jun 28 02:22:30 +0000 [node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 1 Temp) is not readable.[?] Tue Jun 28 02:22:43 +0000 [node-01: rlm_hbtrcv_non_blocking: sp.update.status:debug]: params: {'reason': 'sp_bootup_notify_servprocd: SP online handler has been called '}
[?] Tue Jun 28 02:22:44 +0000 [node-01: cf_worker: cf.hwassist.notifyCfgSuccess:debug]: params: {'hwtype': 'SP'}
[?] Tue Jun 28 02:23:16 +0000 [node-01: env_mgr: monitor.fan.warning:notice]: multiple fans have failed. Replace it to avoid overheating
[?] Tue Jun 28 02:23:46 +0000 [node-01: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: Multiple fans have failed
- pipmi/系统传感器显示无法读取传感器的报告SP:
PSU2 Bad invalid --
PSU1 Bad invalid --
PSU2 invalid --
PSU1 invalid --
PSU2 ON invalid --
PSU1 ON invalid --
PSU1 INFO FAILED
PSU1 INFO FAILED
PSU1 FRU MULTIFAULT
PSU2 FRU MULTIFAULT
Partner Status failed --
PSU1 Present PRESENT
PSU1 5V init_failed -- mV -- -- -- --
PSU1 12V init_failed -- mV -- -- -- --
PSU1 5V Curr init_failed -- mA -- -- -- --
PSU1 12V Curr init_failed -- mA -- -- -- --
PSU1 Fan 1 init_failed -- RPM -- -- -- --
PSU1 Fan 2 init_failed -- RPM -- -- -- --
PSU1 Inlet Temp init_failed -- C -- -- -- --
PSU1 Hotspot Temp init_failed -- C -- -- -- --
PSU2 Present PRESENT
PSU2 5V init_failed -- mV -- -- -- --
PSU2 12V init_failed -- mV -- -- -- --
PSU2 5V Curr init_failed -- mA -- -- -- --
PSU2 12V Curr init_failed -- mA -- -- -- --
PSU2 Fan 1 init_failed -- RPM -- -- -- --
PSU2 Fan 2 init_failed -- RPM -- -- -- --
PSU2 Inlet Temp init_failed -- C -- -- -- --
PSU2 Hotspot Temp init_failed -- C -- -- -- --
PSU_FAN failed --
Module B Expander Temp init_failed -- C -- -- -- --
Module A Expander Temp init_failed -- C -- -- -- --
Midplane 4 Temp failed -- C 0 C 5 C 47 C 52 C
Midplane 3 Temp failed -- C 0 C 5 C 47 C 52 C
Midplane 2 Temp failed -- C 0 C 5 C 47 C 52 C
Midplane 1 Temp failed -- C 0 C 5 C 47 C 52 C
Ambient Temp init_failed -- C -- -- -- --
Internal Shelf failed --
- SP日志中重复出现以下消息。
Apr 13 07:42:16 2021 [SP.critical]: Rebooting SP due to loss of ACP comms
Jan 1 00:00:40 1970 [IPMI.notice]: 7b03 | c0 | OEM: ffff70005100 | ManufId: 150300 | SP Reset Internally
Jan 1 00:00:50 1970 [IPMI.notice]: 7c03 | 02 | EVT: 0301ffff | Power_Good | Assertion Event, "State Asserted"
Jan 1 00:00:50 1970 [IPMI.notice]: 7d03 | 02 | EVT: 0301ffff | Power_Proc_OK | Assertion Event, "State Asserted"
Jan 1 00:00:50 1970 [IPMI.notice]: 7e03 | 02 | EVT: 0301ffff | Controller_Fault | Assertion Event, "State Asserted"
Jan 1 00:00:51 1970 [IPMI.notice]: 7f03 | 02 | EVT: 0900ffff | LkWrench_Port_Up | Assertion Event, "Device Disabled"
Jan 1 00:01:02 1970 [SP.notice]: Running primary version 2.10
Jan 1 00:01:05 1970 [SP.normal]: Heartbeat started
Jan 1 00:01:05 1970 [Heartbeat.notice]: Heartbeat start: Set SP time. Old time: Thu Jan 1 00:01:05 1970. New time: Tue Apr 13 07:44:06 2021.
Apr 13 07:44:06 2021 [Heartbeat.notice]: Heartbeat time adjusted: Set SP time. Old time: Thu Jan 1 00:01:05 1970. New time: Tue Apr 13 07:44:06 2021.
Apr 13 07:44:06 2021 [IPMI.notice]: 8003 | 02 | EVT: 6fc22fff | System_FW_Status | Assertion Event, "OnTap Kernel Running"
Apr 13 07:44:06 2021 [IPMI.notice]: 8103 | 02 | EVT: 0300ffff | Controller_Fault | Assertion Event, "State Deasserted"
Apr 13 07:45:09 2021 [IPMI.warning]: FRUID 1 Access error
Apr 13 07:45:30 2021 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Sent
Apr 13 08:00:12 2021 [ASUP.notice]: Reminder email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Sent