紧急停机:由于服务处理器过载,环境原因停机(电池充电电流故障)
适用于
- FAS8020
- 服务处理器( SP ) 3.7P1
问题
- 服务处理器报告读取传感器时出现高负载和错误:
SP node-name> events all
...
Record 960: Sat Jun 20 00:39:00.617884 2020 [SP.notice]: SP load is high: 2.64 2.37 2.28
Record 963: Sat Jun 20 02:12:02.973421 2020 [SP.notice]: SP load is high: 2.98 2.78 2.57
Record 964: Sat Jun 20 02:23:51.925071 2020 [IPMI.warning]: Error while reading sensor number : 0
Record 965: Sat Jun 20 02:23:51.933973 2020 [IPMI.notice]: 9706 | c0 | OEM: f9ff7020ff00 | ManufId: 150300 | Undefined
Record 966: Sat Jun 20 02:25:33.741715 2020 [IPMI.warning]: Error while reading sensor number : 0
Record 967: Sat Jun 20 02:25:33.754053 2020 [IPMI.notice]: 9806 | c0 | OEM: f9ff7020ff00 | ManufId: 150300 | Undefined
Record 968: Sat Jun 20 02:43:03.757287 2020 [SP.notice]: SP load is high: 2.29 2.29 2.40
- 节点的 EMS 消息会在多个组件中报告错误并触发关闭:
Sat Jun 20 10:01:01 CEST [node_name: env_mgr: monitor.fru.info.unreadable:error]: The inventory information of FRU FAN1 is not readable.
Sat Jun 20 10:03:27 CEST [node_name: env_mgr: monitor.temp.unreadable:error]: The controller temperature (CPU0 Temp Margin) is not readable.
Sat Jun 20 10:08:04 CEST [node_name: env_mgr: monitor.fru.info.unreadable:error]: The inventory information of FRU FAN2 is not readable.
Sat Jun 20 10:11:15 CEST [node_name: env_mgr: monitor.fru.info.unreadable:error]: The inventory information of FRU PSU2 is not readable.
Sat Jun 20 10:14:31 CEST [node_name: env_mgr: monitor.fru.info.unreadable:error]: The inventory information of FRU FAN3 is not readable.
Sat Jun 20 10:17:31 CEST [node_name: env_mgr: monitor.fru.info.unreadable:error]: The inventory information of FRU PSU1 is not readable.
Sat Jun 20 10:27:18 CEST [node_name: env_mgr: monitor.temp.unreadable:error]: The controller temperature (In Flow Temp) is not readable.
Sat Jun 20 10:27:18 CEST [node_name: env_mgr: monitor.chassisTemperature.state.unknown:alert]: Chassis temperature state is unknown: Multiple Temp sensors are unreadable. System will be shutdown in 2 minutes.
Sat Jun 20 10:27:18 CEST [node_name: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Out Flow Temp) is not readable.
Sat Jun 20 10:27:18 CEST [node_name: env_mgr: monitor.temp.unreadable:error]: The controller temperature (PCI Slot Temp) is not readable.
Sat Jun 20 10:27:33 CEST [node_name: statd: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Battery charging current failed)