由于 SP/BMC 端口故障,SP/BMC 无法更新固件
适用于
- ONTAP 9
- 服务处理器
- 固件升级
问题描述
- SP/BMC 固件升级失败,事件日志中报告以下错误:
[cluster1-01: servprocd: sp.servprocd.upd.evts:debug]: params: {'reason': 'SP Firmware network update from 10.5 to 10.6 has been triggered.'}
[cluster1-01: servprocd: sp.servprocd.upd.evts:debug]: params: {'reason': 'SP firmware image has been successfully transferred to SP using network interface.'}
[cluster1-01: servprocd: sp.servprocd.upd.unexpt.evts:debug]: params: {'reason': 'Unable to successfully update SP firmware using network interface'}
[cluster1-01: servprocd: sp.servprocd.upd.error:error]: SP update error: SP firmware update failure has been detected.
- 来自 AutoSupport 的
SP-MGMT-MLOG-TXT.GZ报告 SP/BMC 不可访问:
[kern_servprocd:info:7198] 0x808707100: 0: NOTICE: Servprocd::FTM: transferred /mroot/etc/RLM_FW/sp_image.tar.gz to sp://cluster1-01:/mnt/sapps/sp_image.tar.gz
[kern_servprocd:info:7198] 0x808707100: 0: NOTICE: Servprocd::SpUpdate: update: SP firmware image has been successfully transferred to SP using network interface.
[kern_servprocd:info:5642] 0x808702b00: 0: ERR: Servprocd::SpUpdate: writeImage: spcs sp update command failed (The Service Processor on node "cluster1-01" is not reachable. Verify that the SP or BMC is online, verify that api-service is enabled on the SP or BMC, verify that the partner node is running, check if pings from SP or BMC to partner node work, check if hw-assist keep-alives are normal, check that network ports are configured correctly and are functional (up). Then, try the command again.)
[kern_servprocd:info:5642] 0x808702b00: 0: ERR: Servprocd::SpUpdate: update: unable to successfully update SP firmware using network interface
[kern_servprocd:info:6143] 0x80b715000: 0: NOTICE: Servprocd::SpUpdate: ScheduleSpAutoUpdate: Checking whether SP network is available for SP firmware auto-update
[kern_servprocd:info:6143] 0x80b715000: 0: ERR: Servprocd::SpUpdate: doPreUpdateChecks: Ping test to internal BMC IP failed.
[kern_servprocd:info:6143] 0x809426f00: 8003ea000000080d: ERR: Servprocd::CLI: create_imp: The specified package /mroot/etc/software/http://10.16.11.5/web/netapp/BMC_FW.zip could not be read.
[kern_servprocd:info:6143] 0x80946d400: 8503ea0000000445: ERR: Servprocd::CLI: get_bmc_boot_image: Failed to get boot image for BMC
SP-LATEST-RUNTIME从 AutoSupport 报告 MemFree 高于 57,000 KB
MemFree: 96968 kB
SP-LATEST-CONFIGURATION从 AutoSupport 显示 N/A 主固件和备份固件版本:
version
=======
Booted primary firmware version 11.6
Primary firmware version N/A
Backup firmware version N/A
SP-LATEST-SYSLOG来自 AutoSupport 报告的错误在手动更新尝试期间类似于以下内容:
BMC fud_eth[29319]: get_notif_shm: Failed to create the shared memory key
BMC fud_eth[29319]: ipmi_notif_to_ontap: Unable to set progress state as shared memory is not accessible.
BMC fud_eth[29319]: Downloading package...
BMC fud_eth[29319]: system() returned ret1 = 0, ret2 = 0
BMC last message repeated 3 times
BMC fud_eth[29319]: FW package download from ftp://anonymous@10.10.10.10/software//BMC_FW_308-04071_11.7.tar.gz failed.
BMC fud_eth[29319]: Cleaning up..
- SP/BMC 处于联机状态,并且 service-processor api-service 已启用。
- SP api-service 内部证书已续订。
- 网络端口配置正确,功能正常(向上)。
- 重新启动 SP/BMC 和重新启动 service-processor 后台程序无济于事。
- 尝试直接从 SP/BMC 提示符进行升级也会失败。
- 对受影响的节点执行接管/回馈并不能解决问题。