用户的机器最开始报电源错误,而且通过机器后方电源状态灯发现有2个电源模块已经故障,而且运行一周左右时间就宕机
---------------------------------------------------------------------------
LABEL: SCAN_ERROR_CHRP
IDENTIFIER: BFE4C025
Date/Time: Tue Jan 13 10:08:31 2015
Sequence Number: 63235
Machine Id: 00C5102C4C00
Node Id: LIHDMCMDS01
Class: H
Type: PERM
Resource Name: sysplanar0
Resource Class: planar
Resource Type: sysplanar_rspc
Location:
Description
UNDETERMINED ERROR
Failure Causes
UNDETERMINED
Recommended Actions
RUN SYSTEM DIAGNOSTICS.
Detail Data
PROBLEM DATA
0644 00E0 0000 0600 9600 8E00 0000 0000 0000 0000 4942 4D00 5048 0030 0100 3F30
2014 1223 0157 3207 2014 1223 0157 3208 4500 0106 0000 0000 0000 0000 0000 0000
509A E5B4 509A E5B4 5548 0018 0100 3F30 6103 4400 0000 0000 0000 A804 0000 0000
5053 00F4 0101 3F30 0201 0002 0000 00EC 003C 0002 0000 0000 0000 0000 0000 0000
..........
Diagnostic Analysis
Diagnostic Log sequence number: 19690
Resource tested: sysplanar0
Resource Description: System Planar
Location:
SRC: 11001524
Description: Power/Cooling subsystem Unrecovered Error, bypassed
with loss of redundancy. Refer to the system service
documentation for more information.
Additional Words: 2-003C0002 3-00000000 4-00000000 5-00000000
6-00000000 7-00000000 8-00000000 9-00000000
Possible FRUs:
Priority: L FRU: 39J2779 S/N: YL1116P66140 CCIN: 51B7
Location: U7879.001.DQDKLFV-E2
Priority: L FRU: 03N6355 S/N: YL11C6063033 CCIN: 28EA
Location: U7879.001.DQDKLFN-P1-C8
在换完电源模块后经过一周时间发现机器又宕了,重新开机查看报错,发现CEC报了好多错误
在IBM硬件信息中心查到了这些
难道这些都坏了?请问接下来该怎样排查啊?到底用不用把报错的全换掉?