NetApp FAS 3040 storage is down: both controllers keep rebooting. Help needed.

Controller A:
CFE version 3.1.0 based on Broadcom CFE: 1.0.40
Copyright (C) 2000,2001,2002,2003 Broadcom Corporation.
Portions Copyright (c) 2002-2006 Network Appliance, Inc.

CPU type 0xF29: 2800MHz
Total memory: 0x80000000 bytes (2048MB)


Starting AUTOBOOT press any key to abort...
Loading: 0x200000/33111516 0x2193ddc/31331956 0x3f75450/2557763 0x41e5b93/5 Entry at 0x00200000
Starting program at 0x00200000
Press CTRL-C for special boot menu
Thu Jan  9 13:28:51 GMT [nvram.battery.state:info]: The NVRAM battery is currently ON.

NetApp Release 7.2.4: Fri Nov 16 00:07:27 PST 2007
Copyright (c) 1992-2007 Network Appliance, Inc.
Starting boot on Thu Jan  9 13:28:46 GMT 2014
Thu Jan  9 13:29:03 GMT [ispfc_main:error]: Disk 0b.33 has failed to spin up and cannot be used. Please replace it with a new drive.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.44 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.45 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.43 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.41 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.38 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.39 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.32 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.37 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.36 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [diskown.errorReadingOwnership:warning]: error 19 (disk not ready for requested operation) while reading ownership on disk 0b.33 (S/N )
Thu Jan  9 13:29:04 GMT [disk.init.failureBytes:error]: Disk 0a.21 failed due to failure byte setting.
Thu Jan  9 13:29:04 GMT [disk.init.failureBytes:error]: Disk 0a.19 failed due to failure byte setting.
Thu Jan  9 13:29:04 GMT [diskown.errorDuringIO:error]: error 19 (disk not ready for requested operation) on disk 0b.33 (S/N ) while reading individual disk ownership area
Thu Jan  9 13:29:04 GMT [disk.init.failureBytes:error]: Disk 0a.26 failed due to failure byte setting.
Thu Jan  9 13:29:04 GMT [disk.init.failureBytes:error]: Disk 0a.23 failed due to failure byte setting.
Thu Jan  9 13:29:04 GMT [disk.init.failureBytes:error]: Disk 0a.17 failed due to failure byte setting.
Thu Jan  9 13:29:04 GMT [disk.init.failureBytes:error]: Disk 0a.22 failed due to failure byte setting.
Thu Jan  9 13:29:05 GMT [shm.fab.writeSenseError:warning]: shm: Unable to write failure bytes to disk 0b.33 due to error 5/20/0/1.
Thu Jan  9 13:29:05 GMT [disk.releaseFailed:error]: Disk release failed on 0b.33 with return code 5.
Thu Jan  9 13:29:11 GMT [config.noBloop:CRITICAL]: The local node cannot access the partner node's disk shelves because the partner node's shelves are not connected to the local node through their shelf module B.
add net 127.0.0.0: gateway 127.0.0.1
Thu Jan  9 13:29:15 GMT [fmmbx_instanceWorke:info]: missing lock disks, possibly stale mailbox instance on local side
Thu Jan  9 13:29:15 GMT [fmmb.current.lock.disk:info]: Disk 0a.18 is a local HA mailbox disk.
Thu Jan  9 13:29:15 GMT [fmmb.current.lock.disk:info]: Disk ?.? is a local HA mailbox disk.
Use aggr options root in maintenance mode to specify the proper root volume

Waiting to be taken over.  REBOOT in 22 seconds: UNCERTAIN mailbox status in fm_run0
!!!!!
Thu Jan  9 13:29:15 GMT [rc:info]: Node has encountered a multi-disk or other fatal error, waiting to be taken over.
Waiting to be taken over.  REBOOT in 17 seconds.
Waiting to be taken over.  REBOOT in 12 seconds.
Waiting to be taken over.  REBOOT in 7 seconds.
Waiting to be taken over.  REBOOT in 2 seconds.
12 peer answers

neilrule, systems operations engineer, zhou
The log reports a lot of disks as failed. Could the system have rebooted the controllers on its own to protect the array?
Finance (other) · 2022-10-24

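
To see how many disks the boot log actually flags, the error lines can be tallied by event name. The following is only a minimal sketch: the sample lines are copied from the console output in the question, while the regular expression and the helper name `disks_by_event` are assumptions made for illustration and are not part of any NetApp tooling.

```python
import re
from collections import defaultdict

# A few lines copied verbatim from the console output in the question.
BOOT_LOG = """\
Thu Jan  9 13:29:03 GMT [ispfc_main:error]: Disk 0b.33 has failed to spin up and cannot be used. Please replace it with a new drive.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.44 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.45 failed due to failure byte setting.
Thu Jan  9 13:29:04 GMT [disk.init.failureBytes:error]: Disk 0a.21 failed due to failure byte setting.
Thu Jan  9 13:29:15 GMT [fmmb.current.lock.disk:info]: Disk 0a.18 is a local HA mailbox disk.
"""

# Assumed line layout: "[event.name:severity]: Disk <adapter>.<id> ..."
LINE_RE = re.compile(
    r"\[(?P<event>[\w.]+):(?P<severity>\w+)\]: Disk (?P<disk>\w+\.\d+)"
)


def disks_by_event(log_text, severity="error"):
    """Group disk names by event name, keeping only the given severity."""
    grouped = defaultdict(set)
    for line in log_text.splitlines():
        match = LINE_RE.search(line)
        if match and match.group("severity") == severity:
            grouped[match.group("event")].add(match.group("disk"))
    return {event: sorted(disks) for event, disks in grouped.items()}


summary = disks_by_event(BOOT_LOG)
for event, disks in summary.items():
    print(f"{event}: {len(disks)} disk(s): {', '.join(disks)}")
```

Run against the full console capture, a tally like this makes it easier to see that a single disk (0b.33) failed to spin up while the rest were rejected because of the failure byte setting.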
pysx0503, alliance member, systems engineer, 第十区。散人
Were there any other recent operations, such as replacing a disk, a power loss, or a restart? Or did it go down suddenly?
Systems integration · 2022-03-20

popopo, systems engineer, home
Disk 0b.33 has failed to spin up and cannot be used. Please replace it with a new drive.
Systems integration · 2014-01-16

power_7, technical manager, IBM AND ORACLE
Bumping this. Could everyone take a look?
Internet services · 2014-01-16

power_7, technical manager, IBM AND ORACLE
Quoting donnieyen (posted 2014-1-10 15:24):
    Thu Jan  9 13:29:15 GMT [fmmb.current.lock.disk:info]: Disk ?.? is a local HA mailbox disk.
    Use aggr ...

Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.43 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.41 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.38 failed due to failure byte setting.
Thu Jan  9 13:29:03 GMT [disk.init.failureBytes:error]: Disk 0a.39 failed due to failure byte setting.
The log reports a whole series of "failure byte setting" errors. What causes them?
Internet services · 2014-01-10

power_7, technical manager, IBM AND ORACLE
Quoting donnieyen (posted 2014-1-10 15:27):
    CFE version 3.1.0
    CFE version 3.0.0
    The firmware versions on the two controllers do not match.

That has no effect. The system was running fine with it before.
Internet services · 2014-01-10

donnieyen, database administrator, 重庆坤基科技有限公司
CFE version 3.1.0
CFE version 3.0.0
The firmware versions on the two controllers do not match.
Systems integration · 2014-01-10

donnieyen, database administrator, 重庆坤基科技有限公司
Thu Jan  9 13:29:15 GMT [fmmb.current.lock.disk:info]: Disk ?.? is a local HA mailbox disk.
Use aggr options root in maintenance mode to specify the proper root volume
Replace the failed disks first, then look at the rest.
Systems integration · 2014-01-10

xieyadong, systems engineer, China Southern Power Grid
Following this thread.
Systems integration · 2014-01-10

hp9000a, systems engineer, 紫光
Why is the firmware different on controllers A and B?
Internet services · 2014-01-10

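
Several replies point at the CFE version difference. The check can be made mechanical by extracting the version from each controller's boot banner and comparing the two. This is a rough sketch under assumptions: the first string is the banner line from the Controller A log in the question, the second is only the bare version string quoted in the replies (the second controller's full banner is not shown in the thread), and `cfe_version` is a hypothetical helper name.

```python
import re

# Assumed banner layout: "CFE version <major>.<minor>.<patch> ..."
CFE_RE = re.compile(r"CFE version (\d+(?:\.\d+)*)")


def cfe_version(banner):
    """Return the CFE version as a tuple of ints, or None if not found."""
    match = CFE_RE.search(banner)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


# Banner line from the Controller A console output in the question.
banner_a = "CFE version 3.1.0 based on Broadcom CFE: 1.0.40"
# Bare version string quoted in the replies; presumably the other controller.
banner_b = "CFE version 3.0.0"

ver_a = cfe_version(banner_a)
ver_b = cfe_version(banner_b)
mismatch = ver_a != ver_b

print(f"controller A: {ver_a}, other controller: {ver_b}, mismatch: {mismatch}")
```

A mismatch flagged this way confirms only that the versions differ. Whether that difference is related to the reboot loop is a separate question, and the original poster reports the pair ran fine with it before.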

Asked by

power_7
Technical manager, IBM AND ORACLE
Areas of expertise: NetAPP FAS

Question status

  • Posted: 2014-01-10
  • Followers: 3
  • Views: 12427
  • Latest answer: 2022-10-24