Hello all,
I’m at my wits end trying to resolve this. We are running
centos 4.5 on Intel hardware. Dual SCSI disk drives mirrored on an LSI Logic
controller.
Every once in a while and not always on the same server and
not only on the local SCSI Drives.
System A – Dual internal drives on /dev/sda
System B – Dual internal drives on /dev/sdc with a DAS on
/dev/sda.
Each of these systems experienced a kernel mptbase error and
placed /dev/sda into read only mode. Note again the /dev/sda isn’t always
local.
For system A – remounting in ro mode didn’t work and the
system had to be rebooted. File system check and bad block checks showed
nothing and when the system was rebooted – it was fine.
A portion of the messages log is below. I would appreciate
any ideas or directions.
Thanks,
Steve Moccio
Aug 7 01:00:06
sshd(pam_unix)[18336]: session opened for user root by (uid=0)
Aug 7 09:00:36
kernel: mptscsi: ioc1: attempting task abort! (sc=f6f07c80)
Aug 7
09:00:36 kernel: scsi1 : destination target 0, lun 0
Aug 7 09:00:36
kernel: command = Write (10) 00
00 00 fb d7 00 01 90 00
Aug 7 09:00:38
kernel: mptbase: Initiating ioc1 recovery
Aug 7 09:00:44
kernel: drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with
IOC Reset
Aug 7 09:01:19
last message repeated 10 times
Aug 7 09:01:40
last message repeated 7 times
Aug 7 09:01:41
kernel: mptbase: ioc1: ERROR - Diagnostic reset FAILED! (102h)
Aug 7 09:01:41
kernel: mptbase: ioc1 NOT READY WARNING!
Aug 7 09:01:41
kernel: mptbase: WARNING - (-1) Cannot recover ioc1
Aug 7 09:01:41
kernel: mptscsi: ioc1: Issue of TaskMgmt failed!
Aug 7 09:01:41
kernel: mptscsi: ioc1: task abort: FAILED (sc=f6f07c80)
Aug 7 09:01:41
kernel: mptscsi: ioc1: attempting bus reset! (sc=f6f07c80)
Aug 7
09:01:41 kernel: scsi1 : destination target 0, lun 0
Aug 7 09:01:41
kernel: command = Write (10) 00
00 00 fb d7 00 01 90 00
Aug 7 09:01:41
kernel: mptbase: Initiating ioc1 recovery
Aug 7 09:01:46
kernel: mptbase: ioc1: ERROR - Doorbell ACK timeout (count=4999),
IntStatus=80000000!
Aug 7 09:01:47
kernel: drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with
IOC Reset
Aug 7 09:02:23
last message repeated 10 times
Aug 7 09:02:44
last message repeated 7 times
Aug 7 09:02:47
kernel: mptbase: ioc1: ERROR - Diagnostic reset FAILED! (102h)
Aug 7 09:02:47
kernel: mptbase: ioc1 NOT READY WARNING!
Aug 7 09:02:47
kernel: mptbase: WARNING - (-1) Cannot recover ioc1
Aug 7 09:02:47
kernel: mptscsi: ioc1: bus reset: FAILED (sc=f6f07c80)
Aug 7 09:02:48
kernel: mptscsi: ioc1: Attempting host reset! (sc=f6f07c80)
Aug 7 09:02:48
kernel: mptbase: Initiating ioc1 recovery
Aug 7 09:02:51
kernel: drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with
IOC Reset
Aug 7 09:02:51
kernel: drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with
IOC Reset
Aug 7 09:02:53
kernel: mptbase: ioc1: ERROR - Doorbell ACK timeout (count=4999),
IntStatus=80000000!
Aug 7 09:02:58
kernel: drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with
IOC Reset
Aug 7 09:03:34
last message repeated 10 times
Aug 7 09:03:48
last message repeated 5 times
Aug 7 09:03:54
kernel: mptbase: ioc1: ERROR - Diagnostic reset FAILED! (102h)
Aug 7 09:03:54
kernel: mptbase: ioc1 NOT READY WARNING!
Aug 7 09:03:54
kernel: mptbase: WARNING - (-1) Cannot recover ioc1
Aug 7 09:03:54
kernel: scsi: Device offlined - not ready after error recovery: host 1 channel
0 id 0 lun 0