Hello all,
I’m at my wits end trying to resolve this. We are running centos 4.5 on
Intel hardware. Dual SCSI disk drives mirrored on an LSI Logic controller.
Every once in a while and not always on the same server and not only on the
local SCSI Drives.
System A – Dual internal drives on /dev/sda
System B – Dual internal drives on /dev/sdc with a DAS on /dev/sda.
Each of these systems experienced a kernel mptbase error and placed /dev/sda
into read only mode. Note again the /dev/sda isn’t always local.
For system A – remounting in ro mode didn’t work and the system had to be
rebooted. File system check and bad block checks showed nothing and when the
system was rebooted – it was fine.
A portion of the messages log is below. I would appreciate any ideas or
directions.
Thanks,
Steve Moccio
Aug 7 01:00:06 sshd(pam_unix)[18336]: session opened for user root by
(uid=0)
Aug 7 09:00:36 kernel: mptscsi: ioc1: attempting task abort! (sc=f6f07c80)
Aug 7 09:00:36 kernel: scsi1 : destination target 0, lun 0
Aug 7 09:00:36 kernel: command = Write (10) 00 00 00 fb d7 00 01 90
00
Aug 7 09:00:38 kernel: mptbase: Initiating ioc1 recovery
Aug 7 09:00:44 kernel:
drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with IOC
Reset
Aug 7 09:01:19 last message repeated 10 times
Aug 7 09:01:40 last message repeated 7 times
Aug 7 09:01:41 kernel: mptbase: ioc1: ERROR - Diagnostic reset FAILED!
(102h)
Aug 7 09:01:41 kernel: mptbase: ioc1 NOT READY WARNING!
Aug 7 09:01:41 kernel: mptbase: WARNING - (-1) Cannot recover ioc1
Aug 7 09:01:41 kernel: mptscsi: ioc1: Issue of TaskMgmt failed!
Aug 7 09:01:41 kernel: mptscsi: ioc1: task abort: FAILED (sc=f6f07c80)
Aug 7 09:01:41 kernel: mptscsi: ioc1: attempting bus reset! (sc=f6f07c80)
Aug 7 09:01:41 kernel: scsi1 : destination target 0, lun 0
Aug 7 09:01:41 kernel: command = Write (10) 00 00 00 fb d7 00 01 90
00
Aug 7 09:01:41 kernel: mptbase: Initiating ioc1 recovery
Aug 7 09:01:46 kernel: mptbase: ioc1: ERROR - Doorbell ACK timeout
(count=4999), IntStatus=80000000!
Aug 7 09:01:47 kernel:
drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with IOC
Reset
Aug 7 09:02:23 last message repeated 10 times
Aug 7 09:02:44 last message repeated 7 times
Aug 7 09:02:47 kernel: mptbase: ioc1: ERROR - Diagnostic reset FAILED!
(102h)
Aug 7 09:02:47 kernel: mptbase: ioc1 NOT READY WARNING!
Aug 7 09:02:47 kernel: mptbase: WARNING - (-1) Cannot recover ioc1
Aug 7 09:02:47 kernel: mptscsi: ioc1: bus reset: FAILED (sc=f6f07c80)
Aug 7 09:02:48 kernel: mptscsi: ioc1: Attempting host reset! (sc=f6f07c80)
Aug 7 09:02:48 kernel: mptbase: Initiating ioc1 recovery
Aug 7 09:02:51 kernel:
drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with IOC
Reset
Aug 7 09:02:51 kernel:
drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with IOC
Reset
Aug 7 09:02:53 kernel: mptbase: ioc1: ERROR - Doorbell ACK timeout
(count=4999), IntStatus=80000000!
Aug 7 09:02:58 kernel:
drivers/message/fusion/mptctl.c@1985::mptctl_do_mpt_command - Busy with IOC
Reset
Aug 7 09:03:34 last message repeated 10 times
Aug 7 09:03:48 last message repeated 5 times
Aug 7 09:03:54 kernel: mptbase: ioc1: ERROR - Diagnostic reset FAILED!
(102h)
Aug 7 09:03:54 kernel: mptbase: ioc1 NOT READY WARNING!
Aug 7 09:03:54 kernel: mptbase: WARNING - (-1) Cannot recover ioc1
Aug 7 09:03:54 kernel: scsi: Device offlined - not ready after error
recovery: host 1 channel 0 id 0 lun 0