It's time to resurrect this thread from way back in June. The problem in the subject line has reared its ugly head again, but this time with a twist that makes it much worse. A little refresher on what was happening back then: every so often the root file system would be remounted read-only, with the error in the subject line appearing over and over again on the console.
Lately, this has been happening every 10-14 days, and I would have to reboot my system. Since the root file system was not writable, no error messages were logged in /var/log/messages. So I configured syslog to write messages to another system as well, and this time I have captured some errors (see below). BTW, this is a SATA drive.
What makes it much worse this time is that the system won't boot! When I try to boot now, I get the following error over and over again:
ata1: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
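For anyone trying to read a line like the one above, here is a minimal Python sketch that splits the "stat/err" pair into named register bits. The bit names are the conventional ones for the ATA status and error registers; the function names (decode_bits, decode_stat_err) are made up for this illustration and are not part of any kernel or library API.

```python
import re

# Conventional bit names for the ATA status register (highest bit first).
ATA_STATUS_BITS = {
    0x80: "BSY",   # device busy
    0x40: "DRDY",  # device ready
    0x20: "DF",    # device fault
    0x10: "DSC",   # seek complete
    0x08: "DRQ",   # data request
    0x04: "CORR",  # corrected data
    0x02: "IDX",   # index
    0x01: "ERR",   # error, see the error register
}

# Conventional bit names for the ATA error register (highest bit first).
ATA_ERROR_BITS = {
    0x80: "ICRC",   # interface CRC error
    0x40: "UNC",    # uncorrectable data error
    0x20: "MC",     # media changed
    0x10: "IDNF",   # sector ID not found
    0x08: "MCR",    # media change requested
    0x04: "ABRT",   # command aborted
    0x02: "TK0NF",  # track 0 not found
    0x01: "AMNF",   # address mark not found
}

STAT_ERR_RE = re.compile(r"stat/err 0x([0-9a-fA-F]+)/([0-9a-fA-F]+)")


def decode_bits(value, table):
    """Return the names of all bits set in value, highest bit first."""
    return [name for bit, name in table.items() if value & bit]


def decode_stat_err(line):
    """Decode the 'stat/err 0xSS/EE' pair in a log line, or return None."""
    match = STAT_ERR_RE.search(line)
    if match is None:
        return None
    status = int(match.group(1), 16)
    error = int(match.group(2), 16)
    return decode_bits(status, ATA_STATUS_BITS), decode_bits(error, ATA_ERROR_BITS)


if __name__ == "__main__":
    boot_line = "ata1: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04"
    print(decode_stat_err(boot_line))
```

For the boot-time line this gives DRDY, DSC and ERR in the status register and UNC in the error register, which as far as I can tell points at an unreadable sector on the disk itself. Note that when BSY is set (as in a 0xb7 status), the other status bits are generally not considered meaningful.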
HELP! Is there anything I can do to recover this system?
Alfred
Here are the first 50 lines from /var/log/messages (including the first occurrence of the error in the subject line):
Aug 1 18:57:04 balboa01 kernel: ata1: command 0x35 timeout, stat 0xb7 host_stat 0x21
Aug 1 18:57:04 balboa01 kernel: ata1: translated ATA stat/err 0xb7/00 to SCSI SK/ASC/ASCQ 0xb/47/00
Aug 1 18:57:04 balboa01 kernel: ata1: status=0xb7 { Busy }
Aug 1 18:57:04 balboa01 kernel: SCSI error : <0 0 0 0> return code = 0x8000002
Aug 1 18:57:04 balboa01 kernel: Current sda: sense key Aborted Command
Aug 1 18:57:04 balboa01 kernel: Additional sense: Scsi parity error
Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 224365
Aug 1 18:57:04 balboa01 kernel: ATA: abnormal status 0xB7 on port 0x1F7
Aug 1 18:57:04 balboa01 last message repeated 2 times
Aug 1 18:57:04 balboa01 kernel: ata1: command 0x35 timeout, stat 0xb7 host_stat 0x21
Aug 1 18:57:04 balboa01 kernel: ata1: translated ATA stat/err 0xb7/00 to SCSI SK/ASC/ASCQ 0xb/47/00
Aug 1 18:57:04 balboa01 kernel: ata1: status=0xb7 { Busy }
Aug 1 18:57:04 balboa01 kernel: SCSI error : <0 0 0 0> return code = 0x8000002
Aug 1 18:57:04 balboa01 kernel: Current sda: sense key Aborted Command
Aug 1 18:57:04 balboa01 kernel: Additional sense: Scsi parity error
Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 233795925
Aug 1 18:57:04 balboa01 kernel: Buffer I/O error on device dm-0, logical block 29198337
Aug 1 18:57:04 balboa01 kernel: lost page write due to I/O error on dm-0
Aug 1 18:57:04 balboa01 kernel: ATA: abnormal status 0xB7 on port 0x1F7
Aug 1 18:57:04 balboa01 last message repeated 2 times
Aug 1 18:57:04 balboa01 kernel: ata1: command 0x35 timeout, stat 0xb7 host_stat 0x21
Aug 1 18:57:04 balboa01 kernel: ata1: translated ATA stat/err 0xb7/00 to SCSI SK/ASC/ASCQ 0xb/47/00
Aug 1 18:57:04 balboa01 kernel: ata1: status=0xb7 { Busy }
Aug 1 18:57:04 balboa01 kernel: SCSI error : <0 0 0 0> return code = 0x8000002
Aug 1 18:57:04 balboa01 kernel: Current sda: sense key Aborted Command
Aug 1 18:57:04 balboa01 kernel: Additional sense: Scsi parity error
Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 224373
Aug 1 18:57:04 balboa01 kernel: Buffer I/O error on device dm-0, logical block 1893
Aug 1 18:57:04 balboa01 kernel: lost page write due to I/O error on dm-0
Aug 1 18:57:04 balboa01 kernel: ATA: abnormal status 0xB7 on port 0x1F7
Aug 1 18:57:04 balboa01 last message repeated 2 times
Aug 1 18:57:04 balboa01 kernel: Aborting journal on device dm-0.
Aug 1 18:57:04 balboa01 kernel: ata1: command 0x35 timeout, stat 0xb7 host_stat 0x21
Aug 1 18:57:04 balboa01 kernel: ata1: translated ATA stat/err 0xb7/00 to SCSI SK/ASC/ASCQ 0xb/47/00
Aug 1 18:57:04 balboa01 kernel: ata1: status=0xb7 { Busy }
Aug 1 18:57:04 balboa01 kernel: SCSI error : <0 0 0 0> return code = 0x8000002
Aug 1 18:57:04 balboa01 kernel: Current sda: sense key Aborted Command
Aug 1 18:57:04 balboa01 kernel: Additional sense: Scsi parity error
Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 172585309
Aug 1 18:57:04 balboa01 kernel: Buffer I/O error on device dm-0, logical block 21547010
Aug 1 18:57:04 balboa01 kernel: lost page write due to I/O error on dm-0
Aug 1 18:57:04 balboa01 kernel: ATA: abnormal status 0xB7 on port 0x1F7
Aug 1 18:57:04 balboa01 last message repeated 2 times
Aug 1 18:57:04 balboa01 kernel: ext3_abort called.
Aug 1 18:57:04 balboa01 kernel: EXT3-fs error (device dm-0): ext3_journal_start_sb: Detected aborted journal
Aug 1 18:57:04 balboa01 kernel: Remounting filesystem read-only
Aug 1 18:57:04 balboa01 kernel: ata1: command 0x35 timeout, stat 0xb7 host_stat 0x21
Aug 1 18:57:04 balboa01 kernel: EXT3-fs error (device dm-0) in start_transaction: Journal has aborted
Aug 1 18:57:34 balboa01 kernel: ata1: command 0x35 timeout, stat 0xb7 host_stat 0x21
Aug 1 18:57:34 balboa01 kernel: ata1: translated ATA stat/err 0xb7/00 to SCSI SK/ASC/ASCQ 0xb/47/00
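To see whether the failures cluster in one area of the disk, the sector numbers can be pulled out of a log like the one above. This is only a rough sketch: the function name failing_sectors is made up for illustration, and the pattern assumes the "end_request: I/O error, dev ..., sector ..." wording shown in this excerpt.

```python
import re
from collections import defaultdict

# Matches the kernel's "end_request" I/O error lines as they appear in the log above.
IO_ERROR_RE = re.compile(r"end_request: I/O error, dev (\w+), sector (\d+)")


def failing_sectors(log_text):
    """Map each device name to a sorted list of the distinct failing sectors."""
    sectors = defaultdict(set)
    for dev, sector in IO_ERROR_RE.findall(log_text):
        sectors[dev].add(int(sector))
    return {dev: sorted(found) for dev, found in sectors.items()}


if __name__ == "__main__":
    sample = (
        "Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 224365\n"
        "Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 233795925\n"
        "Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 224373\n"
        "Aug 1 18:57:04 balboa01 kernel: end_request: I/O error, dev sda, sector 172585309\n"
    )
    print(failing_sectors(sample))
```

In this excerpt the four failing sectors are spread widely across sda, and all of them fail within the same second on a timed-out command, so the pattern looks less like one bad spot than like the drive or link not responding at all at that moment.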