[CentOS] Root fs suddenly goes r/o

Wed Jul 11 23:13:50 UTC 2007
Garrick Staples <garrick at usc.edu>

On Wed, Jul 11, 2007 at 06:20:50PM -0300, Eduardo Grosclaude alleged:
> Out of the blue, dmesg on my HP Proliant w/ a SCSI disk gives loads of
> messages like this one:
> 
> EXT3-fs error (device dm-0) in start_transaction: Journal has aborted
> 
> Then the root fs goes read-only, so little else can be done on the machine.
> LVM locks up. At restart, the fs needs an fsck and then a reboot to recover.
> The host starts up ok, then I am given a few more minutes before the problem
> reappears. This is stock CentOS 4.4; I have never managed to update it
> because of this very same problem.
> 
> System logs show SCSI I/O errors, but SMART reports no problems, and
> neither does badblocks (run from a rescue CD bootup). SCSI cabling,
> terminator, etc. have been checked.
> 
> What should I investigate next? Is the disk condemned?

Quite likely the drive is dying.  If you want proof from SMART, run an
extended self-test; something like 'smartctl -t long /dev/sda' will
likely fail.
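
A minimal sketch of that check, assuming smartmontools is installed and
the disk really is /dev/sda as in the command above. The two function
names are hypothetical helpers, not part of smartctl. Behind a hardware
RAID controller the device may need an extra '-d' type option, which is
not shown here.

```shell
#!/bin/sh
# Sketch only: wraps the smartctl calls needed for an extended self-test.
# The device defaults to /dev/sda (taken from the message); adjust as needed.

start_long_selftest() {
    disk="${1:-/dev/sda}"
    # Overall health verdict. This may report no problem even when the
    # drive is returning I/O errors, as in the original report.
    smartctl -H "$disk"
    # Start the extended (long) self-test. The command returns right away;
    # the drive runs the test itself in the background.
    smartctl -t long "$disk"
}

show_selftest_log() {
    disk="${1:-/dev/sda}"
    # Print the drive's self-test log; run this after the test has finished.
    smartctl -l selftest "$disk"
}
```

Run 'start_long_selftest /dev/sda' as root, wait for the estimated
duration that smartctl prints, then run 'show_selftest_log /dev/sda'.
A failed entry in that log is the proof mentioned above.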

-- 
Garrick Staples, GNU/Linux HPCC SysAdmin
University of Southern California

Please avoid sending me Word or PowerPoint attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html