dmesg is not reporting any issues.
The /proc/mdstat looks fine. md0 : active raid1 sdb1[1] sda1[0] X blocks [2/2] [UU]
however /var/log/messages says:
smartd[3392] Device /dev/sda 20 offline uncorrectable sectors
The machine is running fine.. raid array looks good - what is up with smartd?
THanks,
Jerry
on 4/1/2011 8:32 AM Jerry Geis spake the following:
dmesg is not reporting any issues.
The /proc/mdstat looks fine. md0 : active raid1 sdb1[1] sda1[0] X blocks [2/2] [UU]
however /var/log/messages says:
smartd[3392] Device /dev/sda 20 offline uncorrectable sectors
The machine is running fine.. raid array looks good - what is up with smartd?
THanks,
Jerry
Could it be that the bad sectors so far have been in unused areas? Once a drive runs out of sectors to map corrections to, I would really think about replacing it.
Scott Silva wrote:
Could it be that the bad sectors so far have been in unused areas? Once a drive runs out of sectors to map corrections to, I would really think about replacing it.
This advice is so often repeated by people on lists. This is a pretty normal function of modern hard drives. The drive needs to reallocate the bad sectors. It does not mean that the drive is failing unless there have been a large number of sectors requiring reallocation or it keeps happening often.
Have a look at this to fix them for normal drives without raid: http://smartmontools.sourceforge.net/badblockhowto.html
Linux raid will rewrite the block if it is in the raided part of the disk. You can force a scrub doing this (md0 is the raid device). echo check > /sys/block/md0/md/sync_action Check /proc/mdstat and dmesg for status.
You should be doing this weekly to identify bad blocks, so check your crontab.
Jeremy
Jerry Geis wrote:
dmesg is not reporting any issues.
The /proc/mdstat looks fine. md0 : active raid1 sdb1[1] sda1[0] X blocks [2/2] [UU]
however /var/log/messages says:
smartd[3392] Device /dev/sda 20 offline uncorrectable sectors
The machine is running fine.. raid array looks good - what is up with smartd?
search the list archives for "offline uncorrectable sectors". e2fsck -cc might help, though I don't know how that will go with raid.
On Fri, 1 Apr 2011, Jerry Geis wrote:
dmesg is not reporting any issues.
The /proc/mdstat looks fine. md0 : active raid1 sdb1[1] sda1[0] X blocks [2/2] [UU]
however /var/log/messages says:
smartd[3392] Device /dev/sda 20 offline uncorrectable sectors
The machine is running fine.. raid array looks good - what is up with smartd?
This page is one I like for understanding SMART attributes.
http://www.z-a-recovery.com/man-smart.htm
Steve
On 04/01/2011 05:32 PM, Jerry Geis wrote:
dmesg is not reporting any issues.
The /proc/mdstat looks fine. md0 : active raid1 sdb1[1] sda1[0] X blocks [2/2] [UU]
however /var/log/messages says:
smartd[3392] Device /dev/sda 20 offline uncorrectable sectors
The machine is running fine.. raid array looks good - what is up with smartd?
THanks,
Jerry
This means that you will be in trouble sooner or later.
Is there only sda1 on sda? If so, try
mdadm /dev/md0 --fail /dev/sda1 mdadm /dev/md0 --remove /dev/sda1 mdadm /dev/md0 --add /dev/sda1
which will end up overwriting the offline uncorrectable sectors. In most cases this fixes the disk. But you will have to monitor it carefully from now on.
HTH,
Kay