On Wed, 2 Aug 2006, William L. Maltby wrote:
FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 0x0007 093 092 021 Pre-fail Always - 1866 0x0027 252 252 063 Pre-fail Always - 1457
Now, the Q is: what do the numbers mean? Seconds? Milliseconds? If it's seconds, the "RAW_VALUE" may explain why all is OK after things have been powered up long enough. If it's ms, I can only thing it is running a self-test. I would have to go read those articles more closely to see what I can determine.
Hi William,
The attribute names (as listed in the "ATTRIBUTE_NAME" column) are described here:
http://en.wikipedia.org/wiki/Self-Monitoring%2C_Analysis%2C_and_Reporting_Te...
The VALUE column contains a normalized value for the attribute, WORST contains the drives lifetime minimum (or maximum value), THRESH contains the drive manufactures failure threshold, and RAW_VALUE contains a 6-byte value that is used to store the attributes raw value. I see that several sectors are marked as unreadable in the logfiles you posted. What do you see in the column "Reallocated_Sector_Ct?" If the SMART attributes check out, the disk is most likely fine (you can manually run a long self test to be sure), and I would start looking at the SATA controller ( most likely culprit) and the SATA device driver (you could easily integrate some debugging data into the driver to assist with debugging the problem).
Hope this helps, - Ryan -- UNIX Administrator http://prefetch.net