[CentOS] how to debug hardware lockups?

Les Mikesell lesmikesell at gmail.com
Tue Nov 18 23:05:50 UTC 2008

nate wrote:
> Les Mikesell wrote:
>> Yes, apparently RAM errors can be subtle and only appear when certain
>> adjacent bit patterns are stored - or when the moon is in a certain
>> phase or something.
> Don't forget cosmic rays
> http://adsabs.harvard.edu/abs/1978ITNS...25.1166P

Yeah, but those don't stop when you replace the faulty RAM...  Mine did, 
but the errors already committed to disk kept reappearing at random 
afterwards, as reads alternated between the two halves of the RAID1.
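
To illustrate why the symptom comes and goes, here is a toy sketch of a
two-way mirror where one copy of a block was written while the RAM was
bad. The names (mirrors, read_block, mismatched) are made up for the
example, and the round-robin picker is only a stand-in assumption, not the
md driver's actual read-balancing logic.

```python
from itertools import cycle

GOOD = b"hello world"
BAD = b"helln world"   # 'o' (0x6f) -> 'n' (0x6e): one flipped bit

# Assumption for illustration: disk 0 got the good copy, disk 1 the bad one.
mirrors = [GOOD, BAD]


def read_block(picker):
    """Return the block from whichever mirror the picker selects next."""
    return mirrors[next(picker)]


def mismatched(copies):
    """True if the mirror copies differ, which is what a scrub would flag."""
    return len(set(copies)) > 1


# Round-robin reads: every other read comes back corrupted, even though
# the RAM that caused the damage has since been replaced.
picker = cycle(range(len(mirrors)))
results = [read_block(picker) == GOOD for _ in range(6)]
print(results)
print("mirrors differ:", mismatched(mirrors))
```

The only way to get rid of this is to compare the two halves and rewrite
the bad copy. On Linux md, writing "check" to /sys/block/mdX/md/sync_action
and then reading mismatch_cnt in the same directory should report whether
the mirrors disagree, though it cannot say which half is the correct one.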

