[CentOS] how to debug hardware lockups?
Les Mikesell
lesmikesell at gmail.com
Tue Nov 18 14:47:43 UTC 2008
Rudi Ahlers wrote:
>> I had machine that would crash about once every week or two in normal
>> operation. Memtest86+ found an error in the 2nd day of running. The worst
>> part was that it left the raid mirrors in a strange state that caused
>> occasional problems for months even after replacing the RAM.
>>
>> --
>
> Did you leave memtest86+ running for 2 days? I thought 1 or 2 cycles
> would be good enough?
>
> I'm hoping to pick-up the server in the next 2 hours then I can see
> what happens when I run memtest86+ or other tests
Yes, apparently RAM errors can be subtle and only appear when certain
adjacent bit patterns are stored - or when the moon is in a certain
phase or something.
--
Les Mikesell
lesmikesell at gmail.com
More information about the CentOS
mailing list