[CentOS] kernel: Machine check events logged

Wed Jul 7 14:26:39 UTC 2010
Alexander Farber <alexander.farber at gmail.com>

I've only found this Solaris blog, but don't understand it well enough:
http://blogs.sun.com/gavinm/entry/amd_opteron_athlon64_turion64_fault

Can't provide you more details, because my dedicated server
is under hoster's "hardware tests" since 5 hours :-(
(and I guess everyone will run home for the Germany-Spain game soon)

Regards
Alex

>> > MCE 0
>> > HARDWARE ERROR. This is *NOT* a software problem!
>> > Please contact your hardware vendor
>> > CPU 0 4 northbridge TSC 111a60c5584d4 [at 2500 Mhz 1 days 9:25:51
>> > uptime (unreliable)]
>> > MISC c008000001000000 ADDR 1148f5940
>> >   Northbridge NB Array Error
>> >        bit35 = err cpu3
>> >        bit42 = L3 subcache in error bit 0
>> >        bit43 = L3 subcache in error bit 1
>> >        bit46 = corrected ecc error
>> >        bit59 = misc error valid
>> >   memory/cache error 'generic read mem transaction, generic
>> > transaction, level generic'
>> > STATUS 9c1f4cf8001c011b MCGSTATUS 0
>> > No DIMM found for 1148f5940 in SMBIOS