How do you detect ECC memory errors when CentOS is booted?
Just had a strange hardware error where a server machine hang during anaconda install it passed memtest86 test but the bios dmi log showed dimm bank 1b had failures.
Now I wonder what would happen if a machine running in production would get such problems, how do you detect it without having to go into bios?
Hints on what to use? Chris