This server was running Fedora Core 2 until 12/2/2005, with an impressive uptime of 180 days or so. On Friday I upgraded it to CentOS 4.2. Since the upgrade, the server has been throwing kernel panics every 10 hours or so.
These are some of the error messages I am getting on the console: "CPU 0:Machine check exception :4 Bank 0 : f672100000000833 TSC 112180365ccf ADDR e2c7f7c0 Kernel Panic - Not synching Machine Check"
This is a production server. I would appreciate any help.
S
Check the memory bank (RAM).
_______________________________________________
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos
Is there a way to check the memory bank other than swapping the memory modules in and out? I am 90 miles away from the server, so swapping modules is really not an option.
S
Vikas,
Do you have remote hands? If yes, restart the CentOS server and select the memtest86 option in the boot menu.
I do have remote hands. When GRUB loads, it doesn't show the memtest86 option.
"select memtest86 option in boot menu." In which boot menu should I look for memtest86?
I am currently logged into the box. It runs fine for a couple of hours before it gives the kernel error. Can I run the memtest86 check from the shell?
S
OK, I did this to get memtest into the boot menu:
    yum install memtest*
    memtest-setup
I am now going to call the remote hands and have them run the memtest.
Would it be too much of a coincidence that this Friday I:
1. upgraded the OS from Fedora Core 2 to CentOS 4.2
2. upgraded the BIOS on the motherboard (Tyan S2881, http://www.tyan.com/support/html/b_s2881.html)
and the memory failed at the same time?
Also, could this problem have something to do with hard drive DMA? I tried using hdparm to disable DMA on the drive, but since it's a SATA drive, hdparm reports: "HDIO_GET_MULTCOUNT failed: Inappropriate ioctl for device"
lm_sensors shows the CPU temperature to be 46C.
S
Vikas wrote:
I do have remote hands. When GRUB loads, it doesn't show the memtest86 option.
"select memtest86 option in boot menu." In which boot menu should I look for memtest86?
I am currently logged into the box. It runs fine for a couple of hours before it gives the kernel error. Can I run the memtest86 check from the shell?
1. You need to install memtest for that to work. Perhaps get memtest86+ from Dag's repo; it will set up the GRUB entry for you.
2. Could you refrain from top-posting, please?
3. Could you trim reply posts if possible?
Thanks :)
I just finished running memtest86. It did not report any errors, but the machine is still giving kernel panics.
What should I do next? :(
S
Get parsemce.c from: http://www.codemonkey.org.uk/cruft/parsemce.c/
Basic usage info is included in the header of the file. This should let you decode exactly what is going on with the machine.
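For anyone who wants a rough first look before building parsemce, here is a minimal Python sketch that pulls the bank, status, and address out of the console line from the first post and decodes the generic parts of the status word. It is not parsemce and does not reproduce its output. The function names (parse_mce_line, status_flags, decode_bus_error) are made up for illustration, and the bit layout is the general x86 machine-check status layout as I understand it. What a particular bank number means is CPU-specific and is not decoded here.

```python
import re

# Console line as posted at the top of this thread.
LINE = ("CPU 0:Machine check exception :4 Bank 0 : f672100000000833 "
        "TSC 112180365ccf ADDR e2c7f7c0")

# Flag bits in the top byte of a machine-check status register
# (assumed generic x86 layout; check your CPU vendor's manual).
FLAG_BITS = [(63, "VAL"), (62, "OVER"), (61, "UC"), (60, "EN"),
             (59, "MISCV"), (58, "ADDRV"), (57, "PCC")]

PARTICIPATION = ["SRC", "RES", "OBS", "GEN"]
REQUEST = ["GEN", "RD", "WR", "DRD", "DWR", "IRD", "PREFETCH", "EVICT", "SNOOP"]
MEM_IO = ["MEM", "reserved", "IO", "GEN"]
LEVEL = ["L0", "L1", "L2", "LG"]


def parse_mce_line(line):
    """Return (bank, status, addr) parsed from a console line like LINE."""
    m = re.search(r"Bank\s+(\d+)\s*:\s*([0-9a-fA-F]+)", line)
    if m is None:
        raise ValueError("no 'Bank N : <status>' field found")
    a = re.search(r"ADDR\s+([0-9a-fA-F]+)", line)
    addr = int(a.group(1), 16) if a else None
    return int(m.group(1)), int(m.group(2), 16), addr


def status_flags(status):
    """Names of the flag bits that are set in the status word."""
    return [name for bit, name in FLAG_BITS if (status >> bit) & 1]


def decode_bus_error(code):
    """Decode a 16-bit error code of the form 0000 1PPT RRRR IILL.

    Returns None when the code is not in the bus/interconnect format.
    """
    if code & 0xF800 != 0x0800:
        return None
    rrrr = (code >> 4) & 0xF
    return {
        "participation": PARTICIPATION[(code >> 9) & 0x3],
        "timeout": bool((code >> 8) & 0x1),
        "request": REQUEST[rrrr] if rrrr < len(REQUEST) else "unknown",
        "mem_or_io": MEM_IO[(code >> 2) & 0x3],
        "level": LEVEL[code & 0x3],
    }


if __name__ == "__main__":
    bank, status, addr = parse_mce_line(LINE)
    print("bank:", bank)
    print("flags:", " ".join(status_flags(status)))
    print("error code: 0x%04x" % (status & 0xFFFF))
    print("bus error fields:", decode_bus_error(status & 0xFFFF))
    if addr is not None:
        print("address: 0x%x" % addr)
```

For the status word in this thread, the sketch reports the UC (uncorrected) and PCC (processor context corrupt) flags, and an error code in the bus-error format describing a data read from memory. That is consistent with a fault somewhere on the memory path, but it does not by itself say whether the DIMMs, the CPU, or the board is at fault.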