Dear group, I am in the process of configuring 2 new servers. They are running Centos 5.5 and for the last three days they have been rebooting unexpectedly, can you point me in the right direction what to look for in the logs. I have been checking /var/log/messages but don't see anything that hint me any clues why this is happening. Your input is much much appreciated. Lisandro
PS. These are custom built Tyan boxes.
I am in the process of configuring 2 new servers. They are running Centos
5.5 and for the last three days they have been rebooting unexpectedly, can you point me in the right direction what to look for in the logs. I have been checking /var/log/messages but don't see anything that hint me any clues why this is happening. Your input is much much appreciated. Lisandro
If there aren't any messages in the logs, then I would have to think this is a hardware issue. Maybe an overheating or power supply problem. These servers have new hardware? Describe your hardware...
From: compdoc compdoc@hotrodpc.com
If there aren’t any messages in the logs, then I would have to think this is a
hardware issue. Maybe an overheating or power supply problem. These servers have
new hardware? Describe your hardware...
These should appear in the server logs (IPMI) in the BIOS screen or with server system tools...
JD
Also run memtest86 on them overnight (getting at least one complete iteration). That utility is available by booting the installation CD/DVD.
Devin
If its two servers doing the same, then I guess it's not likely they both have the same hardware problem. The thing is, that's not something centos is going to do on its own, so it's some program that's been added, or some common bios setting that's wrong.
Do they connect to a UPS with a serial/usb cable?
Also run memtest86 on them overnight (getting at least one complete
iteration).
I've seen one memtest iteration pass, but 2 or 3 were needed before a failure showed up. That's not usually the case, though...
On 1/16/2011 9:24 AM, compdoc wrote:
I've seen one memtest iteration pass, but 2 or 3 were needed before a failure showed up. That's not usually the case, though...
I have a server right now which passed three memtest iterations but throws intermittent errors on one DIMM when it gets warm enough (warm enough being about 2 or 3 C warmer than the normal system temp under full stress test load with all covers on in my build environment).
It isn't really common - but it does happen.
This is interesting...I wonder if my box is having and overheating issue. Sent on the Sprint® Now Network from my BlackBerry®
-----Original Message----- From: Jerry Franz jfranz@freerun.com Sender: centos-bounces@centos.org Date: Sun, 16 Jan 2011 09:42:46 To: CentOS mailing listcentos@centos.org Reply-To: CentOS mailing list centos@centos.org Subject: Re: [CentOS] Server reboots unexpectebly.
On 1/16/2011 9:24 AM, compdoc wrote:
I've seen one memtest iteration pass, but 2 or 3 were needed before a failure showed up. That's not usually the case, though...
I have a server right now which passed three memtest iterations but throws intermittent errors on one DIMM when it gets warm enough (warm enough being about 2 or 3 C warmer than the normal system temp under full stress test load with all covers on in my build environment).
It isn't really common - but it does happen.
On Sun, 16 Jan 2011, compdoc wrote:
To: 'CentOS mailing list' centos@centos.org From: compdoc compdoc@hotrodpc.com Subject: Re: [CentOS] Server reboots unexpectebly.
If its two servers doing the same, then I guess it's not likely they both have the same hardware problem. The thing is, that's not something centos is going to do on its own, so it's some program that's been added, or some common bios setting that's wrong.
Do they connect to a UPS with a serial/usb cable?
Also run memtest86 on them overnight (getting at least one complete
iteration).
I've seen one memtest iteration pass, but 2 or 3 were needed before a failure showed up. That's not usually the case, though...
Test 5 is the most stressfull for exercising memory. You can select that to run continuously overnight.
Keith
----------------------------------------------------------------- Websites: http://www.karsites.net http://www.php-debuggers.net http://www.raised-from-the-dead.org.uk
All email addresses are challenge-response protected with TMDA [http://tmda.net] -----------------------------------------------------------------