On 12/18/06, James Olin Oden james.oden@gmail.com wrote:
On 12/18/06, Mark Belanger mark_belanger@ltx.com wrote:
I have many different centos machines that are hanging regulary. I believe this is due to something our application is doing - not a centos specific problem.
When the machines hang, there is no access to the console or remote access(ssh, rsh, etc).
Any tips on debugging this issue? It is becoming quite a show stopper as we migrate our product from Solaris to Linux.
tia,
FYI - The "application" is a collection of programs that communicate with each other and to a large chip tester via a proprietary serial bus. The hangs are random but pretty frequent - in the range of several per day to several per week.
-Mark
Have you tried the magic sysreq sequence on the console?
Oh, I just noticed propietary serial bus, does mean you have your own device driver(s)? Still see if magic sysreq works, but what you do with this should be driven by your driver writers.
Cheers...james