trouble with crash utility

3 Jul 2013


Hi all,
I recently had an issue where a box running kernel 2.6.32-358.11.1
would stay up for about five minutes, then crash and reboot.  kdump
saved the vmcore files, so I was hoping to run crash against them to see
why this was happening.  I copied the vmcores to another machine,
installed the kernel-debuginfo package, and gave it a try, but had no
success:
$ crash /usr/lib/debug/lib/modules/2.6.32-358.11.1.el6.centos.plus.x86_64/vmlinux vmcore
crash 6.1.0-1.el6
Copyright (C) 2002-2012  Red Hat, Inc.
Copyright (C) 2004, 2005, 2006, 2010  IBM Corporation
Copyright (C) 1999-2006  Hewlett-Packard Co
Copyright (C) 2005, 2006, 2011, 2012  Fujitsu Limited
Copyright (C) 2006, 2007  VA Linux Systems Japan K.K.
Copyright (C) 2005, 2011  NEC Corporation
Copyright (C) 1999, 2002, 2007  Silicon Graphics, Inc.
Copyright (C) 1999, 2000, 2001, 2002  Mission Critical Linux, Inc.
This program is free software, covered by the GNU General Public License,
and you are welcome to change it and/or distribute copies of it under
certain conditions.  Enter "help copying" to see the conditions.
This program has absolutely no warranty.  Enter "help warranty" for details.
GNU gdb (GDB) 7.3.1
Copyright (C) 2011 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later http://gnu.org/licenses/gpl.html
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-unknown-linux-gnu"...
WARNING: kernels compiled by different gcc versions:
  /usr/lib/debug/lib/modules/2.6.32-358.11.1.el6.centos.plus.x86_64/vmlinux: 4.4.6
  vmcore kernel: 4.4.7
WARNING: kernel version inconsistency between vmlinux and dumpfile
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c1bbc0  type: "current_task (per_cpu)"
crash: page excluded: kernel virtual address: ffffffff81c232a4  type: "tss_struct ist array"
$
Instead of getting the crash> prompt, I ended up back at the bash
prompt.  Does anyone know how I could figure out why?  Do I need to run
crash on the original server (the crash docs seem to imply that's not
necessary)?  Is there anything else I could look for that might be useful?
As far as I can tell the kernels match:
$ strings vmcore | grep OSRELEASE
OSRELEASE=2.6.32-358.11.1.el6.x86_64
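The strings check above only shows one side of the comparison, so here is a minimal sketch of comparing the full release strings on both sides. It assumes the release can be read from the directory name in the debuginfo vmlinux path and from the OSRELEASE= line in the vmcore; the function names are hypothetical helpers, not part of crash or kdump.

```python
import re
from typing import Optional


def osrelease_from_vmcore(data: bytes) -> Optional[str]:
    """Return the value of the first OSRELEASE= entry in raw vmcore bytes."""
    # Same idea as `strings vmcore | grep OSRELEASE`: the value runs up to
    # the next whitespace or NUL byte.
    m = re.search(rb"OSRELEASE=([^\s\x00]+)", data)
    return m.group(1).decode("ascii", "replace") if m else None


def release_from_vmlinux_path(path: str) -> Optional[str]:
    """Return the release directory from a .../lib/modules/<release>/vmlinux path."""
    # Assumption: the debuginfo package keeps vmlinux under a directory
    # named after the kernel release, as in the crash command above.
    m = re.search(r"/lib/modules/([^/]+)/vmlinux$", path)
    return m.group(1) if m else None


def releases_match(vmlinux_path: str, vmcore_data: bytes) -> bool:
    """True only if both release strings were found and are identical."""
    from_path = release_from_vmlinux_path(vmlinux_path)
    from_core = osrelease_from_vmcore(vmcore_data)
    return from_path is not None and from_path == from_core
```

With the two strings shown in this post, the path gives 2.6.32-358.11.1.el6.centos.plus.x86_64 and the vmcore gives 2.6.32-358.11.1.el6.x86_64, so releases_match() reports a mismatch even though the version numbers agree. A real vmcore can be very large, so in practice it would make sense to scan it in chunks rather than read it into memory whole.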
--keith
-- 
kkeller@wombat.san-francisco.ca.us