On Thu, 2006-08-17 at 18:00 +0200, Peter Kjellström wrote:
On Thursday 17 August 2006 17:35, Marek Dabrowski wrote:
I have probably found the reason for my problems... Two minutes ago I rebooted the second node. Load on the first node decreased from 10 to 2.01. The rebooted node has a load of 0.29 now. But the question of "why" this is happening is still open.
I say it's time to bring out the big hammer.. oprofile.
rough guide:
- install the following rpms: oprofile kernel-smp-devel kernel-debuginfo
- opcontrol --setup --vmlinux=/path/to/vmlinux/from/debuginfo-rpm
- when in abnormal state: opcontrol --start ; sleep a bit ; opcontrol --shutdown
- opreport -l -p /lib/modules/$(uname -r)/kernel | head -n 15
That should give you the top most consumers of processor time and hopefully some clues :-)
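The steps above could be wrapped in a small helper so they are quick to fire off while the node is misbehaving. This is only a sketch: the function names run and profile_top, the DRY_RUN switch and the 30 second default are my own assumptions (the guide only says "sleep a bit"); the opcontrol and opreport invocations are the ones from the guide, and opcontrol --setup is assumed to have been done already.

```shell
# Hypothetical wrapper around the oprofile steps from the guide.
# DRY_RUN=1 prints the commands instead of executing them.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# profile_top [seconds]: sample for a while, then show the top symbols.
# The 30 second default is an arbitrary choice, not from the guide.
profile_top() {
    secs="${1:-30}"
    run opcontrol --start || return 1
    run sleep "$secs"
    run opcontrol --shutdown || return 1
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the report pipeline literally; nothing was profiled.
        echo 'opreport -l -p /lib/modules/$(uname -r)/kernel | head -n 15'
    else
        opreport -l -p "/lib/modules/$(uname -r)/kernel" | head -n 15
    fi
}
```

Running "DRY_RUN=1 profile_top 5" first shows what would be executed; as root, "profile_top 60" does the real thing.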
While you're at it, run vmstat for a few seconds and study the first two columns (r, processes waiting to run, and b, processes blocked in uninterruptible sleep).
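If eyeballing the vmstat output gets tedious, something like the following could average those first two columns over a run. It is a rough sketch: the function name vmstat_rb_avg is made up for illustration, and it assumes the usual vmstat layout where header lines start with a non-numeric field and data lines start with the two numeric columns.

```shell
# Hypothetical helper: average the first two columns of vmstat-style
# output read from stdin. Header lines are skipped because their first
# two fields are not plain integers.
vmstat_rb_avg() {
    awk '$1 ~ /^[0-9]+$/ && $2 ~ /^[0-9]+$/ { r += $1; b += $2; n++ }
         END { if (n) printf "r=%.2f b=%.2f samples=%d\n", r / n, b / n, n }'
}
```

For example: "vmstat 1 10 | vmstat_rb_avg". A high first column points at CPU contention, a high second column at processes stuck waiting (typically on I/O).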
/Peter (who likes big hammers, everything's a nail!!)
hmmmm ... the title of this message does not look good :) ... I thought for sure this was some serious SPAM :)