[CentOS] everything seems to hang, but system is idle?

Sun Apr 11 12:49:15 UTC 2010
Rudi Ahlers <rudiahlers at gmail.com>

On Sun, Apr 11, 2010 at 2:25 PM, Rudi Ahlers <rudiahlers at gmail.com> wrote:
> On Sun, Apr 11, 2010 at 2:04 PM, JohnS <jses27 at gmail.com> wrote:
>>
>> On Sun, 2010-04-11 at 12:58 +0200, Rudi Ahlers wrote:
>>
>>> At the same time I can open a new SSH session and do whatever I like.
>>> But it seems that running a command which takes time to complete
>>> hangs.
>> ---
>> Try killing off those rsyncs and try it again.  You need to provide some
>> other type of error messages.  Use strace.  tail /var/log/messages and
>> paste it in your reply.  Even if you don't see anything in it that does
>> not mean someone else can't.  You may need a reboot.
>>
>> John
>>
>
>
>
> John, I already said I can't kill the process, and tail -f
> /var/log/messages *really* doesn't show me anything.
>
> I am running tail -f /var/log/messages in one SSH window, and at the
> same time killed & re-ran "yum remove iscsi-initiator-utils -y" in
> another SSH window. /var/log/messages has *nothing* to report.
>
>
>
> [root@intranet ~]# tail -f /var/log/messages
> Apr 11 14:18:50 intranet nmbd[4310]:   find_domain_master_name_query_fail:
> Apr 11 14:18:50 intranet nmbd[4310]:   Unable to find the Domain Master Browser name SOFTDUX<1b> for the workgroup SOFTDUX.
> Apr 11 14:18:50 intranet nmbd[4310]:   Unable to sync browse lists in this workgroup.
> Apr 11 14:18:50 intranet nmbd[4310]: [2010/04/11 14:18:50, 0] nmbd/nmbd_browsesync.c:find_domain_master_name_query_fail(351)
> Apr 11 14:18:50 intranet nmbd[4310]:   find_domain_master_name_query_fail:
> Apr 11 14:18:50 intranet nmbd[4310]:   Unable to find the Domain Master Browser name SOFTDUX<1b> for the workgroup SOFTDUX.
> Apr 11 14:18:50 intranet nmbd[4310]:   Unable to sync browse lists in this workgroup.
> Apr 11 14:19:14 intranet snmpd[3912]: error scanning interface data (expected 10, got 0)
> Apr 11 14:20:44 intranet snmpd[3912]:last message repeated 6 times
> Apr 11 14:22:14 intranet snmpd[3912]:last message repeated 6 times
> Apr 11 14:23:44 intranet snmpd[3912]:last message repeated 6 times
>
>
> The ONLY "fix" is a reboot, but I don't want to reboot every few
> minutes. I have already done it a few times today, and since this is a
> dev server, everyone else working on it (mainly web development) has to
> wait, which cuts into production time.
>



I can't install strace either:



[root@intranet ~]# yum install strace -y
Loaded plugins: fastestmirror
Loading mirror speeds from cached hostfile
 * local-addons: 192.168.1.250
 * local-base: 192.168.1.250
 * local-extras: 192.168.1.250
 * local-updates: 192.168.1.250
 * rpmforge: apt.sw.be
Setting up Install Process
Resolving Dependencies
--> Running transaction check
---> Package strace.x86_64 0:4.5.18-5.el5_4.4 set to be updated
--> Finished Dependency Resolution

Dependencies Resolved

================================================================================
 Package       Arch         Version                  Repository          Size
================================================================================
Installing:
 strace        x86_64       4.5.18-5.el5_4.4         local-updates      177 k

Transaction Summary
================================================================================
Install       1 Package(s)
Upgrade       0 Package(s)

Total size: 177 k
Downloading Packages:
Running rpm_check_debug
Running Transaction Test







And that's where it sits and does nothing. The system's load isn't very high:


[root@intranet ~]# uptime
 14:32:25 up 1 day,  2:05,  6 users,  load average: 2.02, 2.02, 2.01


and again /var/log/messages reports nothing related to this problem:


[root@intranet ~]# tail -f /var/log/messages
Apr 11 14:19:14 intranet snmpd[3912]: error scanning interface data (expected 10, got 0)
Apr 11 14:20:44 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:22:14 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:23:44 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:25:14 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:26:44 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:28:14 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:29:44 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:31:14 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:32:44 intranet snmpd[3912]:last message repeated 6 times
Apr 11 14:33:23 intranet snmpd[3912]:last message repeated 3 times
Apr 11 14:33:23 intranet snmpd[3912]: Received TERM or STOP signal... shutting down...
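
A side note on the uptime output above: a load average that sits at almost
exactly 2.0 on an otherwise idle box would be consistent with two tasks stuck
in uninterruptible sleep (state "D"), because Linux counts those toward the
load average. That is only a guess; nothing in the logs confirms it. A minimal
sketch for listing such tasks with plain ps follows. The helper names
filter_blocked and blocked_procs are made up for illustration, and it assumes
the procps version of ps.

```shell
#!/bin/sh
# Keep the ps header line plus every process whose STAT column starts
# with "D" (uninterruptible sleep, typically waiting on disk or network I/O).
# Expects "ps -eo pid,stat,wchan:32,args" style output on stdin.
filter_blocked() {
    awk 'NR == 1 || $2 ~ /^D/'
}

# Hypothetical convenience wrapper: list blocked tasks on the live system.
# The WCHAN column shows the kernel function each task is sleeping in.
blocked_procs() {
    ps -eo pid,stat,wchan:32,args | filter_blocked
}
```

Running blocked_procs a few times, some seconds apart, should show whether the
same PIDs (for example the hung yum or rsync) stay in state D.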




I stopped snmpd since it's not being used. After that, no other errors
came up that would tell me what is causing this.
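
Since strace won't install while yum hangs, one way to see where a stuck
process is sitting without any extra packages is to read it straight out of
/proc. This is a rough sketch, assuming a Linux /proc filesystem; proc_state is
a made-up helper name, and the PID has to be looked up first (e.g. with ps).

```shell
#!/bin/sh
# Show the scheduler state and kernel wait channel of one process via /proc.
# Linux-only; needs nothing beyond grep and cat.
proc_state() {
    pid=$1
    # "State:" line from /proc/PID/status, e.g. "D (disk sleep)"
    grep '^State:' "/proc/$pid/status" || return 1
    # Kernel function the task is sleeping in (may be 0 or empty if running)
    printf 'wchan: %s\n' "$(cat "/proc/$pid/wchan" 2>/dev/null)"
}
```

For example, "proc_state 1234" (with 1234 replaced by the PID of the hung yum)
prints the state and wait channel; a state of "D" together with a wait channel
pointing into a filesystem or block-device function would hint at stuck I/O.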



-- 
Kind Regards
Rudi Ahlers
SoftDux

Website: http://www.SoftDux.com
Technical Blog: http://Blog.SoftDux.com
Office: 087 805 9573
Cell: 082 554 7532