On Sun, Apr 11, 2010 at 2:25 PM, Rudi Ahlers rudiahlers@gmail.com wrote:
On Sun, Apr 11, 2010 at 2:04 PM, JohnS jses27@gmail.com wrote:
On Sun, 2010-04-11 at 12:58 +0200, Rudi Ahlers wrote:
At the same time I can open a new SSH session and do whatevery I like. But it seems that running a command which takes time to complete hangs.
Try killing off those rsyncs and try it again. You need to provide some other type of error messages. Use strace. tail /var/log/messages and paste it in your reply. Even if you don't see anything in it that does not mean someone else can't. You may need a reboot.
John
John, I already said I can't kill the process and tail -f /var/log/messages *really* doesn't show me anything.
I am running tail -f /var/log/message in on SSH window, and at the same time killed & re-ran "yum remove iscsi-initiator-utils -y" in another SSH window. /var/log/message has *nothing* to report.
[root@intranet ~]# tail -f /var/log/messages Apr 11 14:18:50 intranet nmbd[4310]: find_domain_master_name_query_fail: Apr 11 14:18:50 intranet nmbd[4310]: Unable to find the Domain Master Browser name SOFTDUX<1b> for the workgroup SOFTDUX. Apr 11 14:18:50 intranet nmbd[4310]: Unable to sync browse lists in this workgroup. Apr 11 14:18:50 intranet nmbd[4310]: [2010/04/11 14:18:50, 0] nmbd/nmbd_browsesync.c:find_domain_master_name_query_fail(351) Apr 11 14:18:50 intranet nmbd[4310]: find_domain_master_name_query_fail: Apr 11 14:18:50 intranet nmbd[4310]: Unable to find the Domain Master Browser name SOFTDUX<1b> for the workgroup SOFTDUX. Apr 11 14:18:50 intranet nmbd[4310]: Unable to sync browse lists in this workgroup. Apr 11 14:19:14 intranet snmpd[3912]: error scanning interface data (expected 10, got 0) Apr 11 14:20:44 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:22:14 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:23:44 intranet snmpd[3912]:last message repeated 6 times
The ONLY "fix" is a reboot, but I don't want to reboot every few minutes (have done it already a few times today as the server is a dev server and everyone else working it (mainly web development) have to wait and this cuts in on production time.
I can't install strace either:
[root@intranet ~]# yum install strace -y Loaded plugins: fastestmirror Loading mirror speeds from cached hostfile * local-addons: 192.168.1.250 * local-base: 192.168.1.250 * local-extras: 192.168.1.250 * local-updates: 192.168.1.250 * rpmforge: apt.sw.be Setting up Install Process Resolving Dependencies --> Running transaction check ---> Package strace.x86_64 0:4.5.18-5.el5_4.4 set to be updated --> Finished Dependency Resolution
Dependencies Resolved
===================================================================================================================================================== Package Arch Version Repository Size ===================================================================================================================================================== Installing: strace x86_64 4.5.18-5.el5_4.4 local-updates 177 k
Transaction Summary ===================================================================================================================================================== Install 1 Package(s) Upgrade 0 Package(s)
Total size: 177 k Downloading Packages: Running rpm_check_debug Running Transaction Test
And that's where it sits and does nothing. The system's load isn't very high:
[root@intranet ~]# uptime 14:32:25 up 1 day, 2:05, 6 users, load average: 2.02, 2.02, 2.01
and again /var/log/messages reports nothing related to this problem:
[root@intranet ~]# tail -f /var/log/messages Apr 11 14:19:14 intranet snmpd[3912]: error scanning interface data (expected 10, got 0) Apr 11 14:20:44 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:22:14 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:23:44 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:25:14 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:26:44 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:28:14 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:29:44 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:31:14 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:32:44 intranet snmpd[3912]:last message repeated 6 times Apr 11 14:33:23 intranet snmpd[3912]:last message repeated 3 times Apr 11 14:33:23 intranet snmpd[3912]: Received TERM or STOP signal... shutting down...
I stopped snmpd since it's not being used. After that no other errors which tells me what causes this came up.