[CentOS] Broken pipe, x86_64 CentOS 5.2

Wed Oct 1 06:47:20 UTC 2008
Rubin <rubin at xs4all.nl>

Hi,

This is very probably a known issue from nss_ldap which breaks pipes.
Take a look at:

http://rhn.redhat.com/errata/RHBA-2008-0611.html

kind regards,


Rubin.

Daniel Andrzejewski wrote:
> Problem solved by removing ldap from the services line in /etc/nsswitch.
> 
> Daniel Andrzejewski
> 
> student IT Administrator
> Elec Engr & Comp Science
> University of Tennessee
> (865) 974 - 4388 (work)
> 
> "Investment in knowledge always pays the best interest"
> Benjamin Franklin
> -- 
> 
> Daniel Andrzejewski wrote:
>> I got slightly different error this time, maybe some clue, and found
>> this topic similar:
>>
>> http://lists.centos.org/pipermail/centos/2008-July/060274.html
>>
>>
>> This line may be the clue:
>> getsockname(3, 0x7fff7866e9e0, [128])   = -1 ENOTSOCK (Socket
>> operation on non-socket)
>>
>> [root at frodo9 torque-2.3.2-snap.200807081528]# strace -f pbs_mom
>> .
>> .
>> .
>> fcntl(4, F_SETLK, {type=F_UNLCK, whence=SEEK_SET, start=0, len=0}) = 0
>> clone(Process 26346 attached
>> child_stack=0, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,
>> child_tidptr=0x2aae326a1db0) = 26346
>> [pid 26345] exit_group(0)               = ?
>> getsockname(3, 0x7fff7866e9e0, [128])   = -1 ENOTSOCK (Socket
>> operation on non-socket)
>> fcntl(3, F_GETFD)                       = 0
>> dup(3)                                  = 7
>> fcntl(7, F_SETFD, 0)                    = 0
>> socket(PF_INET, SOCK_STREAM, IPPROTO_IP) = 8
>> close(3)                                = 0
>> fcntl(8, F_GETFD)                       = 0
>> dup2(8, 3)                              = 3
>> fcntl(3, F_SETFD, 0)                    = 0
>> close(8)                                = 0
>> write(3, "\25\3\1\0
>> \307\276jJ\207v\7\3473A\355\16\340\232\347\35\\\311\307\3472i)\5L\t\22"...,
>> 37) = -1 EPIPE (Broken pipe)
>> --- SIGPIPE (Broken pipe) @ 0 (0) ---
>> Process 26346 detached
>> [root at frodo9 torque-2.3.2-snap.200807081528]#
>>
>>
>> Daniel Andrzejewski
>> -- 
>>
>> Daniel Andrzejewski wrote:
>>> Hi All,
>>>
>>> I have a problem with torque (openPBS) on x86_64 CentOS 5.2. Just to
>>> add there's no problem on a 32bit CentOS 5.2 or 64bit Ubuntu 8.04.
>>>
>>> The problem is that pbs_mom's child quits without giving any error logs.
>>>
>>> [root at frodo9 torque-2.3.3]# strace -f pbs_mom
>>> .
>>> .
>>> .
>>> bind(6, {sa_family=AF_INET, sin_port=htons(15002),
>>> sin_addr=inet_addr("0.0.0.0")}, 16) = 0
>>> time(NULL)                              = 1222785330
>>> listen(6, 512)                          = 0
>>> socket(PF_INET, SOCK_STREAM, IPPROTO_IP) = 7
>>> setsockopt(7, SOL_SOCKET, SO_REUSEADDR, [1], 4) = 0
>>> bind(7, {sa_family=AF_INET, sin_port=htons(15003),
>>> sin_addr=inet_addr("0.0.0.0")}, 16) = 0
>>> time(NULL)                              = 1222785330
>>> listen(7, 512)                          = 0
>>> fcntl(5, F_SETLK, {type=F_UNLCK, whence=SEEK_SET, start=0, len=0}) = 0
>>> clone(Process 18441 attached
>>> child_stack=0, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,
>>> child_tidptr=0x2b276258fdb0) = 18441
>>> [pid 18441] getsockname(3, {sa_family=AF_INET, sin_port=htons(56711),
>>> sin_addr=inet_addr("172.16.0.9")}, [16]) = 0
>>> [pid 18441] getpeername(3, {sa_family=AF_INET, sin_port=htons(389),
>>> sin_addr=inet_addr("172.16.2.24")}, [68719476752]) = 0
>>> [pid 18441] fcntl(3, F_GETFD)           = 0x1 (flags FD_CLOEXEC)
>>> [pid 18441] dup(3)                      = 8
>>> [pid 18441] fcntl(8, F_SETFD, FD_CLOEXEC) = 0
>>> [pid 18441] socket(PF_INET, SOCK_STREAM, IPPROTO_IP) = 9
>>> [pid 18441] close(3)                    = 0
>>> [pid 18441] fcntl(9, F_GETFD)           = 0
>>> [pid 18441] dup2(9, 3)                  = 3
>>> [pid 18441] fcntl(3, F_SETFD, 0)        = 0
>>> [pid 18441] close(9)                    = 0
>>> [pid 18441] write(3, "\25\3\1\0
>>> \232\17\205\301f<O0\352\246\357\344Z\31&\243\361\356\2128\242\377\7O{\267\333"...,
>>> 37 <unfinished ...>
>>> [pid 18440] exit_group(0)               = ?
>>> [pid 18441] <... write resumed> )       = -1 EPIPE (Broken pipe)
>>> [pid 18441] --- SIGPIPE (Broken pipe) @ 0 (0) ---
>>> Process 18441 detached
>>> [root at frodo9 torque-2.3.3]#
>>>
>>>
>>> I have asked torque users mailing list, but since the problem is
>>> related strictly to CentOS 5.2 64bit, I thought I would go ahead and
>>> ask here.
>>>
>>> Any help is appreciated.
>>>
>>> Thanks,
>>>
>>> Daniel
>>>
>> _______________________________________________
>> CentOS mailing list
>> CentOS at centos.org
>> http://lists.centos.org/mailman/listinfo/centos
> _______________________________________________
> CentOS mailing list
> CentOS at centos.org
> http://lists.centos.org/mailman/listinfo/centos
> 
>