[CentOS] serious problem with torque (ignore - sorry, that's EPEL)

Wed May 27 14:43:00 UTC 2015
m.roth at 5-cent.us <m.roth at 5-cent.us>

m.roth at 5-cent.us wrote:
> Hi, folks,
>
>    The other admin updated torque without testing it on one machine, and
> we had Issues. The first I knew was when a user reported qstat
> returning
> socket_connect_unix failed: 15137
> socket_connect_unix failed: 15137
> socket_connect_unix failed: 15137
> qstat: cannot connect to server (null) (errno=15137) could not connect to
> trqauthd
>
> Attempting to restart the pbs_server did the same. Working with my
> manager, we found:
>   a) torque had been updated from 2.x to 4.2.10, which is huge.
>   b) Apparently, it no longer uses munged. Instead, it uses trqauthd, and
> that wasn't
>         in the updated packages.
>   c) We could not downgrade!!!
>   d) My manager updated from testing, and installed, and then running
> trqauthd, and
>         restarting pbs_server, it appears to be working again.
>
> Should I be filing a bug report?
>
Sorry, realized after I posted that it's a package from epel. Which, of
course, is part of fedora. Ah, how I love fedora...NOT.

        mark