On 26/07/16 15:30, Brian Stinson wrote:
how long was this machine deployed for ? The machine reaper will silently kill power to the node.
Right, but that's 8 hours, correct? I do have a "transparent duffy reuse" tool in https://github.com/cgwalters/centos-ci-skeleton which is used a lot in my jobs, but it generally avoids retaining machines for greater than an hour.
Take https://ci.centos.org/job/atomic-fedora-ws-treecompose/50/console if you click through to https://ci.centos.org/job/atomic-fedora-ws-duffy-allocate/6699/console
At: 22:09:11 Assigning host: n8.pufty.ci.centos.org (SSID=jenkins-atomic-fedora-ws-treecompose-50) Then at this point it hung and I finally aborted it: 22:32:45 Installing packages: 75% 23:53:14 Build was aborted
So that host had only been assigned for less than 30 minutes. Is there anything relevant in the Duffy logs about this?
nope, duffy only powers up machines and powers them down using ipmi calls. Nothing else in there, specially if the machine was still powered up.