We deployed a second runner in our CI cluster recently.
And things haven’t been going great ever since…
Lots of jobs would randomly fail with the clone:skipped error message.
Restarting them would sometimes fix them.
Now, however, some projects keep failing to build, with no useful logs…
Even with the simplest pipeline configuration possible, the error message doesn’t change; the pipeline gets destroyed as soon as it’s started…
.drone.jsonnet
[
  {
    kind: 'pipeline',
    type: 'docker',
    name: 'Base',
    steps: [
      {
        commands: ['echo EHLO'],
        image: 'node:14-alpine',
        name: 'echo',
      },
    ],
    trigger: { event: ['push'] },
  },
]
converted to .drone.yml
---
{
  "kind": "pipeline",
  "name": "Base",
  "steps": [
    {
      "commands": [
        "echo EHLO"
      ],
      "image": "node:14-alpine",
      "name": "echo"
    }
  ],
  "trigger": {
    "event": [
      "push"
    ]
  },
  "type": "docker"
}
---
kind: signature
hmac: 88611f11ae869c614ab4a045a94f0292d11a1ff99a5b74dbd997896657db855d
...
Runner Logs
time="2021-11-30T15:31:06Z" level=debug msg="stage received" stage.id=25718 stage.name=Base stage.number=1 thread=2
time="2021-11-30T15:31:06Z" level=debug msg="stage accepted" stage.id=25718 stage.name=Base stage.number=1 thread=2
time="2021-11-30T15:31:06Z" level=debug msg="stage details fetched" build.id=18272 build.number=41 repo.id=196 repo.name=modbus-manager repo.namespace=TS stage.id=25718 stage.name=Base stage.number=1 thread=2
time="2021-11-30T15:31:06Z" level=debug msg="updated stage to running" build.id=18272 build.number=41 repo.id=196 repo.name=modbus-manager repo.namespace=TS stage.id=25718 stage.name=Base stage.number=1 thread=2
time="2021-11-30T15:31:09Z" level=debug msg="destroying the pipeline environment" build.id=18272 build.number=41 repo.id=196 repo.name=modbus-manager repo.namespace=TS stage.id=25718 stage.name=Base stage.number=1 thread=2
time="2021-11-30T15:31:10Z" level=debug msg="successfully destroyed the pipeline environment" build.id=18272 build.number=41 repo.id=196 repo.name=modbus-manager repo.namespace=TS stage.id=25718 stage.name=Base stage.number=1 thread=2
time="2021-11-30T15:31:10Z" level=debug msg="updated stage to complete" build.id=18272 build.number=41 duration=2 repo.id=196 repo.name=modbus-manager repo.namespace=TS stage.id=25718 stage.name=Base stage.number=1 thread=2
time="2021-11-30T15:31:10Z" level=debug msg="poller: request stage from remote server" thread=2
time="2021-11-30T15:31:10Z" level=trace msg="http: context canceled"
time="2021-11-30T15:31:10Z" level=debug msg="done listening for cancellations" build.id=18272 build.number=41 repo.id=196 repo.name=modbus-manager repo.namespace=TS stage.id=25718 stage.name=Base stage.number=1 thread=2
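In case it helps anyone reading these logs: here is a minimal Python sketch that turns runner log lines into a relative timeline. It is not part of Drone; the helper names `parse_logfmt` and `stage_timeline` are made up for illustration, and it assumes the `key=value` / `key="quoted value"` layout shown above. The sample lines are shortened copies of three entries from the logs.

```python
import shlex
from datetime import datetime


def parse_logfmt(line):
    """Split one log line (key=value or key="quoted value") into a dict."""
    fields = {}
    # shlex.split honours the double quotes, so msg="stage received"
    # arrives as the single token 'msg=stage received'.
    for token in shlex.split(line):
        key, sep, value = token.partition("=")
        if sep:
            fields[key] = value
    return fields


def stage_timeline(lines):
    """Return (seconds since the first entry, level, msg) for each line."""
    events = []
    for line in lines:
        fields = parse_logfmt(line)
        if "time" not in fields:
            continue  # skip anything that does not carry a timestamp
        ts = datetime.strptime(fields["time"], "%Y-%m-%dT%H:%M:%SZ")
        events.append((ts, fields.get("level", ""), fields.get("msg", "")))
    if not events:
        return []
    start = events[0][0]
    return [((ts - start).total_seconds(), level, msg)
            for ts, level, msg in events]


# Shortened copies of three lines from the runner logs above.
sample = [
    'time="2021-11-30T15:31:06Z" level=debug msg="updated stage to running" stage.id=25718',
    'time="2021-11-30T15:31:09Z" level=debug msg="destroying the pipeline environment" stage.id=25718',
    'time="2021-11-30T15:31:10Z" level=debug msg="updated stage to complete" duration=2 stage.id=25718',
]

for offset, level, msg in stage_timeline(sample):
    print(f"+{offset:.0f}s {level} {msg}")
```

Going by the timestamps in the logs, about three seconds pass between "updated stage to running" and "destroying the pipeline environment", and no step-level messages show up in between.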
Some other topics hinted at missing secrets failing the pipeline.
This pipeline fails even without any secrets anywhere.
Other threads talk of restarting/updating Drone to fix the issue.
Admittedly, that worked sometimes.
This morning, I rebuilt the whole build cluster on drone/drone:2.6,
and the same skipping issues still occur.
I’m at a loss now as to what I could try differently to fix this issue.
Any pointers?
Thanks