Kubernetes runner image building and caching best practises?

diotsoumas · October 14, 2021, 6:53am

We are evaluating the move from an older Drone1 dind implementation to Drone2 with kubernetes runner. We are in GCP, have ~60 pipelines and obviously we want to introduce as few changes as possible. We also want to be able to scale our GKE node workers based on CPU memory. Currently, we have 2 blockers that we are unsure how to solve:

First is docker image caching. If a pipeline runs on the same worker, everything is working fine as the image exists locally. When new workers are introduced though, everything needs to be rebuild again. I wonder what is the suggested way to move forward with this, I understand that image caching is a difficult problem to solve in the Kubernetes ecosystem, however the runner becomes less exciting if there is no way around image caching.
Then is the image building. Our drone.yaml file includes many different pipelines. One of them builds locally the Docker image, then the rest use it for all kind of things (testing, building etc). With multiple worker nodes, a pipeline can be picked up by a different worker, which doesn’t have the image locally (because another worker built it) and the pipeline fails. I guess we can push the image to GCR when we build it so each pipeline get it from there, but this sounds like an “expensive” way around it, especially without proper caching in place.

Is anyone using the kubernetes runner in a big production scale to share some ideas?

diotsoumas · October 14, 2021, 3:06pm

For the first point, we are proceeding with --cache-from --cache-to options in a shared filestore instance which is mounted to every new worker node when GKE autoscales the instances.

There is no easy way to solve the second point without playing around with mount volumes (which we want to avoid, we have ~60 configurations and some of them have big amount of steps).

However, we are now wondering, is there a way to force all pipelines in a single file, to be spanwed in the same kubernetes worker? That would solve our issue, since the image would always exist in the worker where the docker build command runs.

Topic		Replies	Views
Build Docker image and re-use in the next step - Kubernetes Drone Support	13	1173	July 22, 2020
Is Caching on Kubernetes Possible? Drone Support	4	1248	April 23, 2019
Kubernetes runner builds are left behind Drone Support	2	426	September 15, 2020
How do I create a ‘node local’ cache? Drone Support	1	620	April 15, 2021
Is there any way to specify which runner pipeline use? Drone Support	0	471	July 8, 2020

Kubernetes runner image building and caching best practises?

Related topics