
Concurrency limiter controller #699

Open
imjasonh opened this issue Jan 26, 2021 · 19 comments
Labels
kind/feature, lifecycle/frozen

Comments

@imjasonh
Member

Opening this issue to collect ideas, discussion, interest, etc., for a supplemental PipelineRun controller (and possibly TaskRun controller?) that manages Pending PipelineRuns and updates them to a Running state to limit execution concurrency.

We've heard a few use cases for limiting execution concurrency, but so far it's been hard to generalize the various needs into one single unified "concurrency" concept that we can apply across all of Tekton Pipelines. Some users might want only one "deployment" pipeline running at a time, across the whole cluster. Others might want one "deployment" pipeline per namespace, or per deployment target (only one pipeline can deploy to Prod at a time, but you can deploy to Prod and Staging at the same time), or per input source (only deploy my Git repo to one place at a time), or per authorizing user (Alice can only deploy to one place at a time).

Users might also want to limit TaskRun concurrency, either when run as part of a PipelineRun or when executed directly.

We can experiment with supporting these various models and provide a runnable example of limiting concurrency that users can adapt to their own needs.


As an initial idea, a concurrency controller could be configured with a ConfigMap describing a concurrency key format and a concurrency limit:

apiVersion: v1
kind: ConfigMap
metadata:
  name: concurrency-controller
data:
  concurrency-key: $(metadata.namespace)-$(spec.pipelineRef.name)
  concurrency-limit: "3"

In this example, the key would limit the execution of PipelineRuns referencing the same Pipeline, in the same namespace, to a max of 3 at a time. The concurrency controller would watch for Pending PipelineRuns, derive their keys, count ongoing PipelineRuns with the matching key, and start the new Pending PipelineRun if count < limit. When a PipelineRun finishes, the concurrency controller would reevaluate any Pending runs with that key and choose one to start if the count is under the limit.
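For illustration, here are a few other hypothetical concurrency-key values, using the same variable syntax, that would map onto the use cases described earlier; the field and param references are assumptions about what the controller could resolve, not a settled syntax:

# One "deployment" pipeline at a time across the whole cluster:
concurrency-key: $(spec.pipelineRef.name)

# One at a time per deployment target, passed as a hypothetical "target" param
# (deploys to Prod and Staging could then run concurrently):
concurrency-key: $(spec.pipelineRef.name)-$(spec.params.target)

# One at a time per authorizing user, assuming a label carries that identity:
concurrency-key: $(metadata.labels['triggered-by'])

Each of these would be paired with concurrency-limit: "1".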

(This is just one idea for describing this; if you have something else in mind, please contribute it below.)

@imjasonh added the kind/feature label Jan 26, 2021
@ghost

ghost commented Jan 26, 2021

Here are the issues / PRs / TEPs related to this that I have seen so far:

Big +1 from my pov on making this a component external to Pipelines.

@bigkevmcd
Member

Should there be some sort of load-shedding?

Can you queue PipelineRuns forever? Do they time out?

@imjasonh
Member Author

imjasonh commented Jan 26, 2021

cc @jbarrick-mesosphere for his work on the Pending TEP

@imjasonh
Member Author

imjasonh commented Jan 26, 2021

Should there be some sort of load-shedding?

Can you queue PipelineRuns forever? Do they time out?

Excellent question! This seems like another useful configuration option for the limiter: a max age before a Pending run gets dropped on the ground.

Users might also want to be able to describe or derive a priority, which would rank a Pending PipelineRun ahead of others in the same concurrency bucket. edit: Along with priority comes preemption -- e.g., a new high-priority Pending PipelineRun should cancel an ongoing run to make room for it.
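To sketch how that could surface in the ConfigMap above (every field below beyond the original two is made up for illustration):

kind: ConfigMap
metadata:
  name: concurrency-controller
data:
  concurrency-key: $(metadata.namespace)-$(spec.pipelineRef.name)
  concurrency-limit: "3"
  # Hypothetical: drop a Pending run that has waited longer than this.
  max-pending-age: "1h"
  # Hypothetical: where to derive a priority value from; higher preempts lower.
  priority-key: $(metadata.labels['priority'])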

Ultimately the deliverable here isn't a production-grade, maximally configurable controller, just a minimally useful example that operators can adapt to their own needs.

@mjgallag

mjgallag commented Feb 28, 2021

I'm currently facing this issue trying to do "branch preview", i.e., building and deploying each branch on every push to a separate URL. Multiple pushes to multiple branches can run in parallel, but multiple pushes to a single branch should be processed in order, one at a time. I believe this use case would require the concurrency key format to have access to PipelineRun fields so that the branch name could be included.
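For example, assuming the key syntax proposed above could reference PipelineRun params (that access is exactly the extension being suggested here, and the git-branch param name is hypothetical), per-branch serialization would look like:

concurrency-key: $(metadata.namespace)-$(spec.params.git-branch)
concurrency-limit: "1"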

@julweber

+1

1 similar comment
@eccox

eccox commented Jul 1, 2021

+1

@dbazhal

dbazhal commented Jul 8, 2021

+1 for a limit on simultaneous PipelineRuns.

I'd like something as simple as:

kind: Pipeline
...
spec:
  runPolicy:
    type: Parallel
    parallel:
      maxLimit: 3

with Sequential and LatestOnly as alternatives: Sequential would execute run requests in their natural order, starting the next one when the previous finishes or is cancelled, while LatestOnly would cancel any previous runs as a new run is created.

And I expect this functionality to be the Tekton operator's domain, because it would be strange if some external component decided whether the pipeline operator should start processing the next run or wait.

I assume the Pending state is for situations like this.

I'd point to the OpenShift operator's handling of processing parallelism for BuildConfigs and Builds as a direct analogy and a good example of how this could be done:

https://docs.okd.io/4.7/cicd/builds/advanced-build-operations.html#builds-build-run-policy_advanced-build-operations

https://github.com/openshift/openshift-controller-manager/blob/461fe64e30847a5ae9c361500d7434d2f1756de2/pkg/build/controller/build/build_controller.go#L714

https://github.com/openshift/openshift-controller-manager/blob/461fe64e30847a5ae9c361500d7434d2f1756de2/pkg/build/controller/policy/serial.go

@dbazhal

dbazhal commented Jul 8, 2021

I suppose run policy is also somehow connected to tektoncd/operator#209.

@dbazhal

dbazhal commented Jul 8, 2021

As an alternative to operator functionality, I can make PipelineRuns take a lock on something in the first task of the pipeline and release the lock in a finally task. But that would break any PipelineRun timing metrics, since pipelines would appear to run much longer (including the time spent waiting for the lock to be released). I'd like pipeline duration numbers to contain only "useful" info, showing how long task execution took, not how long the pipeline waited for another run to complete.
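A minimal sketch of that workaround, assuming hypothetical acquire-lease and release-lease Tasks that block on and release a shared lock:

apiVersion: tekton.dev/v1beta1
kind: Pipeline
metadata:
  name: locked-pipeline
spec:
  tasks:
    # Hypothetical Task that blocks until the shared lock is free, then takes it.
    - name: acquire-lock
      taskRef:
        name: acquire-lease
    - name: do-work
      runAfter: ["acquire-lock"]
      taskRef:
        name: do-work
  finally:
    # Runs even if do-work fails, so the lock is always released.
    - name: release-lock
      taskRef:
        name: release-lease

As noted, the wait inside acquire-lock counts toward the run's duration, which is exactly the metrics problem described above.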

@juliaaano

This is an important feature that I have found and used in most CI systems.

Usually it is not affordable to have two pipeline runs executing at the same time if they modify a shared resource, for example when they both result in API calls to a single instance of a system.

An approach like the one used in GitHub Actions seems like an elegant way of implementing this feature: https://docs.github.com/en/actions/learn-github-actions/workflow-syntax-for-github-actions#concurrency
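For reference, in GitHub Actions each workflow run joins a named concurrency group, and at most one run per group executes, optionally cancelling the in-progress one:

concurrency:
  group: deploy-${{ github.ref }}
  cancel-in-progress: true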

@dmikalova

dmikalova commented Oct 26, 2021

This would be useful for serializing Terraform runs:

  • If I have several triggers around the same time, Terraform should run only one at a time, in the order the triggers came in.
  • An option to cancel intermediate runs, so that any pending runs are cancelled by newer pending runs, but in-progress runs are not cancelled.
  • The queue should be keyed: it's not so much the Terraform pipeline that needs to be serialized, but Terraform runs for a specific key.

I was able to implement this in Jenkins but the syntax for it was torturous.

@tekton-robot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale with a justification.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle stale

Send feedback to tektoncd/plumbing.

@tekton-robot added the lifecycle/stale label Jan 24, 2022
@dbazhal

dbazhal commented Jan 24, 2022

/remove-lifecycle stale
/lifecycle frozen

@tekton-robot added the lifecycle/frozen label and removed the lifecycle/stale label Jan 24, 2022
@david972

david972 commented Nov 2, 2022

+1

1 similar comment
@shaharb-hs

👍

@AshwinSridharan0410

Hi. I would like to run my databases in parallel, so that when I issue the Flyway command it runs against all the databases at once. I don't want the process to happen sequentially. Any idea would be helpful.

@emirot

emirot commented Jul 31, 2023

Any updates on this?
Found this workaround: https://holly-k-cummins.medium.com/using-lease-resources-to-manage-concurrency-in-tekton-builds-344ba84df297
but it's not native and doesn't provide ordering.
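For context, that workaround has each run hold a coordination.k8s.io Lease as a mutex. A minimal sketch of such a Lease object (the name and holder here are illustrative):

apiVersion: coordination.k8s.io/v1
kind: Lease
metadata:
  name: pipeline-lock
spec:
  holderIdentity: my-pipelinerun
  leaseDurationSeconds: 60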

@jimmyjones2

With TEP-0135's coscheduling mode, PVCs get deleted when PipelineRuns finish. Maybe adding a ResourceQuota on the number of PVCs would therefore limit the number of concurrent PipelineRuns to that limit?
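A minimal sketch of that idea; it relies on the per-PipelineRun PVC behavior described in TEP-0135, and the name and limit here are illustrative:

apiVersion: v1
kind: ResourceQuota
metadata:
  name: limit-concurrent-pipelineruns
spec:
  hard:
    persistentvolumeclaims: "3"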
