Move tag-based concurrency management into clients #14382

abrookins · 2024-06-27T19:02:13Z

Move tag-based concurrency handling client-side, implemented with global concurrency limits. This fixes #14360 and forms part of our larger effort to move all elements of task orchestration client-side.

Limitations and future work:

This changes the behavior of task runs waiting for a concurrency slot. Runs transition to Running before they acquire a slot. As future work, we could make runs that use tag-based concurrency transition to a named Running state, such as Running["AcquiringSlot"], and then transition to the normal Running state after acquiring a slot.
Task run concurrency limits can report which task runs are using the limits, but global concurrency limits do not report the entity using a limit. In future work, users will be able to see which task runs and flow runs are using global concurrency limits.

Example

This PR changes tag-based task concurrency to use global concurrency limits. When a global concurrency limit exists whose name matches a tag in your task, we will apply that limit to the task when it runs. If you want to create a limit to match a tag on your task, you should create a global concurrency limit, not a task run concurrency limit. Future work will likely consolidate these concepts.

Checklist

This pull request includes a label categorizing the change e.g. maintenance, fix, feature, enhancement, docs.
This pull request references any related issue by including "closes <link to issue>"
- If no issue exists and your change is not a small fix, please create an issue first.
If this pull request adds new functionality, it includes unit tests that cover the changes
If this pull request removes docs files, it includes redirect settings in mint.json.
If this pull request adds functions or classes, it includes helpful docstrings.

src/prefect/client/orchestration.py

src/prefect/task_engine.py

src/prefect/utilities/engine.py

zangell44

A couple concerns

The concurrency v2 api does not track which object took a given slot. If the client crashes mid-run, we have no way of recovering a slot automatically.
We should not concurrency limit ALL of a task run's tags. Currently we only apply them to tags with limits defined.
3.x and 2.x task run limits will not be compatible with one another
It seems odd for task run code to be activating concurrency limits. What happens if I want to shut them off?

zhen0 · 2024-07-08T20:40:21Z

@abrookins - doing a bit of maintenance as we have a lot of potentially stale PRs. Is this one that needs action? Or can it be closed?

abrookins · 2024-07-09T14:19:23Z

Still working on this one! 👍

abrookins · 2024-07-10T23:41:59Z

@zangell44 Good questions! We may need to expand the concurrency v2 API with more functionality. I'm thinking about this now. I didn't understand question #4.

codspeed-hq · 2024-07-12T23:03:06Z

CodSpeed Performance Report

Merging #14382 will not alter performance

_{Comparing global-concurrency-tags (69e8665) with main (0d23f58)}

Summary

✅ 5 untouched benchmarks

zangell44 · 2024-07-15T13:34:07Z

I think the create_if_missing kwarg + functionality resolves questions 2 and 4.
I do think 1 and 3 are still worthy of consideration.

1.) The concurrency v2 api does not track which object took a given slot. If the client crashes mid-run, we have no way of recovering a slot automatically.
3.) 3.x and 2.x task run limits will not be compatible with one another

3 may not have a solution outside of documenting the behavior.

cicdw

first pass review - didn't dig into the user agent stuff versioning logic yet

src/prefect/client/orchestration.py

src/prefect/server/models/concurrency_limits_v2.py

src/prefect/settings.py

abrookins · 2024-07-23T23:06:43Z

@zangell44 For 1), I think global concurrency limits should be able to tell you who or what is using them. I plan to add an API endpoint in a follow-up PR that looks at limit acquired and limit released events within a time window to flow or task runs currently using the limit.

For 4), the story should be a little simpler now that client-side concurrency limits will ship with client-side orchestration. That allows us to dump the version-checking code server-side because clients will only be using this new concurrency approach when they use client-side orchestration.

However, the fact remains that if you use client-side orchestration with a task whose tags you had previously created limits for, you would currently need to recreate the limits as global concurrency limits. I haven't spent much time thinking through how to smooth this for users.

cicdw

Good stuff! A few minor nitpicks, otherwise LGTM

src/prefect/concurrency/services.py

src/prefect/concurrency/sync.py

…Q/prefect into global-concurrency-tags

src/prefect/concurrency/asyncio.py

POC

3115e68

abrookins requested review from a team and zangell44 as code owners June 27, 2024 19:02

mintlify bot deployed to staging June 27, 2024 19:05 View deployment

abrookins changed the title ~~POC of supporting tag-based concurrency with global concurrency limits~~ Move tag-based concurrency management into clients Jun 27, 2024

bunchesofdonald reviewed Jun 27, 2024

View reviewed changes

src/prefect/client/orchestration.py Outdated Show resolved Hide resolved

src/prefect/client/orchestration.py Outdated Show resolved Hide resolved

src/prefect/task_engine.py Outdated Show resolved Hide resolved

abrookins commented Jun 27, 2024

View reviewed changes

src/prefect/utilities/engine.py Outdated Show resolved Hide resolved

abrookins added 5 commits June 27, 2024 13:48

Revert experiment

245e71f

Revert experiment

a2e2ad0

ditto

f8ca77e

ditto

45b1b34

ditto

9ad5a37

zangell44 reviewed Jun 28, 2024

View reviewed changes

fix propose state call

beb2c49

Merge branch 'main' into global-concurrency-tags

1398c46

abrookins added 4 commits July 12, 2024 13:29

Checkpoint

e15e1f8

Use client-side orchestration policy

c0c166b

restore type hint

c75a48a

Merge branch 'main' into global-concurrency-tags

088bb29

Merge branch 'main' into global-concurrency-tags

0d7b445

abrookins requested review from discdiver, daniel-prefect and cicdw as code owners July 22, 2024 22:06

github-actions bot added the 3.x label Jul 22, 2024

github-actions bot added the bug Something isn't working label Jul 22, 2024

cicdw reviewed Jul 22, 2024

View reviewed changes

src/prefect/client/orchestration.py Outdated Show resolved Hide resolved

src/prefect/server/models/concurrency_limits_v2.py Show resolved Hide resolved

src/prefect/settings.py Outdated Show resolved Hide resolved

src/prefect/settings.py Outdated Show resolved Hide resolved

Move client-side concurrency behind orchestration flag

a206fbb

abrookins added 3 commits July 23, 2024 16:37

Fix an error

5a9152a

Merge branch 'main' into global-concurrency-tags

782026c

Merge branch 'main' into global-concurrency-tags

bc7e466

cicdw reviewed Jul 26, 2024

View reviewed changes

src/prefect/concurrency/services.py Outdated Show resolved Hide resolved

src/prefect/concurrency/services.py Outdated Show resolved Hide resolved

src/prefect/concurrency/sync.py Outdated Show resolved Hide resolved

abrookins added 3 commits July 26, 2024 08:36

Merge branch 'main' into global-concurrency-tags

f98ef0c

Merge branch 'global-concurrency-tags' of https://github.com/PrefectH…

a5ec4f0

…Q/prefect into global-concurrency-tags

Fix docstrings

c9d01bd

cicdw reviewed Jul 26, 2024

View reviewed changes

src/prefect/concurrency/asyncio.py Outdated Show resolved Hide resolved

cicdw reviewed Jul 26, 2024

View reviewed changes

src/prefect/concurrency/asyncio.py Outdated Show resolved Hide resolved

abrookins added 6 commits July 26, 2024 13:39

Fix more docstrings

b4299b6

Merge branch 'main' into global-concurrency-tags

b2ec8e0

Skip a flaky test

204b146

fix variable and param names

7cbd5ca

more variable name changes

7528cbe

Fix test name

69e8665

abrookins added the enhancement An improvement of an existing feature label Jul 26, 2024

cicdw approved these changes Jul 27, 2024

View reviewed changes

abrookins merged commit 717dcff into main Jul 27, 2024
32 of 33 checks passed

abrookins deleted the global-concurrency-tags branch July 27, 2024 00:51

abrookins mentioned this pull request Jul 29, 2024

Global concurency utilities should return early if no limit names are given #14786

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move tag-based concurrency management into clients #14382

Move tag-based concurrency management into clients #14382

abrookins commented Jun 27, 2024 •

edited

Loading

zangell44 left a comment

zhen0 commented Jul 8, 2024

abrookins commented Jul 9, 2024

abrookins commented Jul 10, 2024

codspeed-hq bot commented Jul 12, 2024 •

edited

Loading

zangell44 commented Jul 15, 2024

cicdw left a comment

abrookins commented Jul 23, 2024 •

edited

Loading

cicdw left a comment

Move tag-based concurrency management into clients #14382

Move tag-based concurrency management into clients #14382

Conversation

abrookins commented Jun 27, 2024 • edited Loading

Example

Checklist

zangell44 left a comment

Choose a reason for hiding this comment

zhen0 commented Jul 8, 2024

abrookins commented Jul 9, 2024

abrookins commented Jul 10, 2024

codspeed-hq bot commented Jul 12, 2024 • edited Loading

CodSpeed Performance Report

Merging #14382 will not alter performance

Summary

zangell44 commented Jul 15, 2024

cicdw left a comment

Choose a reason for hiding this comment

abrookins commented Jul 23, 2024 • edited Loading

cicdw left a comment

Choose a reason for hiding this comment

abrookins commented Jun 27, 2024 •

edited

Loading

codspeed-hq bot commented Jul 12, 2024 •

edited

Loading

abrookins commented Jul 23, 2024 •

edited

Loading