Application dependencies #7437

jessesuen · 2021-10-14T07:28:34Z

Summary

I was speaking with @JasonMorgan from Buoyant today about a missing feature in Argo CD for blocking application syncs based on required dependencies on other applications. The use case is:

I need to deploy apps A and B
B must not be deployed before A (because A has a mutating webhook which must be in place before B starts)
I want to sync them all at the same time and don't want to think about clicking sync in some correct order

This is especially important for the bootstrapping use case, where you're recreating a cluster from git and need to create many apps after a bunch of system-level add-ons are fully available. For example, linkerd must be in place before any applications come up, because linkerd's mutating webhook needs to inject sidecars into application pods as they start.

The use case is very compelling and I'm convinced we should prioritize this. I think this feature, combined with ApplicationSets, will really start to complete our bootstrapping story.

Motivation

Please give examples of your use case, e.g. when would you use this.

During cluster bootstrapping, cluster addons (especially ones with mutating webhooks) need to be in place before application pods can come up.

Proposal

How do you think this should be implemented?

It turns out @jannfis already started some work on this, and the spec changes are close to what we need: #3892

Given the age of the original PR, I'm filing an issue in case we abandon #3892 for a new attempt, and tentatively targeting this for the next milestone in case someone wants to pick it up.

jannfis · 2021-10-30T09:42:21Z

I'm glad to see this gaining traction again. From previous discussions, we thought that the sync retry feature would solve this problem in a more declarative way (e.g. reconcile as long as necessary, hoping for dependencies to have finished reconciling in a certain time frame).

I think we could build upon the existing PoC code; however, I think we should consider some more things than are currently implemented in the PoC:

Application dependency specification should allow for label selectors as well as single, named Applications
Ability to optionally restrict dependencies to the same destination cluster
Force sync should override/ignore any unmet dependencies when syncing manually
Dependencies should be visualized in the UI, similar to how we visualize ownerReferences

And probably some more things I have somewhere in the back of my mind from when I came up with the PoC.

jessesuen · 2021-11-04T01:24:20Z

I'm glad to see this gaining traction again. From previous discussions, we thought that the sync retry feature would solve this problem in a more declarative way (e.g. reconcile as long as necessary, hoping for dependencies to have finished reconciling in a certain time frame).

Yes, what I now realize is that retries don't help because in the problematic scenario (mutating webhooks), nothing actually "fails" per se and so there is nothing to retry. The dependent application silently succeeds even though it didn't get injected properly.

I think we could build up upon the existing PoC code, however, I think we should consider some more things than are currently implemented in the PoC:

I love your ideas on making this even more powerful with labels and force sync. But for MVP, we can keep this quite simple, not very far removed from your PoC. The way I think this feature should work is:

Application B depends on A. Both applications are created, but neither is deployed (both have a Missing health status).
User clicks sync on B
B now has an operation in a Running state (because we don't have a Pending state), but stays in Running indefinitely because A is not healthy (NOTE: we would also keep it in Running if A did not exist).
User eventually clicks sync on A
As soon as A is Healthy, B would actually go through with the operation.

I took a look at your work, and I believe you implemented it just like how I described it.

Dependencies should be visualized in the UI, similar to how we visualize ownerReferences

I think this is more than we need; a simple message in the operation would be sufficient to understand what's going on.

Lavanya-Anbalagan · 2022-03-18T07:21:45Z

This is a blocker for us and forces us to put a lot of effort into coordinating dependent applications. Can we get an update on this?

flaviomoringa · 2022-03-25T17:51:16Z

Have the exact same issue with installing Kyverno and then some policies.
Also referenced here:
#8358
#7978

hhannani · 2022-03-28T18:25:38Z

Hi team, is there a way to use dependencies between yaml files within the same Application?

DotNetRockStar · 2022-03-30T18:36:07Z

bump; same issues.

rafilkmp3 · 2022-04-01T19:28:05Z

bump; same issues.

christianh814 · 2022-04-22T15:16:23Z

Just adding my "bump" here. This is mainly because I would also like this with ApplicationSets as I stated in issue #221

wmgroot · 2022-04-22T22:10:41Z

I've opened a PR showing a possible implementation path (which needs some work).
This is against the old repo, but I'd like to get feedback on the direction before investing more effort into migrating it to this repo.
wmgroot/applicationset#1

If the dependency work is close to completion, I believe it could replace the user defined rollout stages in my PR.

qxmips · 2022-06-01T04:46:14Z

same here

nneram · 2022-06-09T08:35:06Z

We would love to see this feature as well ! 👍🏻

rumstead · 2022-06-22T18:24:26Z

Adding my "bump".

EDIT:
Use cases:

Namespaces/Namespace quotas (cluster bootstrap)
Vault (mutating webhook)
Service mesh (Consul with a mutating webhook)
Capsule (multi-tenancy enabler)
Business applications

chenele · 2022-06-28T04:55:02Z

Adding my bump

crenshaw-dev · 2022-06-28T18:43:04Z

Thanks for the +1s! If you leave a comment, please add info about your use case so it can be considered when writing the feature. Otherwise adding a thumbs-up to the issue is sufficient to move it up the priorities list. :-)

imusmanmalik · 2022-07-07T08:47:10Z

+1 would love to see this feature as well

We also have this requirement of Apps based on Apps and so on. Same use case: Application B depends on A.

dgsardina · 2022-07-18T11:45:48Z

+1

My use case is cluster bootstrap: we have istiod and istio-ingressgateway deployed as independent applications, but the latter fails to sync because the mutating webhook of the former is not ready when it is deployed.

RobCannon · 2022-07-23T14:34:09Z

My use cases are:
I have an Application that references a folder-based chart that has our Certificate declarations. That Application will fail unless the Application that installs the cert-manager helm chart has succeeded (even if I install the CRDs first). I would also like to make the Applications that deploy our app services dependent on the certificates Application.

I can use sync waves and App of App hierarchies to get everything to deploy in the right order when I bootstrap a cluster, but just having a property on the Application that says it is dependent on one or more other Applications seems MUCH easier to manage. Let ArgoCD figure out the order based on the dependency info!

RobCannon · 2022-07-23T14:40:21Z

It looks like this is being tracked on the roadmap in this issue. Please go upvote!
#3517

day0hero · 2022-07-28T14:05:39Z

I would really like to see this feature added! We are using jobs with sync-waves/hooks to get this functionality. While it works, it can be cumbersome to implement/debug especially when you're putting these hooks in across 10+ applications. Having the ability to clearly define the dependencies between the applications would be awesome!

Just as an example of our deployment scenario (there are other components to this, but the flow is similar):

deploy cloud storage (openshift data foundation)
kubernetes job that waits for the storage to become available
deploy dependent resources (quay, objectstoreuser (for s3 integration))
kubernetes job that waits for the user and secret to get auto-generated
deploy remaining applications
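
A rough sketch of what one of these wait jobs can look like. The job name, service account, image, and the resource being waited on are all hypothetical placeholders, not our actual manifests; only the `argocd.argoproj.io/*` annotations and `kubectl wait` are real mechanisms:

```yaml
apiVersion: batch/v1
kind: Job
metadata:
  name: wait-for-storage                 # hypothetical name
  annotations:
    # Run as a sync hook in wave 1, after the wave-0 storage resources are applied.
    argocd.argoproj.io/hook: Sync
    argocd.argoproj.io/sync-wave: "1"
    # Remove the Job once it has succeeded so the next sync can recreate it.
    argocd.argoproj.io/hook-delete-policy: HookSucceeded
spec:
  backoffLimit: 5
  template:
    spec:
      # Hypothetical service account; it needs RBAC to get/list/watch the target resource.
      serviceAccountName: wait-for-storage
      restartPolicy: Never
      containers:
        - name: wait
          image: bitnami/kubectl:latest  # assumption: any image that ships kubectl works
          command:
            - kubectl
            - wait
            - --for=condition=Available
            - deployment/storage-operator  # hypothetical resource to wait on
            - --namespace=storage          # hypothetical namespace
            - --timeout=600s
```

Every application that has a prerequisite needs a job like this plus the matching RBAC, which is where the maintenance burden comes from.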

jaxels10 · 2023-07-28T08:04:52Z

We need this for deploying certain applications before others, such as kyverno before kyverno policies, and also for having Ceph fully reconciled before letting applications use its storage classes. This is the number one missing feature keeping us from using Argo and on Flux instead. If this were implemented, I am sure we would make the switch.

sambonbonne · 2023-07-31T12:28:39Z

@jaxels10 maybe you already considered this option, but why not use sync waves if you "just" want to be sure of the apply order?

See the documentation for more information.

If your applications are centralized in one repository, with the apps of apps pattern, you can use sync waves to ensure apply order.
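
A minimal sketch of that setup, assuming a parent app-of-apps whose source directory contains the two child Application manifests below. The application names, repository URL, and paths are hypothetical; only the `argocd.argoproj.io/sync-wave` annotation is the actual mechanism:

```yaml
apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
  name: kyverno                          # hypothetical child app, applied first
  namespace: argocd
  annotations:
    argocd.argoproj.io/sync-wave: "0"
spec:
  project: default
  source:
    repoURL: https://example.com/org/gitops.git   # hypothetical repository
    targetRevision: HEAD
    path: apps/kyverno
  destination:
    server: https://kubernetes.default.svc
    namespace: kyverno
---
apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
  name: kyverno-policies                 # hypothetical child app, applied second
  namespace: argocd
  annotations:
    argocd.argoproj.io/sync-wave: "1"
spec:
  project: default
  source:
    repoURL: https://example.com/org/gitops.git   # hypothetical repository
    targetRevision: HEAD
    path: apps/kyverno-policies
  destination:
    server: https://kubernetes.default.svc
    namespace: kyverno
```

Lower waves are applied first, and Argo CD only moves on to the next wave once the resources in the current wave are healthy, so this ordering relies on the parent being able to assess the health of the child Application resources.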

fvogl · 2023-08-09T05:28:02Z

@sambonbonne unfortunately sync-waves don't work for app-of-apps in case of updates. The sync order is working for the initial deployment of the apps and also while deleting them (Argo takes them out in the descending order). For updates though the order is random and basically most of the changes are applied at the same time. I would love to see the sync-waves working.

purduemike · 2023-08-09T17:36:39Z

the solution here is to make sure all apps are truly independent and will retry themselves until all the definitions they rely on are in memory.

I tend to agree with @shanproofpoint. We should try to make sure apps are independent. My use case is to ensure our DB schema changes are live before starting App B. This can easily be done in code: App B just needs to check the schema version in the DB before turning its health check green.
The problem with sync-waves between apps is: how should apps behave if their updates don't finish before the next sync? I feel like this can get really complex really quickly. So, shooting for app independence is key.

jannfis · 2023-08-29T17:47:00Z

I took a new stab at implementing this. I diverged a little from the previous approach, but I think it's pretty usable already: #15280

leoluz · 2023-09-25T18:57:58Z

This is a highly voted proposal and while I think the main use case (mutating webhook) makes some sense, I am also concerned about how this feature could promote anti-patterns in microservice designs.

The first example that comes to mind is the distributed monolith. Ideally (in a perfect world :) ) an application should be resilient enough to be deployed even if its dependencies aren't satisfied. A simple example is a service that depends on Prometheus infra to expose metrics. It doesn't really matter whether Prometheus is available on the cluster or not. The core functionality of this service should still be available, and once the Prometheus infra is up it will start scraping metrics without requiring the application to restart. If someone configures this service in Argo CD with a dependency on Prometheus, it will block new syncs if Prometheus is unavailable (maybe even if it is Degraded?) when it shouldn't. This is a very simplistic example, but I am pretty sure there are many more ways this feature could be misused, which would make support much harder for Argo CD admins.

If the dependency graph is complex, with many apps and levels involved, how would users be able to visualize the dependency tree to understand what is causing their application to remain out-of-sync?

@jessesuen @jannfis

jannfis · 2023-09-25T19:10:09Z

If the dependency graph is complex with many apps and levels involved, how users would be able to visualize the dependency tree to understand what is causing their application to remain out-of-sync?

In the most recent incarnation, if the sync is blocked by a dependency's state, it will be noted in the Application's .status field. So far, there are no plans on visualization, but the information is readily available in the Application CRs. The wait state will also be reflected in an Application's conditions, so the information is easily accessible from the UI.

leoluz · 2023-09-25T19:35:37Z

The wait state will also be reflected in an Application's conditions, so the information is easily accessible from the UI

I am sorry but as far as I know the Application's status fields are not exposed in the UI. Am I wrong? It requires kubectl access in the cluster where the Applications are synced. Anyhow, let's put ourselves in the user's shoes: As a devops, I pushed a change in git and my application remains out of sync. Even if I click the sync button nothing happens. There is no place in Argo CD UI to tell me why the application is not syncing. I have to call support. Argo CD admin must look in the gigantic Application's status field to dig where the error is.

We are having many different support issues where the answer is in the resource's status field but users just don't look at it. The direction that we are going is to surface important status fields data in Argo CD UI to make it more user friendly.

jannfis · 2023-09-25T20:01:54Z

@leoluz While waiting for any dependencies, it will look in the UI right now as follows:

[screenshot not preserved]

and

[screenshot not preserved]

So no direct cluster access required. Obviously, this information could be surfaced a little better. I'm open to suggestions, but I believe for an MVP, this might be good enough.

shinebayar-g · 2023-10-16T03:22:07Z

I can use sync waves and App of App hierarchies to get everything to deploy in the right order when I bootstrap a cluster

Excuse me, how do you do this? I am using the App of Apps pattern and added argocd.argoproj.io/sync-wave: '-1' to the CRDs application. But kube-prometheus-stack still started syncing before the CRDs were even installed.

apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
  annotations:
    argocd.argoproj.io/sync-wave: '-1'
  name: kube-prometheus-stack-crds
  namespace: argocd
  finalizers:
    - resources-finalizer.argocd.argoproj.io

Edit: Found this really nice blog post that explains it. https://codefresh.io/blog/argo-cd-application-dependencies/

aiceball · 2023-11-01T20:08:33Z

@jannfis
am I correct in understanding that your PR #15280 would work with any application deployment strategy?

i.e. it would cover all of the following cases:

manual deployment of multiple apps
app of apps
applicationsets
app of applicationsets

jannfis · 2023-11-13T13:23:34Z

@aiceball Yes, the dependency mechanism would be rather independent of the pattern you use to create/maintain your applications.

zs-dima · 2023-12-09T10:47:59Z

What about dependsOn for ApplicationSet elements?

apiVersion: argoproj.io/v1alpha1
kind: ApplicationSet
metadata:
  name: my-applications
  namespace: argocd
spec:
  generators:
    - list:
        elements:
          # Infrastructure
          - name: cert-manager
            path: infrastructure/networking/cert-manager
          - name: traefik
            path: infrastructure/networking/traefik
            dependsOn:
              - cert-manager
          - name: rancher
            path: infrastructure/system/rancher
            dependsOn:
              - traefik
          # Apps
          - name: n8n
            path: apps/n8n
            dependsOn:
              - traefik
  template:
    metadata:
      name: '{{name}}'
    spec:
      project: default
      source:
        repoURL: 'https://github.com/${GITHUB_USER}/${GITHUB_REPO}.git'
        targetRevision: HEAD
        path: '{{path}}'
      destination:
        server: 'https://kubernetes.default.svc'
        namespace: '{{name}}-system'

FluxCD has dependencies:
https://fluxcd.io/flux/components/kustomize/kustomizations/#dependencies

Even Docker Compose and Docker Swarm have depends_on:
https://docs.docker.com/compose/compose-file/compose-file-v3/#depends_on

christianh814 · 2023-12-11T15:31:15Z

@zs-dima There's already a way to do that with progressive syncs

https://argo-cd.readthedocs.io/en/stable/operator-manual/applicationset/Progressive-Syncs/
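
As a rough sketch of how the ordering above could be expressed with that feature: the `wave` element field and the `wave` label are names I made up for illustration, and only two of the elements are shown. Progressive syncs are an alpha feature and have to be explicitly enabled on the ApplicationSet controller.

```yaml
apiVersion: argoproj.io/v1alpha1
kind: ApplicationSet
metadata:
  name: my-applications
  namespace: argocd
spec:
  generators:
    - list:
        elements:
          - name: cert-manager
            path: infrastructure/networking/cert-manager
            wave: "1"                    # hypothetical field, used only for labeling
          - name: traefik
            path: infrastructure/networking/traefik
            wave: "2"
  strategy:
    type: RollingSync
    rollingSync:
      steps:
        # Step 1: Applications labeled wave=1 are synced first.
        - matchExpressions:
            - key: wave
              operator: In
              values:
                - "1"
        # Step 2: Applications labeled wave=2 follow once step 1 is healthy.
        - matchExpressions:
            - key: wave
              operator: In
              values:
                - "2"
  template:
    metadata:
      name: '{{name}}'
      labels:
        wave: '{{wave}}'                 # steps select generated Applications by this label
    spec:
      project: default
      source:
        repoURL: 'https://github.com/${GITHUB_USER}/${GITHUB_REPO}.git'
        targetRevision: HEAD
        path: '{{path}}'
      destination:
        server: 'https://kubernetes.default.svc'
        namespace: '{{name}}-system'
```

This groups Applications into ordered steps rather than declaring a per-element dependsOn, so it covers the ordering but not an arbitrary dependency graph.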


vvatlin · 2024-04-15T14:59:59Z

It's still impossible to guarantee orders between apps. Sync waves don't work.

nneram · 2024-04-15T15:27:56Z

Hi @vvatlin, I can confirm that it's working, at least in the version I use, v2.8.4. I have an app of apps pattern with 11 applications and still growing, with nearly 7 waves. All you need is here: https://argo-cd.readthedocs.io/en/stable/operator-manual/cluster-bootstrapping/#app-of-apps-pattern. However, you need to add health assessment since v1.8 (#3781). Otherwise, it will not work.
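
For reference, the health assessment for Application resources can be added back as a custom Lua health check in the argocd-cm ConfigMap. A sketch along the lines of what the upgrade notes describe, assuming Argo CD is installed in the argocd namespace:

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: argocd-cm
  namespace: argocd                      # assumption: default install namespace
  labels:
    app.kubernetes.io/name: argocd-cm
    app.kubernetes.io/part-of: argocd
data:
  # Report a child Application's own health status to the parent app,
  # defaulting to Progressing until the child has reported anything.
  resource.customizations.health.argoproj.io_Application: |
    hs = {}
    hs.status = "Progressing"
    hs.message = ""
    if obj.status ~= nil then
      if obj.status.health ~= nil then
        hs.status = obj.status.health.status
        if obj.status.health.message ~= nil then
          hs.message = obj.status.health.message
        end
      end
    end
    return hs
```

Without a check like this, the parent has no health signal for child Applications, so a sync wave has nothing to wait on before moving to the next one.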

For more information: https://argo-cd.readthedocs.io/en/stable/operator-manual/upgrading/1.7-1.8/.
I think you also have ApplicationSets, but I didn't look in that way.

They are working solutions but it would be easier with dependencies. I agree with that.

christianh814 · 2024-04-15T17:32:30Z

It's still impossible to guarantee orders between apps. Sync waves don't work.

hey @vvatlin , I wrote a blog about getting Syncwaves working with App of Apps

vvatlin · 2024-04-16T11:00:25Z

I have app of apps and Health assessment also. And my child apps still synchronize randomly. argocd 2.10.7

chanakya-svt · 2024-07-30T21:03:56Z

Hi @vvatlin, I can confirm that it's working, at least in the version I use, v2.8.4. I have an app of apps pattern with 11 applications and still growing, with nearly 7 waves. All you need is here: https://argo-cd.readthedocs.io/en/stable/operator-manual/cluster-bootstrapping/#app-of-apps-pattern. However, you need to add health assessment since v1.8 (#3781). Otherwise, it will not work.

For more information: https://argo-cd.readthedocs.io/en/stable/operator-manual/upgrading/1.7-1.8/. I think you also have ApplicationSets, but I didn't look in that way.

They are working solutions but it would be easier with dependencies. I agree with that.

Hi @vvatlin, with the setup that's working for you, are you using ServerSideApply/ServerSideDiff in the ApplicationSet?

jessesuen added the enhancement label Oct 14, 2021

jessesuen added this to the v2.3 milestone Oct 14, 2021

jessesuen mentioned this issue Oct 22, 2021

docs: more post v2.3 roadmap items #7509

Merged

alexmt modified the milestones: v2.3, v2.4 Jan 26, 2022

christianh814 mentioned this issue Jan 26, 2022

RFE: Support for "SyncWaves" for ApplicationSets argoproj/applicationset#221

Open

jannfis mentioned this issue Feb 4, 2022

Argo cd doesn't respect waves #8358

Closed

wmgroot mentioned this issue Apr 22, 2022

Experimental AppSet progressive sync capability wmgroot/applicationset#1

Open

alexmt modified the milestones: v2.4, v2.5 Jun 21, 2022

crenshaw-dev mentioned this issue Aug 1, 2022

Add explicit "dependsOn" functionality to Applications #10154

Closed

jannfis linked a pull request Aug 29, 2023 that will close this issue

feat: Application dependencies #15280

Open

13 tasks

blakepettersson mentioned this issue Sep 24, 2023

argocd cluster rm --cascade ? #8886

Closed

blakepettersson mentioned this issue Dec 11, 2023

Add sync-waves/dependencies at the application level #7978

Closed

csantanapr mentioned this issue Mar 8, 2024

Question about this pattern and ordered application set gitops-bridge-dev/gitops-bridge#57

Open

kurktchiev added a commit to back-stack/everything-as-code that referenced this issue Mar 11, 2024

update to the proper current ordering. see argoproj/argo-cd#7437

0d9c34f

Signed-off-by: Boris Kurktchiev <kurktchiev@gmail.com>

ChristianCiach mentioned this issue Apr 19, 2024

Include Application Health Check (Revert #3781) #16870

Open

3 tasks

dtzar mentioned this issue Apr 30, 2024

Eliminate sleep pre-sync hook for CAPZ install Azure-Samples/aks-platform-engineering#47

Open

ChristianCiach mentioned this issue May 17, 2024

Support sync waves for applicationset #18268

Open

metacoma mentioned this issue May 31, 2024

Istio Ingress occasionally fails with ImagePullBackoff error mindwm/mindwm-gitops#19

Closed

alexmt added component:argo-cd type:enhancement labels Jul 30, 2024
