Number of queries to Airflow database in "DAG File Processing Stats" #40282
Comments
Good idea! I would also love a global Airflow option that, when set to True, makes parsing fail for DAGs that use variables / connections ... at the top level, and promotes the use of Dynamic Task Mapping in the error log.
@raphaelauv Wouldn't such a feature conflict with the idea behind the Secrets Cache, which was also released recently?
And yes, this feature will no longer be useful when
@eladkal @potiuk @kaxil
Yeah, let's keep it separate.
In my PR for this feature I added a new column to the processing results table in the log file. In the discussion thread I saw some ideas about adding this information to the UI or the DB, but I didn't notice a strong consensus, so I decided not to make any UI or DB changes in this PR. I am ready to create a new PR in the future once we have a clear vision of where in the UI and DB this data best belongs.
Description
This feature adds a new column to the "DAG File Processing Stats" table in the DAG processor logs. The column shows the number of queries made to the Airflow database per DAG file.
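As a rough illustration of the idea (not the implementation from the PR), the sketch below counts the SQL statements issued while a file is "parsed" and prints them as an extra column. Everything here is an assumption made for the example: a stdlib `sqlite3` in-memory database stands in for the Airflow metadata database, and `count_queries`, the two `parse_*` functions, the file names, and the column headings are hypothetical.

```python
import sqlite3
from contextlib import contextmanager


@contextmanager
def count_queries(conn):
    """Count SQL statements that SQLite executes on `conn` inside the block."""
    stats = {"queries": 0}

    def on_statement(_sql):
        stats["queries"] += 1

    conn.set_trace_callback(on_statement)
    try:
        yield stats
    finally:
        conn.set_trace_callback(None)  # stop counting once the block ends


# Stand-in for the metadata database (autocommit mode, so only our own
# statements reach the trace callback).
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("CREATE TABLE variable (key TEXT PRIMARY KEY, val TEXT)")
conn.execute("INSERT INTO variable VALUES ('env', 'prod')")


def parse_chatty_file():
    # Hypothetical DAG file with three top-level lookups, one query each.
    for _ in range(3):
        conn.execute("SELECT val FROM variable WHERE key = ?", ("env",)).fetchone()


def parse_quiet_file():
    pass  # hypothetical DAG file with no database access at parse time


rows = []
for name, parse in [("chatty_dag.py", parse_chatty_file), ("quiet_dag.py", parse_quiet_file)]:
    with count_queries(conn) as stats:
        parse()
    rows.append((name, stats["queries"]))

# Hypothetical rendering of the extra column in a stats table.
print(f"{'File Path':<16}{'# DB Queries':>12}")
for name, n in rows:
    print(f"{name:<16}{n:>12}")
```

A per-file count like this makes it easy to spot which files generate database traffic during parsing without inspecting their source.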
Use case/motivation
This new column is convenient when debugging issues related to high load on the Airflow database. A typical scenario is DAG files that make many database queries in top-level code, which are executed every time those files are parsed. One common example is excessive use of "Variable.get" in top-level statements of DAG files.
Having the number of queries to the Airflow database per DAG file can help a lot when debugging issues related to high database load or slow parsing of DAG files.
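To make the top-level pattern concrete, here is a minimal sketch that contrasts a lookup at module level with one deferred into a function. It does not import Airflow: the `Variable` class is a stand-in that only counts lookups, and `parse`, `TOP_LEVEL_DAG`, and `DEFERRED_DAG` are hypothetical names used to mimic the DAG processor executing a file's top-level code on each parse.

```python
class Variable:
    """Stand-in for airflow.models.Variable; counts lookups instead of querying a database."""

    lookups = 0
    _store = {"env": "prod"}

    @classmethod
    def get(cls, key):
        cls.lookups += 1
        return cls._store[key]


# Lookup at module level: executed every time the file is parsed.
TOP_LEVEL_DAG = '''
env = Variable.get("env")

def task():
    return env
'''

# Lookup inside the task callable: executed only when the task runs.
DEFERRED_DAG = '''
def task():
    return Variable.get("env")
'''


def parse(source, times):
    """Mimic repeated parsing: run the file's top-level code `times` times."""
    before = Variable.lookups
    for _ in range(times):
        exec(source, {"Variable": Variable})
    return Variable.lookups - before


top_level_lookups = parse(TOP_LEVEL_DAG, times=5)
deferred_lookups = parse(DEFERRED_DAG, times=5)
print(top_level_lookups, deferred_lookups)  # prints "5 0"
```

In a real deployment each of those top-level lookups may become a database query on every parse, which is the load the proposed column is meant to surface.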
Related issues
Thread with discussion in the Airflow community: https://lists.apache.org/thread/9j6q2lq521rt5zx46l2dvow2c85sgqwb
Are you willing to submit a PR?
Code of Conduct