Audit task failed-deps for later investigation #39712

eladkal · 2024-05-20T10:39:16Z

Body

Currently Airflow offers failed-deps to investigate why task isn't being scheduled. This is very helpful tool however it works only in real time according to the current entries in the metadb. Investigating past anomalies isn't supported.

Sometimes scheduling problems are "solved" on their own. It could be that pool is overcrowded or concurrency has been reached but eventually stress is reduced and tasks are scheduled, thus when you notice it and want to investigate why there was a delay to begin with your capabilities are limited as there could be many reasons.

The needed solution:
We should investigate the option to audit the failed-deps information or alternatively offer an easy way to export this information in real time to an external audit storage for later investigation.

Committer

I acknowledge that I am a maintainer/committer of the Apache Airflow project.

The text was updated successfully, but these errors were encountered:

tirkarthi · 2024-05-20T12:12:12Z

Recently I worked on this and the information is available as part of UI and API for tasks in scheduled or None state. Perhaps the API could be used for export and also enriched with additional checks that provide useful information.

Ref : #38449

eladkal · 2024-05-20T15:31:46Z

Recently I worked on this and the information is available as part of UI and API for tasks in scheduled or None state. Perhaps the API could be used for export and also enriched with additional checks that provide useful information.

Ref : #38449

The UI part is exposing failed-deps as is. It doesnt have the mechanism to export/store the information.
There is also the question of export interval

eladkal added area:Scheduler Scheduler or dag parsing Issues kind:feature Feature Requests labels May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Audit task failed-deps for later investigation #39712

Audit task failed-deps for later investigation #39712

eladkal commented May 20, 2024

tirkarthi commented May 20, 2024

eladkal commented May 20, 2024 •

edited

Audit task failed-deps for later investigation #39712

Audit task failed-deps for later investigation #39712

Comments

eladkal commented May 20, 2024

Body

Committer

tirkarthi commented May 20, 2024

eladkal commented May 20, 2024 • edited

eladkal commented May 20, 2024 •

edited