KubernetesPodOperator with multiple containers hangs if container other than base container is still running #39693

jonathan-ostrander · 2024-05-17T16:53:33Z

Apache Airflow version

main (development)

If "Other Airflow 2 version" selected, which one?

No response

What happened?

A KubernetesPodOperator with the following full_pod_spec:

apiVersion: v1
kind: Pod
metadata:
  name: multi-container-pod
spec:
  restartPolicy: Never
  containers:
  - name: base
    image: busybox
    command: ["sh", "-c", "echo base will exit after 30 seconds; sleep 30"]
  - name: sidecar
    image: busybox
    command: ["sh", "-c", "echo sidecar running indefinitely; while true; do sleep 3600; done"]

will not mark the task as successful after 30 seconds, because the sidecar keeps running after the base container has succeeded. The pod_manager gets stuck waiting for pod completion: the if statement that guards that wait returns False when istio is not enabled on the pod, so the wait never ends.
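To illustrate why the wait never ends, here is a minimal sketch of how the pod phase is derived from container states when restartPolicy is Never. It is a simplified model of the Kubernetes pod lifecycle, not Airflow or provider code. The function name `pod_phase` and the state strings are assumptions made for illustration, and the Pending phase is ignored.

```python
from typing import Dict


def pod_phase(container_states: Dict[str, str]) -> str:
    """Simplified pod phase for a pod with restartPolicy: Never.

    container_states maps a container name to one of the illustrative
    strings "running", "succeeded", or "failed".
    """
    states = list(container_states.values())
    # The pod stays Running while any container is still running.
    if any(state == "running" for state in states):
        return "Running"
    # Every container has terminated; the pod succeeds only if all did.
    if all(state == "succeeded" for state in states):
        return "Succeeded"
    return "Failed"


# State of the pod from this report once the 30 second sleep has finished.
after_30s = {"base": "succeeded", "sidecar": "running"}
print(pod_phase(after_30s))  # Running
```

Because the phase never leaves Running while the sidecar is alive, any logic that waits for the whole pod to finish will wait indefinitely.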

What you think should happen instead?

The pod should be considered complete when the base container succeeds regardless of whether or not any other containers on the pod are still running.
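A rough sketch of the rule I have in mind, written against the JSON form of a pod's status (`status.containerStatuses[].state`). This is not the provider's current code. The helper names `base_container_state` and `base_container_completed` are hypothetical, and the default container name `base` is taken from the spec above.

```python
from typing import Any, Dict


def base_container_state(pod: Dict[str, Any], base_name: str = "base") -> Dict[str, Any]:
    """Return the state dict of the named container, or {} if it is absent."""
    statuses = pod.get("status", {}).get("containerStatuses", [])
    for status in statuses:
        if status.get("name") == base_name:
            return status.get("state", {})
    return {}


def base_container_completed(pod: Dict[str, Any], base_name: str = "base") -> bool:
    """True once the named container has terminated, whatever the others do."""
    return base_container_state(pod, base_name).get("terminated") is not None


# Pod from this report after the base container has exited.
pod = {
    "status": {
        "phase": "Running",
        "containerStatuses": [
            {"name": "base", "state": {"terminated": {"exitCode": 0}}},
            {"name": "sidecar", "state": {"running": {}}},
        ],
    }
}
print(base_container_completed(pod))  # True
```

With a check like this the operator could stop waiting as soon as the base container terminates, then read its exit code to decide between success and failure.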

How to reproduce

Create a KubernetesPodOperator with the full_pod_spec provided.

Operating System

MacOS 14.4.1

Versions of Apache Airflow Providers

apache-airflow-providers-cncf-kubernetes==8.0.1

Deployment

Other

Deployment details

Kubernetes on Google Kubernetes Engine. Kubernetes executors and worker pods all run on the same cluster.

Anything else?

No response

Are you willing to submit PR?

Yes I am willing to submit a PR!

Code of Conduct

I agree to follow this project's Code of Conduct


boring-cyborg · 2024-05-17T16:53:36Z

Thanks for opening your first issue here! Be sure to follow the issue template! If you are willing to raise a PR to address this issue, please do so; there is no need to wait for approval.

Taragolis · 2024-05-17T17:45:57Z

Feel free to fix this behaviour.

Fixes apache#39693. Special logic was added to `KubernetesPodOperator`'s lifecycle to handle the case where an istio proxy sidecar is running and preventing the pod from completing, but this logic should have been applied more generally to handle any pod that runs multiple containers. The new behavior considers the pod completed once the container matching the base name has completed.

dirrao · 2024-05-19T05:40:16Z

related: #39625

jonathan-ostrander added the area:core, kind:bug, and needs-triage labels May 17, 2024

Taragolis added the area:providers and provider:cncf-kubernetes labels and removed the area:core and needs-triage labels May 17, 2024

Taragolis assigned jonathan-ostrander May 17, 2024

jonathan-ostrander linked a pull request May 17, 2024 that will close this issue

fix: delete pod when base container completes #39694

Open
