Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cloud Runner, K8s - "RPC Failure" (inconsistent, instability) #526

Open
frostebite opened this issue Mar 27, 2023 · 0 comments
Open

Cloud Runner, K8s - "RPC Failure" (inconsistent, instability) #526

frostebite opened this issue Mar 27, 2023 · 0 comments
Assignees
Labels
bug Something isn't working cloud-runner

Comments

@frostebite
Copy link
Member

Bug description

Log stream while running kubernetes job is occasionally interrupted, the job seems to be deleted, and the message "RPC Failure" is shown.

How to reproduce

Inconsistent, use cloud runner k8s for long running jobs. The end to end tests seem to produce this slightly more often.

This may relate to Google Kubernetes Engine - "AutoPilot" for kubernetes that I use to host the test environment.

Expected behavior

Log streams should be as reliable as possible.

Additional details

Remediation would include recommending async mode for inconsistent log stream platforms. The log would have to be stored within the job and uploaded for this route as we must parse the output to properly confirm if a job passes.

@frostebite frostebite added bug Something isn't working cloud-runner labels Mar 27, 2023
@frostebite frostebite self-assigned this Mar 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working cloud-runner
Projects
None yet
Development

No branches or pull requests

1 participant