CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is consistently_failing #45402

Closed
can-anyscale opened this issue May 17, 2024 · 38 comments
Labels
- bug: Something that is supposed to be working; but isn't
- ci-test
- flaky-tracker: Issue created via Flaky Test Tracker https://flaky-tests.ray.io/
- ray-test-bot: Issues managed by OSS test policy
- rllib: RLlib related issues
- stability
- triage: Needs triage (eg: priority, bug/not-bug, and owning component)

Comments

@can-anyscale
Collaborator

CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is consistently_failing. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/4505#018f8519-116f-487c-9ce5-fc3ca791106e
- https://buildkite.com/ray-project/postmerge/builds/4505#018f84f9-fb09-4ab5-bdc1-160a45fcec68

DataCaseName-linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack-END
Managed by OSS Test Policy

@can-anyscale added labels on May 17, 2024: bug, ci-test, flaky-tracker, ray-test-bot, rllib, stability, triage, weekly-release-blocker (Issues that will be blocking Ray weekly releases)
@can-anyscale changed the title from "CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is consistently_failing" to "CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is flaky" on May 17, 2024
@can-anyscale reopened this on May 17, 2024
@can-anyscale
Collaborator Author

CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/4505#018f8519-116f-487c-9ce5-fc3ca791106e
- https://buildkite.com/ray-project/postmerge/builds/4505#018f84f9-fb09-4ab5-bdc1-160a45fcec68

DataCaseName-linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

Passing now.

@can-anyscale reopened this on May 17, 2024
@can-anyscale
Collaborator Author

Probably not a release blocker. Removing the tag temporarily to generate a green build; will confirm on Monday.

@can-anyscale added the release-blocker (P0 Issue that blocks the release) label and removed the weekly-release-blocker (Issues that will be blocking Ray weekly releases) label on May 18, 2024
@can-anyscale
Collaborator Author

CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/4520#018f8a1c-003b-4696-8abf-d23042cb6a27
- https://buildkite.com/ray-project/postmerge/builds/4505#018f8519-116f-487c-9ce5-fc3ca791106e
- https://buildkite.com/ray-project/postmerge/builds/4505#018f84f9-fb09-4ab5-bdc1-160a45fcec68

DataCaseName-linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack-END
Managed by OSS Test Policy

@can-anyscale reopened this on May 28, 2024
@can-anyscale added the weekly-release-blocker (Issues that will be blocking Ray weekly releases) label and removed the release-blocker (P0 Issue that blocks the release) label on May 28, 2024
@can-anyscale removed the weekly-release-blocker (Issues that will be blocking Ray weekly releases) label on May 29, 2024
@can-anyscale changed the title from "CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is flaky" to "CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is consistently_failing" on May 30, 2024
@can-anyscale reopened this on May 30, 2024
@can-anyscale
Collaborator Author

CI test linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is consistently_failing. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/4663#018fc80d-6e43-43a5-ac99-51f4e8725acb
- https://buildkite.com/ray-project/postmerge/builds/4663#018fc7e9-a7f1-4e92-9ed4-4caa98ae6d95
- https://buildkite.com/ray-project/postmerge/builds/4631#018fbffd-e9f9-4d60-9914-8a5b7c6992ff

DataCaseName-linux://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

This test is now considered flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.


2 participants