Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Watch dog #1114

Open
wks opened this issue Apr 12, 2024 · 2 comments
Open

Watch dog #1114

wks opened this issue Apr 12, 2024 · 2 comments
Labels
P-normal Priority: Normal.

Comments

@wks
Copy link
Collaborator

wks commented Apr 12, 2024

We have recently observed some bugs causing tests to hang while doing GC. For example

Given that a typical GC shouldn't take more than a few seconds, there should be some watch dog mechanism so that the process can panic and printing the stack trace of all threads.

Watch dog is also valuable for real-world applications, especially for mobile applications. If an application is unresponsive, the OS will try to restart it, or notify the user for further actions.

@k-sareen
Copy link
Collaborator

If a thread is waiting on a lock for more than X seconds/minutes in ART, it panics and dies. Perhaps we need something similar.

@udesou udesou added the P-normal Priority: Normal. label May 15, 2024
@wks
Copy link
Collaborator Author

wks commented May 17, 2024

https://github.com/mmtk/mmtk-jikesrvm/actions/runs/9124315467/job/25088273084?pr=172

In this test run, JikesRVM hung for 35 minutes without making progress while running lusearch with RFastAdaptiveMarkSweep. There is no indication if it hung during GC, but it is very likely.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P-normal Priority: Normal.
Projects
None yet
Development

No branches or pull requests

3 participants