-
Notifications
You must be signed in to change notification settings - Fork 262
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Orion intermittently exits with code 139 #3452
Comments
(I have cleaned up comments from #3326 to keep each issue focused on its topic) |
Exit code 139 means that the process ended due to signal 11 (SIGSEGV). It would be useful to have the backtrace in order to have more information. The typical procedure to get the backtrace is:
However, not sure how this can be achieved if Orion runs inside docker container... Some investigation needs to be conducted on this. Has you been able to reproduce the problem running Orion outside a docker container? This issue and the aforementioned #3326 make me wonder if the cause of the problem could be in Docker instead of Orion. Or, more precisely, in the way that Orion is running inside Docker. Maybe the resources available to Orion inside docker are limited (i.e. small amount of RAM) and that's causing the problem. |
I have always ran Orion as a service in Docker Compose inside a virtual machine, so I haven't reproduced the error. Maybe I should try it, but I presumed running using Docker was natural with FIWARE, much of the documentation is dedicated to using FIWARE with Docker and Docker Compose, and I would like to continue using them. I haven't tried enabling core generation within a Docker container, from quick Googling it could be possible: https://stackoverflow.com/questions/28335614/how-to-generate-core-file-in-docker-container#47694315. I'll likely have to time to look into this in the coming weeks. Today I changed the image used for Orion in Docker Compose from My current docker-compose.yml file:
The resources of the virtual machine should be enough, there is about 30 gigs of memory, and four 2,1gig cores. The OS is Ubuntu 18.04. But looking at the virtual machines logs I do see some troubling lines concerning segfaulting libmicrohttpd:
These segfaults might be a reason or a consequence of the problem our system is facing. This could move the cause of error away from Orion. There is little new information in this comment, but the Orion hasn't crashed for awhile now. |
Thank you for the detailed report. Please keep posting if some new finding arrives. |
After a long quarantine of more than 4 years :) I think this issue can be closed. If problem persist, please open a new issue with fresh updated information. |
In our FIWARE system consisting of Orion, STH-Comet, and Mongo, Orion worked well for some months, but then started dying every now and then with exit code 139. This case has been difficult to for me to debug, as I can't see no cause for this in the logs.
End of Orion's log up to untimely exit:
As I wrote, at this time a have no further information to give about this problem. I would appreciate any advice on how to proceed with getting this solved.
Earlier issue has some similarities, but while the exit code 139 is common with these issues, the reason for it seem to be different.
The text was updated successfully, but these errors were encountered: