New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
byconity server getting stuck when multiple servers are used #1153
Comments
@dmthuc does |
Hi @dogauzuncukoglu , thank you for reporting the issue. The setting |
I think the solution can be improve further after discuss with my colleagues but most of them are on leave now. @Andygogo15 , you can take a look when you have time. |
Hi @dogauzuncukoglu , I think the correct solution for your case is to send insert request directly to worker. It will reduce to load in server. When send insert request to worker you need to use the settings |
@dmthuc thank you very much for the informative answer. It helps a lot. To give more context about the issue, we were previously running into an issue with materialized views documented here: #774 To work around that issue we were manually selecting materialized columns in the materialized views. This works when we send insert request to server but it gives an error when same request sent to write worker directly.
for context this would be the minimal setting basically.
Assume you use in this case if you send write request to server it works. But if you send it directly to write worker it gives the error I copy/pasted above Example insert request
|
Hi @dogauzuncukoglu , byconity servers maintains the |
@dmthuc thanks for the suggestion. It makes sense to me let me try and see what happens. |
Bug Report
Briefly describe the bug
We have observed this bug is happening when using multiple servers. One server stops responding and goes into deadlock like state. When this happens there will be intermittent errors with requests. Server eventually logs below errors.
here is all the logs from the server when error happened.
byconity_server_error.log
After around ~1h later we observed the above
Got exception while starting thread for connection. Error code: 0, message: 'No thread available'
logs and server started working again.The result you expected
If a server becomes unavailable for any reason some tables are affected which means multiple servers do not make the system HA. It will just limit the surface area to specific tasks the server was handling. We found this setting
server_write_ha
but not sure whether setting it to true would solve our issue, or where should we set it if we are deploying to kubernetes via helm?How to Reproduce
Sending a lot of insert requests in parallel via http requests to server seems to put them into this state.
Version
cc4e467
The text was updated successfully, but these errors were encountered: