Reduction of escalus crashes when things go wrong & Timeout parameter accepted #43

philwitty · 2014-10-16T16:29:41Z

See commit messages for changes

Justification:
As soon as target systems started timing out or did not exist, we were generating error logs with hundreds of thousands of entries, because we had a few thousand concurrent escalus connections running. We wanted to record these issues through the onX functions passed as parameters instead. I have tried to change things so that every failure while setting up the connection is caught under the connection step and returned as a failure of that step. Inside the clients themselves, I tried to minimize the chance of crashes (mostly badmatches) and to report what we can back to the owner. I freely admit that some of the decisions are a bit hacky and not very nice, but I am open to ideas on a better way to do this. We just needed to avoid crashes as much as possible in normal operation (which involves things not going to plan!).

The timeout parameter is self-explanatory and should cause no functional changes if it is not set.

* Catch all errors on connection steps
* Exit nicely if transport:init() fails
* On parse error send an error message to owner rather than crash
* BOSH: Don't crash on timeout reply to request
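
The "catch all errors on connection steps" change listed above could look roughly like the sketch below. This is only an illustration of the idea, not the code in this PR: the module name `safe_connect_sketch`, the function `run_step/3`, the `OnError` callback and the `connection_step_failed` tag are all hypothetical names made up for the example.

```erlang
-module(safe_connect_sketch).
-export([run_step/3]).

%% Hypothetical helper (not part of escalus): runs a single connection
%% step and turns any crash into an {error, ...} tuple instead of
%% letting the calling process die. OnError stands in for one of the
%% onX callbacks mentioned above; its shape here is an assumption.
run_step(StepFun, State, OnError)
  when is_function(StepFun, 1), is_function(OnError, 1) ->
    try StepFun(State) of
        {ok, NewState} ->
            {ok, NewState};
        {error, StepError} ->
            %% The step failed in a controlled way: report and pass it on.
            OnError(StepError),
            {error, StepError};
        Other ->
            %% Unexpected return value: treat it as a failed step too.
            OnError({unexpected_result, Other}),
            {error, {unexpected_result, Other}}
    catch
        Class:Reason ->
            %% badmatch, exit, throw: everything ends up here.
            OnError({Class, Reason}),
            {error, {connection_step_failed, {Class, Reason}}}
    end.
```

With a wrapper of this kind, a step that hits a badmatch comes back as an `{error, ...}` value that the caller can record once, rather than as a crash report per connection.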

michalwski · 2014-11-18T07:56:51Z

Could you create an empty PR for MongooseIM and ejabberd_tests using your escalus branch?
Here is some doc regarding test branch discovery.

erszcz · 2018-01-04T15:36:57Z

src/escalus_bosh.erl

@@ -240,27 +241,33 @@ init([Args, Owner]) ->
    Port = proplists:get_value(port, Args, 5280),
    Path = proplists:get_value(path, Args, <<"/http-bind">>),
    Wait = proplists:get_value(bosh_wait, Args, ?DEFAULT_WAIT),
+    Timeout = proplists:get_value(timeout, Args, infinity),


infinity is never a good timeout value. We should approximate infinity with a long, but still finite, number here.
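
A minimal sketch of that suggestion, assuming a finite default is acceptable for every transport: the macro `DEFAULT_TIMEOUT`, its value of 60 seconds and the module `timeout_sketch` are assumptions made for illustration and do not come from the PR.

```erlang
-module(timeout_sketch).
-export([timeout/1]).

%% Hypothetical default: long, but finite (60 s, in milliseconds).
-define(DEFAULT_TIMEOUT, 60000).

%% Reads the timeout option from the argument proplist, falling back
%% to the finite default instead of infinity.
timeout(Args) ->
    proplists:get_value(timeout, Args, ?DEFAULT_TIMEOUT).
```

Callers that really need a longer wait can still pass `{timeout, Value}` explicitly; only the fallback changes.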

philwitty added 2 commits October 22, 2014 15:48

Reduce number of uncaught crashes when things go wrong

031e8fb

* Catch all errors on connection steps
* Exit nicely if transport:init() fails
* On parse error send an error message to owner rather than crash
* BOSH: Don't crash on timeout reply to request

Allow passing of timeout to bosh and tcp

20cd2a7

philwitty force-pushed the more_xmpp_errors branch from 3ad2630 to 20cd2a7 Compare October 22, 2014 14:50

erszcz reviewed Jan 4, 2018

fenek added the WIP label Jun 13, 2019