Refactor stop_host method to use topology #292

petroav · 2016-12-07T23:19:50Z

Use requests instead of urllib because it handles connection reset errors better.
Use upload_topology to mimic behavior of stop_host.

rschlussel-zz · 2016-12-08T14:13:07Z

tests/product/test_configuration.py

                    'workers': good_hosts}
        self.upload_topology(topology)
-        self.cluster.stop_host(bad_host)
+        self.cluster.stop_host(self.cluster.slaves[0])


you've already put the bad host name in the topology, so stopping the host should really do anything. Maybe we don't need stop_host at all, just upload bad worker/coordinator topologies all around. In any case it should be one or the other.

rschlussel-zz · 2016-12-08T14:31:18Z

tests/base_cluster.py

+        # Change the topology to something that doesn't exist
+        ips = self.get_ip_address_dict()
+        down_hostname = self.get_down_hostname()
+        self.exec_cmd_on_host(


Why do we need all three of these sed commands? Doesn't the topology usually just contain the internal host name?

petroav · 2016-12-13T09:40:11Z

@rschlussel updated version that addresses your comments. I also added a commit that fixed the Connection reset errors I was seeing while building the offline installer.

rschlussel-zz · 2016-12-13T16:10:04Z

travis failures are real. It looks like you missed a few usages of stop_host

rschlussel-zz · 2016-12-14T20:22:34Z

looks good % fixing the test failures.

rschlussel-zz · 2016-12-15T18:27:18Z

Can you also revert the timeout increase (and make sure that this fixes the issue)?

- The requests library handles connection resets from the server better than urllib.

- Stopping a host is now done implicitly by uploading a topology that points to a host that doesn't exist. This was done because stopping a container caused nasty race conditions.

rschlussel-zz · 2017-03-03T18:51:44Z

@petroav what's the status of this?

petroav · 2017-03-03T20:44:25Z

@rschlussel I put this on pause since we were seeing all those UnixPoolConnectionError I thought I would wait until we fixed the intermittent errors before going forward with this. I just saw the Travis has actually been very stable for presto-admin so I might rebase this.

petroav assigned cawallin and rschlussel-zz Dec 7, 2016

rschlussel-zz reviewed Dec 8, 2016

View reviewed changes

petroav force-pushed the fix-intermittent branch from d5ada45 to 0131baf Compare December 13, 2016 09:30

petroav added 2 commits January 6, 2017 21:46

Use requests library to download pre-compiled wheels

eee320a

- The requests library handles connection resets from the server better than urllib.

Remove the stop_host method

0ab3512

- Stopping a host is now done implicitly by uploading a topology that points to a host that doesn't exist. This was done because stopping a container caused nasty race conditions.

petroav force-pushed the fix-intermittent branch from 46cd187 to 0ab3512 Compare January 6, 2017 20:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor stop_host method to use topology #292

Refactor stop_host method to use topology #292

petroav commented Dec 7, 2016 •

edited

rschlussel-zz Dec 8, 2016

rschlussel-zz Dec 8, 2016

petroav commented Dec 13, 2016

rschlussel-zz commented Dec 13, 2016

rschlussel-zz commented Dec 14, 2016

rschlussel-zz commented Dec 15, 2016

rschlussel-zz commented Mar 3, 2017

petroav commented Mar 3, 2017

Refactor stop_host method to use topology #292

Are you sure you want to change the base?

Refactor stop_host method to use topology #292

Conversation

petroav commented Dec 7, 2016 • edited

rschlussel-zz Dec 8, 2016

Choose a reason for hiding this comment

rschlussel-zz Dec 8, 2016

Choose a reason for hiding this comment

petroav commented Dec 13, 2016

rschlussel-zz commented Dec 13, 2016

rschlussel-zz commented Dec 14, 2016

rschlussel-zz commented Dec 15, 2016

rschlussel-zz commented Mar 3, 2017

petroav commented Mar 3, 2017

petroav commented Dec 7, 2016 •

edited