`request()` retries fetch twice, which can cause timeouts that are twice as long as requested #191

dhalbert · 2024-04-26T18:08:39Z

request() tries up to twice to get a socket, connect, and fetch:

Adafruit_CircuitPython_Requests/adafruit_requests.py

Lines 513 to 518 in 12e6b47

    
    # We may fail to send the request if the socket we got is closed already. So, try a second
    # time in that case.
    retry_count = 0
    last_exc = None
    while retry_count < 2:
        retry_count += 1

This is confusing because, if it tries a second time, the default or user-specified timeout (see adafruit/circuitpython#9210) can appear to take twice as long as requested.
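To make the arithmetic concrete, here is a minimal sketch of how a retry loop that gives every attempt the full timeout multiplies the worst-case wait. It is not the library code: `FakeClock`, `connect_that_times_out`, `request_with_retries`, and `elapsed_on_failure` are hypothetical names, and the clock is simulated so the example runs instantly.

```python
class FakeClock:
    """Deterministic stand-in for time.monotonic(), so nothing really sleeps."""

    def __init__(self):
        self.now = 0.0

    def monotonic(self):
        return self.now

    def sleep(self, seconds):
        self.now += seconds


def connect_that_times_out(clock, timeout):
    """Hypothetical connect: blocks for the whole timeout, then fails."""
    clock.sleep(timeout)
    raise OSError("simulated connect timeout")


def request_with_retries(clock, timeout, max_attempts=2):
    """Retry loop in which each attempt is given the full timeout again."""
    last_exc = None
    for _ in range(max_attempts):
        try:
            return connect_that_times_out(clock, timeout)
        except OSError as exc:
            last_exc = exc
    raise last_exc


def elapsed_on_failure(timeout, max_attempts=2):
    """Return the simulated seconds spent before the caller sees the error."""
    clock = FakeClock()
    start = clock.monotonic()
    try:
        request_with_retries(clock, timeout, max_attempts)
    except OSError:
        pass
    return clock.monotonic() - start


print(elapsed_on_failure(3))                  # 6.0
print(elapsed_on_failure(3, max_attempts=1))  # 3.0
```

With two attempts, a caller who asked for 3 seconds waits 6 before seeing the exception; with one attempt the wait matches the request.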

I am not sure about the circumstances the comment is talking about:

Adafruit_CircuitPython_Requests/adafruit_requests.py

Line 513 in 12e6b47

    
    # We may fail to send the request if the socket we got is closed already. So, try a second

and whether it's true for non-ESP32-SPI, etc. I wonder if some checking or resetting of the socket can be done in advance.


justmobilize · 2024-04-26T19:16:10Z

When I pulled out code for ConnectionManager, I tried to find a way to make this happen for real, but couldn't. Do you have a way to make it happen?

dhalbert · 2024-04-26T20:47:06Z

Here's a rather rough program I used (I originally used some personal domains that I knew succeeded or failed, but I tried again with the ones listed):

import os
import time
import wifi

import adafruit_connection_manager
import adafruit_requests

class Timer:
    def __init__(self, title):
        self.title = title

    def __enter__(self):
        self.start = time.monotonic()
        return self

    def __exit__(self, type, value, traceback):
        print(self.title+":", time.monotonic() - self.start)

# Get WiFi details, ensure these are setup in settings.toml
ssid = os.getenv("CIRCUITPY_WIFI_SSID")
password = os.getenv("CIRCUITPY_WIFI_PASSWORD")

SUCCEED_URL = "https://example.org"
FAIL_URL = "https://192.168.1.253"

pool = adafruit_connection_manager.get_radio_socketpool(wifi.radio)
ssl_context = adafruit_connection_manager.get_radio_ssl_context(wifi.radio)
requests = adafruit_requests.Session(pool, ssl_context)

wifi.radio.connect(ssid, password)

with Timer("default success"):
    response = requests.get(SUCCEED_URL)
try:
    with Timer("default fail"):
        response = requests.get(FAIL_URL)
except Exception as e:
    print(e)

with Timer("timeout success"):
    response = requests.get(SUCCEED_URL, timeout=2)
try:
    with Timer("timeout fail"):
        response = requests.get(FAIL_URL, timeout=3)
except Exception as e:
    print(e)

with output:

code.py output:
default success: 0.432861
default fail: 60.072
Error connecting socket: [Errno 119] EINPROGRESS
timeout success: 0.693115
timeout fail: 6.073
Error connecting socket: [Errno 116] ETIMEDOUT

Note that the last case, timeout fail, asked for a timeout of 3 seconds but took 6 seconds. Changing the timeout to some other value likewise yields double that value.

justmobilize · 2024-04-26T21:06:42Z

What MCU are you using?

On a UM FeatherS3, I get:

default success: 0.872002
default fail: 36.912
Error connecting socket: [Errno 113] ECONNABORTED
timeout success: 0.887009
timeout fail: 36.618
Error connecting socket: [Errno 113] ECONNABORTED

On a Feather ESP32-S2 with TFT, I get (I added print(f"error: {e}") because I was getting blanks...):

default success: 0.694092
default fail: 0.00390625
error:
timeout success: 0.704956
timeout fail: 0.00402832
error:

dhalbert · 2024-04-26T21:31:56Z

I was using a Metro ESP32-S3

dhalbert · 2024-04-26T21:37:51Z

My testing was with adafruit/circuitpython#9210, not yet merged as of this writing. I should test that on an S2, then.

justmobilize · 2024-04-26T22:08:59Z

What was odd was that the UM FeatherS3 seemed to ignore the timeouts. I remember some other boards did that in 8.x

justmobilize · 2024-04-27T04:57:35Z

@dhalbert tomorrow I'll try again on both my S2 and UMS3 with that build

justmobilize · 2024-04-27T16:48:22Z

@dhalbert, I added some logging. The double timeout in this case is in ConnectionManager. The flow in CM is that it tries once; if that fails and it has any sockets it can release, it closes them and tries again:

RQ retry_count: 0
CM try_count: 1, timeout: 60
CM self._socket_pool.socket: 0.000976563
CM socket.connect: 0.513184
default success: 0.737793

RQ retry_count: 0
CM try_count: 1, timeout: 60
CM self._socket_pool.socket: 0.0678711
CM socket.connect: 18.1948
CM connect OSError: [Errno 119] EINPROGRESS
CM try_count: 2, timeout: 60
CM self._socket_pool.socket: 0.000976563
CM socket.connect: 18.4888
CM connect OSError: [Errno 119] EINPROGRESS
default fail: 36.7739
error: Error connecting socket: [Errno 119] EINPROGRESS

RQ retry_count: 0
CM try_count: 1, timeout: 2
CM self._socket_pool.socket: 0.000976563
CM socket.connect: 0.580078
timeout success: 0.808105

RQ retry_count: 0
CM try_count: 1, timeout: 3
CM self._socket_pool.socket: 0.0678711
CM socket.connect: 3.10693
CM connect OSError: [Errno 116] ETIMEDOUT
CM try_count: 2, timeout: 3
CM self._socket_pool.socket: 0.0
CM socket.connect: 3.10596
CM connect OSError: [Errno 116] ETIMEDOUT
timeout fail: 6.30176
error: Error connecting socket: [Errno 116] ETIMEDOUT

If I just do:

try:
    with Timer("timeout fail"):
        response = requests.get(FAIL_URL, timeout=3)
except Exception as e:
    print(f"error: {e}")

I get:

RQ retry_count: 0
CM try_count: 1, timeout: 3
CM self._socket_pool.socket: 0.000976563
CM socket.connect: 3.10596
CM connect OSError: [Errno 116] ETIMEDOUT
CM try_count: 2, timeout: 3
timeout fail: 3.11914
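The difference between the two runs can be sketched as follows. This is a simplified model of the flow described above, not the actual ConnectionManager code: `FakePool`, `get_socket`, and `failed_request_cost` are hypothetical names, and time is simulated by adding the timeout to a counter.

```python
class FakePool:
    """Hypothetical pool whose connect always fails after `timeout` simulated seconds."""

    def __init__(self, freeable_sockets):
        self.freeable_sockets = freeable_sockets
        self.elapsed = 0.0
        self.connect_calls = 0

    def connect(self, timeout):
        self.connect_calls += 1
        self.elapsed += timeout
        raise OSError("simulated connect timeout")

    def free_sockets(self):
        """Close sockets that can be released and return how many were closed."""
        released = self.freeable_sockets
        self.freeable_sockets = 0
        return released


def get_socket(pool, timeout):
    """Try once; retry only if closing other sockets freed something."""
    try:
        return pool.connect(timeout)
    except OSError:
        if pool.free_sockets() == 0:
            raise  # nothing was released, so a second try cannot help
    return pool.connect(timeout)


def failed_request_cost(freeable_sockets, timeout=3):
    """Return (connect attempts, simulated seconds) for a request that fails."""
    pool = FakePool(freeable_sockets)
    try:
        get_socket(pool, timeout)
    except OSError:
        pass
    return pool.connect_calls, pool.elapsed


print(failed_request_cost(1))  # (2, 6.0)
print(failed_request_cost(0))  # (1, 3.0)
```

Under these assumptions, a failing request that follows earlier requests (which left a socket that can be released) pays for two connect timeouts, while the same request run on its own pays for one.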

What we could do in CM is look at specific errors like:

errno.ECONNABORTED
errno.ETIMEDOUT
errno.ECONNREFUSED
errno.EINPROGRESS

And stop trying.

Thoughts? If you help me decide which ones should not trigger a retry, I can open a PR in CM.
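A rough sketch of that idea, using the four errno values listed above as the "stop trying" set. `NO_RETRY_ERRNOS` and `should_retry` are hypothetical names, and the choice of which errors belong in the set is exactly the open question in this thread. The example uses the CPython `errno` module; whether every constant is available on a given CircuitPython build is an assumption to verify.

```python
import errno

# Errors after which a second connect attempt is assumed to be pointless.
# The membership of this set is a proposal, not settled behavior.
NO_RETRY_ERRNOS = frozenset(
    {
        errno.ECONNABORTED,
        errno.ETIMEDOUT,
        errno.ECONNREFUSED,
        errno.EINPROGRESS,
    }
)


def should_retry(exc):
    """Return True only for OSErrors whose errno is not in the no-retry set."""
    return isinstance(exc, OSError) and exc.errno not in NO_RETRY_ERRNOS


# A timeout should not be retried; a connection reset still may be.
print(should_retry(OSError(errno.ETIMEDOUT, "timed out")))    # False
print(should_retry(OSError(errno.ECONNRESET, "conn reset")))  # True
```

Comparing against the named constants rather than raw numbers matters here, because the numeric values in the logs (for example 116 for ETIMEDOUT) differ between platforms.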

dhalbert · 2024-04-27T20:12:43Z

Ah, OK, so it is not the retry loop in requests at all, you're saying? So I was mistaken. Does it not make sense to close a socket before trying, because we'd like to reuse a socket that has an existing open connection to a particular host?

I think it would be worth looking at the logic in CPython requests and lower down, if necessary, because they don't have double timeouts. I am not more experienced with socket programming than you.

justmobilize · 2024-04-28T01:23:40Z

I totally want to make this better. I trimmed what I could from the previous request code. It used to try up to 5 times...

I think making both requests and connection manager smarter is a good idea. I left this block in because I couldn't find a way to reproduce it, so I didn't want to take it out. Part of me wants to take it out and then fix it when it breaks for someone...

I'll try some other things and see if I can get it to try the second time

justmobilize mentioned this issue Apr 27, 2024

Question regarding get_socket error reporting adafruit/Adafruit_CircuitPython_ConnectionManager#14

Closed

dhalbert mentioned this issue May 1, 2024

ESP32-S2: only one socket can be created adafruit/circuitpython#9219

Closed
