Optimize request intervals for full archive search with OAuth 2.0 Bearer Token for pagination.py #1923

TomatenMarc · 2022-07-17T13:27:14Z

This PR addresses the issues raised in #1688, #1907 and #1871: there is no wait time between requests to the Twitter API v2, so a full archive search exceeds the rate limit almost immediately.

The problem does not seem to be in https://github.com/tweepy/tweepy/blob/master/tweepy/client.py as suggested in #1871, but rather in https://github.com/tweepy/tweepy/blob/master/tweepy/pagination.py as described in #1688.

One workaround is for the user to wait one second after each request and the processing of its data, but this requires the user to know about the problem in the first place.
Moreover, the optimal time windows of 1 request per second and 300 requests per 900 seconds (3 seconds per request, including processing) may not be met if a fixed 1 second is always added on top of the request and processing time.
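
To make the arithmetic concrete, here is a small sketch comparing the two pacing strategies over one window of 300 requests. The 400 ms request time is an assumed figure chosen only for illustration, and the function names are hypothetical, not part of Tweepy.

```python
WINDOW_MS = 900_000      # 900 seconds (15 minutes)
REQUESTS = 300           # requests allowed per window
MIN_INTERVAL_MS = 1_000  # at most 1 request per second

# Average time budget per request if the window is used evenly.
BUDGET_PER_REQUEST_MS = WINDOW_MS // REQUESTS


def fixed_sleep_total_ms(request_ms, n=REQUESTS):
    """Always sleep a full second after every request."""
    return n * (request_ms + MIN_INTERVAL_MS)


def fill_up_total_ms(request_ms, n=REQUESTS):
    """Sleep only for whatever remains of the second."""
    return n * max(MIN_INTERVAL_MS, request_ms)


print(BUDGET_PER_REQUEST_MS)      # 3000
print(fixed_sleep_total_ms(400))  # 420000
print(fill_up_total_ms(400))      # 300000
```

Under that assumption, the fixed sleep spends 120 seconds more per 300 requests than sleeping only for the remainder of each second.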

Mainly, though, I think the problem should be solved within Tweepy, since users assume that Tweepy implements Twitter's guidelines (at least I did, and it took me some time to find the problem).

Since the Twitter guidelines only require a limit of 1 request per second for the full archive search /2/tweets/search/all in combination with an OAuth 2.0 Bearer Token (cf. https://developer.twitter.com/en/docs/twitter-api/tweets/search/migrate), it seems appropriate to record the time at the beginning of __next__ in https://github.com/tweepy/tweepy/blob/master/tweepy/pagination.py and, after receiving the response, to sleep for whatever remains of that second.
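
As a rough illustration of the idea (not the actual diff, and not Tweepy's real paginator class), the pacing could look like the sketch below. PacedPages, fetch_page, and the injectable clock and sleep parameters are hypothetical names introduced here so the logic can be exercised without real waiting.

```python
import time


class PacedPages:
    """Hypothetical stand-in for a paginator; not Tweepy's actual class."""

    def __init__(self, fetch_page, min_interval=1.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.fetch_page = fetch_page      # callable: returns a page, or None when done
        self.min_interval = min_interval  # seconds between request starts
        self.clock = clock
        self.sleep = sleep

    def __iter__(self):
        return self

    def __next__(self):
        started = self.clock()            # measure from the beginning of __next__
        page = self.fetch_page()
        if page is None:
            raise StopIteration
        # Fill up the rest of the interval; skip sleeping if the
        # request already took longer than min_interval.
        remaining = self.min_interval - (self.clock() - started)
        if remaining > 0:
            self.sleep(remaining)
        return page


class FakeClock:
    """Test helper: a clock that only advances when told to."""

    def __init__(self):
        self.now = 0.0
        self.slept = []

    def __call__(self):
        return self.now

    def sleep(self, seconds):
        self.slept.append(seconds)
        self.now += seconds
```

With a request that takes 0.25 seconds, this sleeps 0.75 seconds before returning the page; with a request that takes 1.5 seconds, it does not sleep at all.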

Cheers :-)

vgewilliam · 2022-07-22T11:19:30Z

Hi dude, thanks for your contribution, it works for me.

TomatenMarc · 2022-11-02T18:19:52Z

@vgewilliam Is there anything else to take into account or can this be merged? :-)

Optimize request intervals for full archive search with OAuth 2.0 Bea…

e841fe1

…rer Token for pagination.py

Riahiamirreza approved these changes Aug 23, 2022

TomatenMarc closed this by deleting the head repository Aug 23, 2023
