Get_tweets gives the same page #101

AnnOlChik · 2019-12-24T17:25:42Z

So, I checked get_tweets function with default pages arg 25 and different hashtags. And as output, I got just 25 same pages of tweets. I did this:

from twitter_scraper import get_tweets
for tweet in get_tweets('#brexit'):
    print(tweet['text'], tweet['time'])

When I define the number of pages, it doesn't really change anything.

The text was updated successfully, but these errors were encountered:

bisguzar · 2020-01-11T17:19:06Z

Yes, look like scrolling mechanism not working on hashtag pages.

>>> liste = [] 
... from twitter_scraper import get_tweets 
... for tweet in get_tweets('#mock', 3): 
...     liste.append(tweet['tweetId'])                                                                                                                                                                               

>>> liste.count(liste[0])                                                                                                                                                                                            
3 # first tweet's id repeats 3 times in our scraped tweets


>>> liste = [] 
... from twitter_scraper import get_tweets 
... for tweet in get_tweets('bugraisguzar', 3): 
...     liste.append(tweet['tweetId'])                                                                                                                                                                               

>>> liste.count(liste[0])                                                                                                                                                                                            
1 # first tweet's id is uniq as it should be

We will work on it, help needed

seaona · 2020-05-29T10:43:03Z

Got the same error. Does it have to do with the way it's queried and the resulting URL? maybe the pagination works different from User profiles to Hashtags?
Any thoughts? would be awesome to make it work!
Ps. by the way, great repo!

bisguzar · 2020-05-30T21:38:49Z

First at all, thanks. Looks like "max_position" parameter not supported for hashtag pages. I couldn't search it yet. Helps are welcome!

anushkmittal · 2020-06-15T00:19:30Z

did anyone have any progress? stuck at the same issue :/

xeliot · 2020-06-27T22:57:11Z

Finally got it to work! See changes in #150

Brief summary of how I fixed it by modifying tweets.py:

Change r = session.get(url, headers=headers) to r = session.get(url+'&max_position', headers=headers). This allows response json to return min_position parameter which will then be used as the max_position parameter in the next session.get

r_json = r.json()

Change r = session.get(url, params={'max_position': last_tweet}, headers=headers) to r = session.get(url, params={'max_position': r_json['min_position']}, headers=headers).

This gets rid of twitter pages repeating on search query.

BradKML · 2023-04-27T09:12:13Z

Would this pagination method in search applicable to digging all tweets from a user?

bisguzar pinned this issue Jan 11, 2020

bisguzar added the help wanted Extra attention is needed label Jan 11, 2020

bisguzar mentioned this issue Jun 30, 2020

fixed pagination bug not extracting tweets after first page #150

Merged

bisguzar closed this as completed in 8bab41b Jul 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get_tweets gives the same page #101

Get_tweets gives the same page #101

AnnOlChik commented Dec 24, 2019 •

edited

bisguzar commented Jan 11, 2020

seaona commented May 29, 2020 •

edited

bisguzar commented May 30, 2020

anushkmittal commented Jun 15, 2020

xeliot commented Jun 27, 2020 •

edited

BradKML commented Apr 27, 2023

Get_tweets gives the same page #101

Get_tweets gives the same page #101

Comments

AnnOlChik commented Dec 24, 2019 • edited

bisguzar commented Jan 11, 2020

seaona commented May 29, 2020 • edited

bisguzar commented May 30, 2020

anushkmittal commented Jun 15, 2020

xeliot commented Jun 27, 2020 • edited

BradKML commented Apr 27, 2023

AnnOlChik commented Dec 24, 2019 •

edited

seaona commented May 29, 2020 •

edited

xeliot commented Jun 27, 2020 •

edited