Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

404 error when creating data #79

Open
Yuki-Nagato opened this issue Nov 30, 2021 · 0 comments
Open

404 error when creating data #79

Yuki-Nagato opened this issue Nov 30, 2021 · 0 comments

Comments

@Yuki-Nagato
Copy link

It seems that pushshift.io changed the format of submission data to .zst in August. Old .xz format was deleted. So when I was running SIZE=full make -j 8 in reddit_extractor directory, an error occurred.

The log is following.

DialoGPT/reddit_extractor$ cat logs/RS_v2_2005-12.xz.log
--2021-11-30 15:47:24--  https://files.pushshift.io/reddit/submissions/RS_v2_2005-12.xz
Resolving files.pushshift.io (files.pushshift.io)... 104.21.55.251, 172.67.174.211, 2606:4700:3033::6815:37fb, ...
Connecting to files.pushshift.io (files.pushshift.io)|104.21.55.251|:443... connected.
HTTP request sent, awaiting response... 404 Not Found
2021-11-30 15:47:26 ERROR 404: Not Found.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant