Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lesson 10 notebooks: bunzip throws an error when unzipping .bz2 files #39

Open
jcatanza opened this issue Feb 22, 2020 · 1 comment
Open

Comments

@jcatanza
Copy link
Contributor

jcatanza commented Feb 22, 2020

On a Windows 10 64-bit machine:

bunzip throws "EOFError: Compressed file ended before the end-of-stream marker was reached" when processing these files:
viwiki-latest-pages-articles.xml.bz2I
trwiki-latest-pages-articles.xml.bz2

Attaching a screenshot:
bunzip_error

Windows version of 7-zip throws a similar error

Note 1: A valid .xml format file is still saved.

Note 2: The problem was resolved when I downloaded the files directly from https://archive.org/details/wikipediadumps

@jcatanza jcatanza changed the title bunzip throws errors when unzipping .bz2 files Lesson 10 notebooks: bunzip throws an error when unzipping .bz2 files Feb 22, 2020
@alirezadigi
Copy link

somehow same error :(

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants