Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Download format is .tar instead of .tar.gz #2

Open
HaritYadav opened this issue Jan 12, 2021 · 4 comments
Open

Download format is .tar instead of .tar.gz #2

HaritYadav opened this issue Jan 12, 2021 · 4 comments

Comments

@HaritYadav
Copy link

Hi

When I downloaded the dataset, I got a .tar file. Had to look around before finding the git and renaming it to .tar.gz to make it work.

Please correct this or mention this on site.

Thnx

@phirework
Copy link
Collaborator

I just checked and the download links being generated on the https://commonvoice.mozilla.org/datasets is .tar.gz, and all the files on our S3 server are .tar.gz files. Can you paste the link that you had that resulted in a .tar file?

@Gorodecki
Copy link

Hi.
The link points to ru.tar.gz. And it is saved as indicated in the screenshot.

https://mozilla-common-voice-datasets.s3.dualstack.us-west-2.amazonaws.com/cv-corpus-6.1-2020-12-11/ru.tar.gz?.........
screenshot_2021-03-02

screenshot_2021-03-02_111

@ks-sav
Copy link

ks-sav commented Mar 17, 2021

I join the request. I racked my brain for two days before I found this question. It seems that the problem is only with the Russian dataset

@phirework
Copy link
Collaborator

Hi - I just downloaded the Russian dataset to the live site and it links to .tar.gz and saves as .tar.gz. I suspect how it's saving is specific to your operating system - can you provide some extra detail about what OS you're using?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants