Datasets #109

Open
baranaldemir opened this issue Dec 18, 2020 · 8 comments

@baranaldemir

baranaldemir commented Dec 18, 2020

I have 3 questions if you don't mind.

  1. The dataset at https://www.kaggle.com/tawsifurrahman/covid19-radiography-database doesn't have a metadata file right now. Could you please provide it?
  2. As far as I understand, the RSNA dataset has some pneumonia duplicates. I might be wrong, but I think you may not have noticed that the stage_2_train_labels.csv file contains duplicate patient IDs. Am I right?
  3. Are there any CSV files for the RSNA test set too?
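
For question 2, here is a minimal sketch of how repeated patient IDs could be counted. It assumes the ID column is named `patientId` and runs on a small made-up in-memory sample rather than the real stage_2_train_labels.csv; both the column name and the sample rows are assumptions for illustration. If the file stores one row per annotated bounding box, repeated IDs would be expected for patients with several boxes, so a repeated ID is not necessarily a duplicated image.

```python
import csv
import io
from collections import Counter


def repeated_ids(csv_file, id_column="patientId"):
    """Return {id: row_count} for every ID that appears on more than one row."""
    counts = Counter(row[id_column] for row in csv.DictReader(csv_file))
    return {pid: n for pid, n in counts.items() if n > 1}


# Made-up sample rows (hypothetical, not taken from the real file).
SAMPLE = io.StringIO(
    "patientId,x,y,width,height,Target\n"
    "p-001,,,,,0\n"
    "p-002,264,152,213,379,1\n"
    "p-002,562,152,256,453,1\n"
)

print(repeated_ids(SAMPLE))
```

To run it on the real file, one would pass an open file handle instead, for example `open("stage_2_train_labels.csv", newline="")`.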
@ZaraNaSha

Dear all,
I have the same problem as mentioned above and could not split the data with the Jupyter notebook.

@VedantWani

I see that https://www.kaggle.com/tawsifurrahman/covid19-radiography-database has been updated to version 3. The images have also changed with the new version, so there will be duplicates if the script for COVIDx5 is used.

I am not able to download version 1 of the source dataset, which this dataset uses, so the dataset scripts should be updated.

For those who are using version 2 or version 3 of the covid19-radiography-database, your results may vary.

@haydengunraj
Collaborator

Hi everyone, we've updated the scripts to address the changes in the data sources. As a result, previous versions of the dataset notebooks may not work correctly with the current versions of the various data sources. To fix this, you can modify the old notebooks to accommodate the changes, or use previous versions of the source datasets to ensure compatibility.

@VedantWani

Hi, I have two questions.

  1. The version of https://www.kaggle.com/tawsifurrahman/covid19-radiography-database that is currently available has different filenames and different images, sourced from various references. For example, COVID (1).png in the current version is a different image from COVID-19 (1).png in version 1 of the same dataset. This means COVIDx7 is not compatible with earlier versions, right?

  2. The metadata of the above-mentioned dataset contains slightly different URLs compared to version 1. Also, the script that creates the dataset fails to catch duplicate images shared between the Cohen dataset and the above Kaggle dataset (because the URLs do not match). Are there duplicate images?

PS: I have downloaded version 1 of https://www.kaggle.com/tawsifurrahman/covid19-radiography-database (I had to download the images manually, one at a time) and also created the complete dataset with the earlier script.
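
Regarding the possible duplicates raised in question 2: since the URLs and filenames no longer match, one option is to compare file contents instead. The following is a minimal sketch, not the approach used by the project's scripts. It groups image files by the SHA-256 hash of their bytes; the function names and the file patterns are hypothetical choices made for this illustration.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(*roots, patterns=("*.png", "*.jpg", "*.jpeg")):
    """Return groups of files (across all roots) whose bytes are identical."""
    by_hash = defaultdict(list)
    for root in roots:
        for pattern in patterns:
            for path in sorted(Path(root).rglob(pattern)):
                by_hash[file_digest(path)].append(path)
    return [paths for paths in by_hash.values() if len(paths) > 1]
```

It could be called as `find_duplicates("cohen_images", "kaggle_images")`, where both directory names are placeholders. This only catches byte-identical files; an image that was resized or re-encoded between sources would not be detected and would need a pixel-level or perceptual comparison.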

@GliozzoJ

GliozzoJ commented Apr 11, 2021

Hi all,

I have a question.
How can I download version 1 of the COVID-19 Radiography Database dataset?
It seems to be impossible with the Kaggle API, and I cannot download it from the webpage:

https://www.kaggle.com/tawsifurrahman/covid19-radiography-database/version/1

@VedantWani How did you download version 1 of the dataset? I couldn't download even a single image, since every time I get the message "404 We can't find that page".

@VedantWani

@GliozzoJ The only way I found to download the required COVID-19 images is through the data explorer: open the COVID-19 directory, click on an image, and once it opens, right-click it and choose "Save image as". Repeat this for each image.

@GliozzoJ

@VedantWani Thank you for your reply. Do you also know how to download the file COVID-19.metadata.xlsx?

@VedantWani

VedantWani commented Apr 11, 2021

@GliozzoJ COVID-19.metadata.xlsx
