Skip to content

This is a fast API requester for fetching data from the cancer imaging archive

License

Notifications You must be signed in to change notification settings

NeoMedSys/tcia-api-requester

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TAr

The Cancer Imaging Archive API Requester (tcia-api-req)


[active]

The cancer imaging archive can provide some useful public datasets to experiment with e.g. deep learning for medical diagnostics. The downside is that the data is hard to obtain if you are on a linux machine or cumbersome if you are using the NMBIA desktop application. This is why the tcia-api-requester (TAr) was developed. Use this wrapper to fetch any publically available dataset that includes the modalities MR, CT and PET.

0. Requirements

  • linux
  • poetry
  • py39

1. Getting started

1.1 Start by installing the dependencies from the toml file. Use poetry for this.

$ poetry install

1.2 Run the main python script (you might have to set the PYTHONPATH)

$ poetry run python api_request_controller.py

Use the command line arguments to request various datasets and to speed up the process

$ poetry run python api_request_controller.py -d <dataset-name> --use_cpu_count True --thread_num <n_threads> --workers <n_workers>

The maximum workers will be set to the number of cores times the user specified amount of workers if the use_cpu_count flag is true.

1.3 Data will be stored in the following order:

patient_uid
   study_uid
       series_uid
           series_description_uid
                image.dcm

Powered by 20200907_104224

About

This is a fast API requester for fetching data from the cancer imaging archive

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages