biotab.manager - Manage TCGA clinical tables

biotab.manager is an R package that lets you download, manage, subset, and aggregate TCGA patient clinical data (biotabs) from the GDC portal. The package builds upon TCGAbiolinks to query TCGA databases and uses the R package data.table to handle query results.

Author: PAGEAUD Y.¹
¹ DKFZ - Division of Applied Bioinformatics, Germany.

How to cite: Pageaud Y. et al., biotab.manager - Manage TCGA clinical tables.

Content

Currently the package biotab.manager contains 8 functions:

  • dt.develop() - Develops a data.table row if it contains information for more than 1 sample.
  • get.ls.TCGA.biotab() - Returns a list of biotabs, one biotab per TCGA project.
  • get.TCGA.biotab() - Returns a biotab for a given TCGA project ID.
  • get.TCGA.clinical() - Collects patient clinical data from specific TCGA projects.
  • ICGC.to.TCGA.project() - Converts a TCGA project ID into an ICGC cohort, and vice versa.
  • multi.subset.biotab() - Subsets clinical biotabs in multiple steps, following a type of data and its manifest.
  • subset.clinical.biotab() - Subsets clinical biotabs following a type of data and its manifest.
  • TCGA.clinical.as.dt() - Subsets clinical data from the collected patient clinical data and returns it as a data.table.
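
To give an idea of what "developing" a row can mean, here is a minimal sketch in plain data.table. It is not the implementation of dt.develop(): the column names, the example values, and the ";" separator are all assumptions made for illustration only.

```r
library(data.table)

# Hypothetical table: patient P1 carries two samples in a single row.
# The ";" separator is an assumption, not the package's convention.
dt <- data.table(
  patient = c("P1", "P2"),
  sample  = c("S1;S2", "S3")
)

# Split the multi-sample field and emit one row per sample,
# repeating the patient ID for each resulting row.
developed <- dt[
  , .(sample = unlist(strsplit(sample, ";", fixed = TRUE))),
  by = patient
]

print(developed)
```

After this step, every row describes exactly one sample, which makes later subsetting and merging by sample ID straightforward.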

Prerequisites

Install Bioconductor dependencies

In R do:

if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("TCGAbiolinks")

Install CRAN dependencies

inst.pkgs = c('data.table', 'simsalapar')
install.packages(inst.pkgs)
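
To confirm that the dependencies listed above are available before installing biotab.manager, a quick check along these lines may help. This is only a sketch: the variable names are illustrative, and it only tests whether each package can be loaded.

```r
# Packages named in the prerequisites above.
required <- c("TCGAbiolinks", "data.table", "simsalapar")

# requireNamespace() returns TRUE when a package can be loaded, FALSE otherwise.
installed <- vapply(required, requireNamespace, logical(1), quietly = TRUE)

# Report anything that still needs to be installed.
missing_pkgs <- required[!installed]
if (length(missing_pkgs) > 0) {
  message("Still missing: ", paste(missing_pkgs, collapse = ", "))
} else {
  message("All dependencies are available.")
}
```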

Installing

  1. In the Git repository click on "Clone or Download".
  2. Copy the HTTPS link.
  3. Open a terminal and type:
git clone https://github.com/YoannPa/biotab.manager.git
  4. Open the folder biotab.manager/ and open the "biotab.manager.Rproj" file in RStudio.
  5. In the RStudio console, type (this requires the devtools package):
devtools::install()

Problems? / I need help!

For any question not related to bugs or development, please check the section "Known Issues" below. If the issue you experience is not addressed there, you can write to me at y.pageaud@dkfz.de.

Known Issues

No issues reported yet for the package.

Technical questions / Development / Feature request

If you encounter issues, or if a feature you expect is not available in a biotab.manager function, please check whether an existing issue in the GitHub repository's issue tracker addresses your point. If not, create a new issue there.

References

  1. Colaprico, A. et al. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Research 44, e71–e71 (2016).
  2. The Cancer Genome Atlas Research Network et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nature Genetics 45, 1113–1120 (2013).
  3. Grossman, R. L. et al. Toward a Shared Vision for Cancer Genomic Data. New England Journal of Medicine 375, 1109–1112 (2016).
  4. Official R data.table GitHub repository: https://github.com/Rdatatable/data.table