Skip to content

broadinstitute/protigy

Repository files navigation

Drawing

This Shiny app facilitates exploratory and interactive analysis of quantitative omics datasets. Primarily developed for mass spectrometry-based proteomics data ProTIGY supports any dataset that can be arranged as p x n matrix with p being features (e.g. proteins, transcripts, genes, ...) quantitatively measured across n samples (e.g. replicates of different experimental conditions, ... ). Examples of other datasets other than proteomics include datasets derived from RNA-seq or gene expression microarrays.


What types of analyses can I do in ProTIGY?

Data manipulation

  • Transformation

    • log transformation
  • Sample-wise Normalization

    • Centering (median)
    • Centering and scaling (median-MAD)
    • 2-component normalization
    • Upper quartile normalization
    • Quantile normalization
    • Variance stabilizing normalization (VSN)
  • Filtering

    • Missing Values
    • Reproducibility filter across replicate measurements
    • Standard deviation across samples

Marker selection (based on limma package)

  • One-sample moderated T-test
  • Two-sample moderated T-test
  • Moderated F-test

Interactive data analysis and visualization

  • Heatmaps and cluster analysis

  • Volcano plots

  • Scatterplots between samples

  • Principal component analysis (PCA)

  • QC-plots

    • Pairs-plots
    • Correlation matrix
    • Data distribution
    • Extend of missing values

Protein-protein interaction (PPI) analysis

  • Automatic conversion of protein accession numbers (UniProt, RefSeq) to gene symbols

  • Integration of protein-protein interaction databases (human only)

Export of analysis results

  • Results comprise high-quality figures (pdf), Excel-sheets and the R-workspace itself.
  • All results can be downloaded as single zip-file.
  • GCT v1.3
  • Rmarkdown analysis reports (beta)
  • Save and share sessions on a server (Shiny Server Pro, RStudio Connect)

How do I get my data into ProTIGY?

ProTIGY takes any type of text file containing both, quantitative and annotation data as input. Result files of proteomics software packages such as Spectrum Mill or MaxQuant can be readily imported into ProTIGY.

The preferred data format is GCT v1.3, a tab-delimited text file format that is convenient for analysis of matrix-compatible datasets as it allows metadata about an experiment to be stored alongside the data from the experiment.

For file formats other than GCT v1.3 a separate experimental design file annotating which columns in the uploaded data file contain expression data measured across different experimental conditions has to be uploaded separately. This file is dataset specific and a template can be downloaded from ProTIGY.

More information on how to upload data into ProTIGY can be found in this pdf file.

Supported file formats:


How can I run ProTIGY on my own hardware?

The app can run locally on your desktop computer (Windows/Linux/MAC) or can be deployed to Shiny Server or RStudio Connect.

Running ProTIGY locally

Software requirements:

  • R 3.4-3.6.3
  • Shiny R-package : install.packages("shiny")
  • Pandoc (optional, required to create R Markdown reports)
  • Perl (optional, required to create Excel sheets on Windows OS)

The easiest way to play around with ProTIGY is to directly run it from GitHub. Open R and type:

shiny::runGitHub("protigy", "broadinstitute")

This command will download the repository and run the app. Please follow the instructions to make sure all required R packages will get properly installed. This process might take several minutes when you run the command for the first time. Please note that running ProTIGY from GitHub will initiate the download every time which - depending on the available internet connection - might take a moment or two.

If you plan to use ProTIGY regularly we recommend to clone or download the repository to your local PC/Mac. To start the app open R and type:

shiny::runApp("/path/to/Protigy")

Running ProTIGY in a Shiny Server or RStudio Connect environment

To access all implemented features the app has to be deployed to Shiny Server Pro (SSP) or RStudio Connect (RSC). SSP/RSC specific features include:

  • User authentication as provided by SSP.
  • Save and retrieve app-sessions on the server.
  • Share saved sessions with your team.

Please note that some server specific parameters (e.g. folder to store sessions data, tmp folder, email for trouble shooting, ...) have to be modified accordingly in global.R. If you are deploying to RSC make sure to disable the pacman package manager by setting PACMAN <- FALSE in global.R.


Where can I find documentation?

Below are slides presented during a BroadE workshop in Feb 2018 at the Broad Institute:


License

ProTIGY has been released under BSD-3 license. ProTIGY is being developed and maintained at the Proteomics Platform of the Broad Institute of MIT and Harvard.