Skip to content

MassGEM_Plot can visualize the expression of a large number of genes at the same time

License

Notifications You must be signed in to change notification settings

yueyaog/MassGEM_Plot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

MassGEM_Plot

This repository is designed to visualize the expression of a large amount of genes as mutiple line plots and save them to a mutipage pdf.

Motivation

Accurate detection of differentially expressed genes (DEGs) is not a travial task. Although there are visualization methods to present the expressions of a large amount of genes at the same time, biases introduced in normalization, scaling transformation, and replicate variability. So most researchers conduct careful 'eyeball' inspection to curate a reliable set of DEGs. This process can be time consuming and perhaps frustrating for most researchers. With MassGEM_Plot, you don't need to copy and paste to check gene expression line plot one by one any more. MassGEM_Plot takes a user input Gene Expression Matrix (GEM) and a user specified gene list to generate all the gene expression line plots you want at the same time.

Installation

All of MassGEM_Plot's dependencies can be installed through Anaconda3. The following installation commands are specified to create an Anaconda environment using Clemson's Palmetto cluster. But the dependencies can be easily install to any HPC system or even your own laptop!

module load anaconda3/5.1.0-gcc/8.3.1

conda create -n MassGEM_Plot python=3.6 matplotlib numpy pandas seaborn

Once the anaconda environment has been created, you need to activate your environment to use MassGEM_Plot:

source activate MassGEM_Plot

After that, simply clone MassGEM_Plot repository to use MassGEM_Plot

git clone https://github.com/yueyaog/MassGEM_Plot.git

Input Data

MassGEM_Plot takes three primary inputs:

  • a gene expression matrix(GEM)
  • a mega info file with sample labels and timepoints
  • a list of gene sets To visualize the gene expression of user request genelist:
python MassGEM_Plot.py -emx /path/to/expression.csv -mega /path/to/Megainfo.csv -genelist /path/to/user_genelist.txt -smpl sample1 sample2 sample5 -o test

Above,expression.csv is of the format:

About

MassGEM_Plot can visualize the expression of a large number of genes at the same time

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages