alberlab/pgs

PGS: Population-based Genome Structure modeling tools package

PGS is a population-based 3D genome-modeling package implemented in Python. The software takes a Hi-C matrix and either a chromosome region segmentation or topologically associating domain (TAD) information as input, and generates a population of genome structures. It also automatically generates analysis reports, including structure quality based on scoring parameters, plots of radial positions, and contact frequency maps computed from the structures. The entire workflow is wrapped in Python, so users can run it with a single command.

To get started, please follow the instructions below or read our Documentation.

Note: PGS runs on high-performance computing (HPC) environments, such as Sun Grid Engine and TORQUE (PBS scripts), as well as on a local machine. However, running on HPC is highly recommended because of the computational resources and running time required.

Here is an overview of the PGS pipeline:

[Figure: overview of the PGS pipeline]

Installation

Requirements:

Conda is recommended for installing all the requirements. Either Anaconda or the minimal Miniconda is suitable for managing the required packages, including IMP. If you use Miniconda, you can install the Python dependencies as follows:

    $ conda install numpy scipy pandas h5py matplotlib seaborn

Install IMP using conda:

    $ conda config --add channels salilab
    $ conda install imp

All other dependencies for IMP and the Python packages will be installed automatically.

Then install the PGS workflow package from the directory that contains setup.py:

    $ python setup.py install
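
After installation, it can be useful to confirm that the required modules are visible to the Python interpreter you plan to use. The following is a minimal sketch, not part of PGS itself: the helper name `find_missing` is hypothetical, and the list simply mirrors the packages named in the conda commands above (`IMP` is the import name of the conda `imp` package).

```python
import importlib.util

# Import names for the packages installed above; "IMP" is the module
# name provided by the conda `imp` package.
REQUIRED_MODULES = ["numpy", "scipy", "pandas", "h5py", "matplotlib", "seaborn", "IMP"]


def find_missing(modules):
    """Return the names in `modules` that the import system cannot locate."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

The check only looks the modules up without importing them, so it does not verify that each package actually loads.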

PGS Helper GUI

The PGS package includes a graphical user interface (GUI) helper program that makes it easier to run PGS. With the PGS Helper, users can generate the command script (i.e. runPgs.sh) and the configuration file (i.e. input_config.json).

Run PGS Helper

To start PGS Helper (a Java runtime is required):

    $ java -jar PGSHelper.jar

The PGS Helper window will then appear:

[Figure: PGS Helper GUI]

Run PGS

Users can run the PGS package from the project directory (shown as PROJECT_DIR) with the following command:

    $ PROJECT_DIR> sh runPgs.sh
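
Before launching a long run, a quick pre-flight check can catch a missing or malformed file early. The sketch below is not part of PGS: the function name `preflight` is hypothetical, and it assumes that runPgs.sh and input_config.json both sit in the same project directory. It only confirms that the configuration file parses as JSON and does not validate any of the keys PGS expects.

```python
import json
from pathlib import Path


def preflight(project_dir, script="runPgs.sh", config="input_config.json"):
    """Return a list of problems found in project_dir; an empty list means none."""
    project = Path(project_dir)
    problems = []

    if not (project / script).is_file():
        problems.append(f"missing command script: {script}")

    config_path = project / config
    if not config_path.is_file():
        problems.append(f"missing configuration file: {config}")
    else:
        try:
            # Only checks that the file parses as JSON; the keys PGS
            # expects are not validated here.
            json.loads(config_path.read_text())
        except ValueError as exc:
            problems.append(f"{config} is not valid JSON: {exc}")

    return problems
```

For example, `preflight(".")` run from the project directory returns an empty list when both files are present and the configuration file is well-formed JSON.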

References

Hua et al. Producing genome structure populations with the dynamic and automated PGS software. Nature Protocols 13, 915-926 (2018).

Tjong et al. Population-based 3D genome structure analysis reveals driving forces in spatial genome organizations. PNAS 113, E1663-E1672 (2016).