PconsFold

A pipeline for protein folding using predicted contacts from PconsC and a Rosetta folding protocol.

You find supplementary data, such as protein IDs, sequences, native and predicted structures, predicted contacts at the bottom of the release page.

Pipeline overview:

Input: fasta file containing one protein sequence
Prepare input for PconsC
Contact prediction with PconsC
Prepare input for Rosetta folding
Rosetta folding
Extract and relax structures with lowest Rosetta energy
Output: the predicted contact map (also as a plot) and the top-ranked structural model(s) relaxed and non-relaxed

Dependencies:

MATLAB is needed to run plmDCA. However, if MATLAB is not available you can also use a compiled version of plmDCA. For the compiled version to run you need to provide a path to MCR.

How to run it:

Make sure all dependencies are working correctly and adjust the paths in localconfig.py.

To run the full pipeline use:

./pcons_fold.py [-c n_cores] [-n n_decoys] [-m n_models]
                [-f factor] [--norelax] [--nohoms] 
                hhblits_database jackhmmer_database sequence_file

Required:
- hhblits_database and jackhmmer_database are paths to the databases used by HHblits and Jackhmmer
- sequence_file is the path to the input protein sequence in FASTA format (only single sequences).
Optional:
- n_cores specifies the number of cores to use during computation (default: number of available cores).
- n_decoys specifies the number of decoy structures generated by Rosetta (default: 2000, see publication).
- n_models is the number of top-ranked models being extracted and eventually relaxed in the end (default: 10).
- factor determines the number of constraints used to fold the protein, which is: factor * length_of_the_input_sequence (default: 1.0).
- norelax is a flag that supresses relaxation of the final models. This can be used to quickly extract structures in the end.
- nohoms is a flag that ensures that homologous structures are excluded from fragment picking. This is only useful in test cases if the model quality needs to be evaluated with a known structure.

You can also run PconsC contact prediction independently with this command:

./pconsc/predict_all.py [-c cores] hhblits_database jackhmmer_database sequence_file

And then fold the protein according to given predicted contacts with the following commands:

./folding/rosetta/prepare_input.py [-f factor] [--nohoms] sequence_file contact_map 

./folding/rosetta/fold.py [-c n_cores] [-n n_decoys] sequence_file rosetta_constraintfile

./folding/rosetta/extract.py [-c n_cores] [-m n_models] [--norelax] number_of_extracted_structures

The first script generates the file (pconsc_output)-(factor).constraints which is then used by Rosetta in the next step with rosetta_constraintfile.

Citation

M Michel, S Hayat, MJ Skwark, C Sander, DS Marks and A Elofsson. PconsFold: Improved contact predictions improve protein models. Bioinformatics (2014). 30(17): i482-i488

Name		Name	Last commit message	Last commit date
Latest commit History 245 Commits
folding		folding
paper		paper
pconsc		pconsc
supplement/procheck		supplement/procheck
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
localconfig.py.template		localconfig.py.template
pcons_fold.py		pcons_fold.py
pipeline_horiz.png		pipeline_horiz.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

folding

folding

paper

paper

pconsc

pconsc

supplement/procheck

supplement/procheck

.gitattributes

.gitattributes

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

localconfig.py.template

localconfig.py.template

pcons_fold.py

pcons_fold.py

pipeline_horiz.png

pipeline_horiz.png

Repository files navigation

PconsFold

Pipeline overview:

Dependencies:

How to run it:

Citation

About

Releases 1

Packages

Contributors 3

Languages

License

ElofssonLab/pcons-fold

Folders and files

Latest commit

History

Repository files navigation

PconsFold

Pipeline overview:

Dependencies:

How to run it:

Citation

About

Resources

License

Stars

Watchers

Forks

Languages