TCRSIP

TCRSIP is a tool built upon AlphaFold2 based structural inference to predict TCR-pMHC specificity from inferred structures. The end-to-end model is available through a google colab notebook based webserver instance to be rapidly deployable by researchers.

		Date Published	Status
TCRSIP v1.0	link	3-29-26	Running

The tool requires sequence information for the full length or truncated alpha beta TCR, the peptide sequence (valid amino acid only), and the full-length or truncated MHC molecule (Class 1 or 2). Sequences must include the variable region of both chains of the TCR and the interacting region of the MHC (Will not accept B2M). The model has only been validated on human TCR and not on gamma delta TCR or CD1 molecules. Putative TCR-pMHC can be run in batches and it is recommended to provide a google drive folder to save intermediate datasteps as each sample can take up to 20 min to run.

The code hosted in this repository can be run to process and extract structural features in TCR-pMHC structures. Code to reproduce specific analyses from manuscript or simulation runs can also be found in the ./scripts directory.

Installation

Requirements: TCRSIP pipeline requires a version of the conda package manager to be installed.

To install the TCRSIP pipeline, clone the directory to a local directory:

git clone https://github.com/RobbenLab/TCRSIP

Create a new conda environment using the provided environment.yml file

conda env create -f environment.yml

You can then activate the environment with the following command:

conda activate new_env

Must also install PyMol:

conda install -c conda-forge pymol-open-source

And install PyRosetta from website (requires license): https://www.pyrosetta.org/downloads

And install gromacs to /usr/local/ as specificied in the instructions: https://manual.gromacs.org/current/install-guide/index.html

Running Structural Processing and Feature Extraction Pipeline

To run the processing pipeline, use the run.sh command from the main directory and point it towards a directory containing structures in the required format.

./{Folder}/Structures/Raw/....

Run.sh will run a pipeline that takes a folder of TCRpMHC pdb files as input and then extract features used for analysis and prediction of interaction.

bash run.sh ./data/example/ -o ./output

This code will run the 10 pdb's in the example folder and will output in the output directory specified.

Run.sh
 
Usage: bash Run.sh /path/to/input/dir/ [options]
  -o | --output (./output) Output directory
  -r | --no-rosetta (off) Flag to turn off rosetta binding energy calculation (This saves time, ~7 min per file, but results in less predictive features) using Rosetta is on by default
  -g | --no-gromacs (off) Flag to turn off gromacs enabled Hbond calculations, using Gromacs is on by default

The command will run the **Analyze_Folded.py** and **ExtractFeatures.py** scripts to generate features from the input pdb files. Input directory is required to Temporary files and finished features will be output into the input directory.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
data		data
settings		settings
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Run.sh		Run.sh
TCRtemplate.csv		TCRtemplate.csv
environment.yml		environment.yml
mdout.mdp		mdout.mdp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TCRSIP

Installation

Running Structural Processing and Feature Extraction Pipeline

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TCRSIP

Installation

Running Structural Processing and Feature Extraction Pipeline

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages