Skip to content

CSBG-LSU/BionoiNet

Repository files navigation

BionoiNet

BionoiNet is a deep learning-based framework for ligand-binding site classification. It transforms 3-d structures of ligand-binding sites into 2-d images, and then these images are processed by a deep neural network (DNN) to generate classification results. The pipeline of BionoiNet is shown below:

The 2-d images of ligand-binding sites are generated using the Bionoi software and the DNN is implemented with the APIs of Pytorch. The datasets are divided into 5 folds for cross-validation and the sequence identity between the training and validation sets is less than 20%.

Dependency

Install the dependency package using this file: dependency/environment.yml. This denpendency file is exported by Anaconda. To install the environment:

conda env create -f environment.yml

Folders

  • bionoi: a software that transforms ligand-binding sites (.mol2 files) into Voronoi diagrams.
  • bionoi_autoencoder: autoencoder that trained on Bionoi images in an un-supervised manner. The trained autoencoder can be used to produce latent space vectors of Bionoi images for machine learning applications.
  • bionoi_cnn_homology_reduced: a convolutional neural network (CNN) trained on the Voronoi representations of ligand-binding sites for classification.
  • dependency: dependency python packages.
  • homology_reduced_folds: files containing folds such that the sequence identity between train and validation is less than 20%.
  • lecacy: codes produced during research, not used in final version.

Usage

  1. Unzip the .mol2 files located at /homology_reduced_folds/mols.zip.
  2. Download this repository and put it where you extract the .mol2 files. The layout of the project should be like following:
    .
    ├── BionoiNet
    └── homology_reduced_mols      
        ├── control
            ├── fold_1   
            ├── fold_2
            ├── ...
            └── fold_5
        ├── heme
            ├── fold_1   
            ├── fold_2
            ├── ...
            └── fold_5
        └── nucleotide
            ├── fold_1   
            ├── fold_2
            ├── ...
            └── fold_5
  1. Run Bionoi by executing /bionoi/img_gen.sh to transform .mol2 files to images.
  2. Train the convolutional neural network (CNN) at /bionoi_cnn_homology_reduced/ for cross-validation:
python homology_reduced_cnn_cv_resnet18.py -op control_vs_nucleotide -root_dir ../../bionoi_output/residue_type/ -batch_size 32 -result_file_suffix 1st
  1. Grab a cup of coffee and wait for results.

Citation

If you find this work useful, please cite our papers in your work :)

Shi W, Lemoine JM, Shawky MA, Singha M, Pu L, Yang S, Ramanujam J, Brylinski M. (2020) BionoiNet: Ligand-binding site classification with off-the-shelf deep neural network. Bioinformatics 36 (10): 3077-3083.

Feinstein J, Shi W, Ramanujam J, Brylinski M. Bionoi: A Voronoi Diagram-Based Representation of Ligand-Binding Sites in Proteins for Machine Learning Applications. Methods Mol Biol. 2021;2266:299-312. doi: 10.1007/978-1-0716-1209-5_17. PMID: 33759134.