From d806ecd62179f5644ae8e8d1c540d037b59090d3 Mon Sep 17 00:00:00 2001 From: AngelRuizMoreno Date: Fri, 17 Sep 2021 23:14:25 -0500 Subject: [PATCH] README --- .../1.-Molecular_Docking-checkpoint.ipynb | 2033 +++++++++++ .../3.-Blind_Docking-checkpoint.ipynb | 2970 +++++++++++++++++ .ipynb_checkpoints/README-checkpoint.ipynb | 30 +- 1.-Molecular_Docking.ipynb | 81 +- 3.-Blind_Docking.ipynb | 8 - README.ipynb | 30 +- README.md | 18 +- 7 files changed, 5077 insertions(+), 93 deletions(-) create mode 100644 .ipynb_checkpoints/1.-Molecular_Docking-checkpoint.ipynb create mode 100644 .ipynb_checkpoints/3.-Blind_Docking-checkpoint.ipynb diff --git a/.ipynb_checkpoints/1.-Molecular_Docking-checkpoint.ipynb b/.ipynb_checkpoints/1.-Molecular_Docking-checkpoint.ipynb new file mode 100644 index 00000000..d1cb8e03 --- /dev/null +++ b/.ipynb_checkpoints/1.-Molecular_Docking-checkpoint.ipynb @@ -0,0 +1,2033 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "id": "6951e168-585b-426c-8bd9-bbbef8a1e14e", + "metadata": {}, + "source": [ + "# Molecular docking" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "id": "756603a6-b74c-48e2-a532-0a1a26c56d53", + "metadata": {}, + "outputs": [], + "source": [ + "from pymol import cmd\n", + "import py3Dmol\n", + "\n", + "from vina import Vina\n", + "\n", + "from openbabel import pybel\n", + "\n", + "from rdkit import Chem\n", + "from rdkit.Chem import AllChem, Draw\n", + "\n", + "from meeko import MoleculePreparation\n", + "from meeko import obutils\n", + "\n", + "import MDAnalysis as mda\n", + "from MDAnalysis.coordinates import PDB\n", + "\n", + "import prolif as plf\n", + "from prolif.plotting.network import LigNetwork\n", + "\n", + "\n", + "import sys, os\n", + "sys.path.insert(1, 'utilities/')\n", + "from utils import fix_protein, getbox, generate_ledock_file, dok_to_sdf\n", + "\n", + "\n", + "import warnings\n", + "warnings.filterwarnings(\"ignore\")\n", + "%config Completer.use_jedi = False" + ] + }, + { + "cell_type": 
"code", + "execution_count": 2, + "id": "f34f98c3-1454-4f67-92b7-ce9cf054c333", + "metadata": {}, + "outputs": [], + "source": [ + "os.chdir('test/Molecular_Docking/')" + ] + }, + { + "cell_type": "markdown", + "id": "72001c44-17ca-4518-ad7f-d0db24f68063", + "metadata": {}, + "source": [ + "## Fetching the system directly from PDB using pymol" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "id": "050514b3-d554-43ab-ba1e-62c1cb95458b", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " PyMOL not running, entering library mode (experimental)\n" + ] + } + ], + "source": [ + "cmd.fetch(code='1AZ8',type='pdb1')\n", + "cmd.select(name='Prot',selection='polymer.protein')\n", + "cmd.select(name='Lig',selection='organic')\n", + "cmd.save(filename='1AZ8_clean.pdb',format='pdb',selection='Prot')\n", + "cmd.save(filename='1AZ8_lig.mol2',format='mol2',selection='Lig')\n", + "cmd.delete('all')" + ] + }, + { + "cell_type": "markdown", + "id": "fe52106e-292c-4ff3-87bc-de5265368853", + "metadata": {}, + "source": [ + "## Protein sanitization" + ] + }, + { + "cell_type": "markdown", + "id": "5dec3aa7-6554-41ba-8a66-1f3e82214235", + "metadata": {}, + "source": [ + "#### Method 1: LePro\n", + "\n", + "Usage:\n", + "\n", + " **lepro [PDB file] [-rot || -metal || -p]**\n", + "\n", + " **-rot** [[chain] resid] align principal axes of the binding site with Cartesian\n", + "\n", + " **-metal** keep ZN/MN/CA/MG\n", + "\n", + " **-metal -p** redistribute metal charge to protein" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "id": "836973a5-248f-43ba-a1f9-345c2cded6b0", + "metadata": {}, + "outputs": [], + "source": [ + "!../../bin/lepro_linux_x86 {'1AZ8_clean.pdb'}\n", + "\n", + "os.rename('pro.pdb','1AZ8_clean_H.pdb')" + ] + }, + { + "cell_type": "markdown", + "id": "e89ca421-ab3a-4af4-8160-bcc9f2334beb", + "metadata": {}, + "source": [ + "#### Method 2: fix_protein (PDBFixer)\n", + "\n", + "**_fix_protein ( 
params )_**\n", + "\n", + "Params:\n", + " \n", + " - **filename**: _str or path-like_ ; input file containing the protein structure to be modified, file extension must be pdb\n", + "\n", + " - **addHs_pH**: _float_ ; Add hydrogens at user defined pH\n", + "\n", + " - **try_renumberResidues**: _bool_ ; By default PDBFixer renumbers residues starting at 1, this option tries to recover the original residue numbering\n", + " \n", + " - **output**: _str or path-like_ ; output filename, extension must be pdb" + ] + }, + { + "cell_type": "markdown", + "id": "e1e6f4b6-d96a-458b-9b33-3f4846ff3e68", + "metadata": {}, + "source": [ + "```\n", + "fix_protein(filename='1AZ8_clean.pdb',addHs_pH=7.4,try_renumberResidues=True,output='1AZ8_clean_H.pdb')\n", + "```" + ] + }, + { + "cell_type": "markdown", + "id": "2a4fc4a8-a5bc-4cb1-a329-dc1c52db032b", + "metadata": {}, + "source": [ + "## Ligand sanitization" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "id": "c461429c-3036-43a0-8ec9-ba3e854c045f", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + "RDKit WARNING: [15:10:19] 1AZ8: Warning - no explicit hydrogens in mol2 file but needed for formal charge estimation.\n" + ] + }, + { + "data": { + "image/png": "\n", + "text/plain": [ + "" + ] + }, + "execution_count": 6, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "m=Chem.MolFromMol2File('1AZ8_lig.mol2',False)\n", + "Draw.MolToImage(m)" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "id": "3076da38-9420-4e3c-8bb9-4c99eae74f27", + "metadata": {}, + "outputs": [], + "source": [ + "mol= [m for m in pybel.readfile(filename='1AZ8_lig.mol2',format='mol2')][0]\n", + "mol.addh()\n", + "out=pybel.Outputfile(filename='1AZ8_lig_H.mol2',format='mol2',overwrite=True)\n", + "out.write(mol)\n", + "out.close()" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "id": "8473f062-aade-45ca-a18b-463545341c41", + "metadata": {}, +
"outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "" + ] + }, + "execution_count": 8, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "m=Chem.MolFromMol2File('1AZ8_lig_H.mol2')\n", + "m" + ] + }, + { + "cell_type": "markdown", + "id": "f1db2b5f-d083-4640-8b29-8b6ee4ba2d37", + "metadata": {}, + "source": [ + "## System visualization" + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "id": "77c37840-7880-4a12-84a3-feaeb794e726", + "metadata": {}, + "outputs": [ + { + "data": { + "application/3dmoljs_load.v0": "
\n

You appear to be running in JupyterLab (or JavaScript failed to load for some other reason). You need to install the 3dmol extension:
\n jupyter labextension install jupyterlab_3dmol

\n
\n", + "text/html": [ + "
\n", + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "view = py3Dmol.view()\n", + "view.removeAllModels()\n", + "view.setViewStyle({'style':'outline','color':'black','width':0.1})\n", + "\n", + "view.addModel(open('1AZ8_clean_H.pdb','r').read(),format='pdb')\n", + "Prot=view.getModel()\n", + "Prot.setStyle({'cartoon':{'arrows':True, 'tubes':True, 'style':'oval', 'color':'white'}})\n", + "view.addSurface(py3Dmol.VDW,{'opacity':0.6,'color':'white'})\n", + "\n", + "\n", + "view.addModel(open('1AZ8_lig_H.mol2','r').read(),format='mol2')\n", + "ref_m = view.getModel()\n", + "ref_m.setStyle({},{'stick':{'colorscheme':'greenCarbon','radius':0.2}})\n", + "\n", + "view.zoomTo()\n", + "view.show()" + ] + }, + { + "cell_type": "markdown", + "id": "e4114eb4-1614-44df-b837-f695bd3ead01", + "metadata": {}, + "source": [ + "## Docking with Vina" + ] + }, + { + "cell_type": "markdown", + "id": "f79f6541-a4ee-44e8-b837-f939ce818ef2", + "metadata": {}, + "source": [ + "### Protein preparation" + ] + }, + { + "cell_type": "markdown", + "id": "29d06de0-20b4-4af5-baf5-30c1d25ccfc4", + "metadata": {}, + "source": [ + "Usage: \n", + "\n", + " **prepare_receptor -r filename**\n", + "\n", + " Description of command...\n", + " -r receptor_filename\n", + " supported file types include pdb,mol2,pdbq,pdbqs,pdbqt, possibly pqr,cif\n", + " Optional parameters:\n", + " [-v] verbose output (default is minimal output)\n", + " [-o pdbqt_filename] (default is 'molecule_name.pdbqt')\n", + " [-A] type(s) of repairs to make:\n", + " 'bonds_hydrogens': build bonds and add hydrogens\n", + " 'bonds': build a single bond from each atom with no bonds to its closest neighbor\n", + " 'hydrogens': add hydrogens\n", + " 'checkhydrogens': add hydrogens only if there are none already\n", + " 'None': do not make any repairs\n", + " (default is 'None')\n", + " [-C] preserve all input charges ie do not add new charges\n", + " (default is addition of gasteiger charges)\n", + " 
[-p] preserve input charges on specific atom types, eg -p Zn -p Fe\n", + " [-U] cleanup type:\n", + " 'nphs': merge charges and remove non-polar hydrogens\n", + " 'lps': merge charges and remove lone pairs\n", + " 'waters': remove water residues\n", + " 'nonstdres': remove chains composed entirely of residues of\n", + " types other than the standard 20 amino acids\n", + " 'deleteAltB': remove XX@B atoms and rename XX@A atoms->XX\n", + " (default is 'nphs_lps_waters_nonstdres')\n", + " [-e] delete every nonstd residue from any chain\n", + " 'True': any residue whose name is not in this list:\n", + " ['CYS','ILE','SER','VAL','GLN','LYS','ASN',\n", + " 'PRO','THR','PHE','ALA','HIS','GLY','ASP',\n", + " 'LEU', 'ARG', 'TRP', 'GLU', 'TYR','MET',\n", + " 'HID', 'HSP', 'HIE', 'HIP', 'CYX', 'CSS']\n", + " will be deleted from any chain.\n", + " NB: there are no nucleic acid residue names at all\n", + " in the list and no metals.\n", + " (default is False which means not to do this)\n", + " [-M] interactive\n", + " (default is 'automatic': outputfile is written with no further user input)\n", + " [-d dictionary_filename] file to contain receptor summary information\n", + " [-w] assign each receptor atom a unique name: newname is original name plus its index(1-based)" + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "id": "d182cce2-1805-42ff-bbdd-53e3b5a9fe75", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "set verbose to True\n", + "set receptor_filename to 1AZ8_clean_H.pdb\n", + "set outputfilename to 1AZ8_clean_H.pdbqt\n", + "read 1AZ8_clean_H.pdb\n", + "setting up RPO with mode= automatic and outputfilename= 1AZ8_clean_H.pdbqt\n", + "charges_to_add= gasteiger\n", + "delete_single_nonstd_residues= None\n", + "adding gasteiger charges to peptide\n" + ] + } + ], + "source": [ + "!../../bin/prepare_receptor -v -r 1AZ8_clean_H.pdb -o 1AZ8_clean_H.pdbqt" + ] + }, + { + "cell_type": "markdown", + "id": 
"8203e23c-a4be-4b29-b609-b543d86e1b56", + "metadata": {}, + "source": [ + "### Ligand preparation" + ] + }, + { + "cell_type": "markdown", + "id": "f9473d92-ce0b-4c64-95a1-d7188b4f6225", + "metadata": {}, + "source": [ + "#### Method 1: ADTools binaries\n", + "\n", + "Usage: \n", + "\n", + "**prepare_ligand -l filename**\n", + "\n", + " Description of command...\n", + " -l ligand_filename (.pdb or .mol2 or .pdbq format)\n", + " Optional parameters:\n", + " [-v] verbose output\n", + " [-o pdbqt_filename] (default output filename is ligand_filename_stem + .pdbqt)\n", + " [-d] dictionary to write types list and number of active torsions\n", + " [-A] type(s) of repairs to make:\n", + " bonds_hydrogens, bonds, hydrogens (default is to do no repairs)\n", + " [-C] do not add charges (default is to add gasteiger charges)\n", + " [-p] preserve input charges on an atom type, eg -p Zn\n", + " (default is not to preserve charges on any specific atom type)\n", + " [-U] cleanup type:\n", + " nphs_lps, nphs, lps, '' (default is 'nphs_lps')\n", + " [-B] type(s) of bonds to allow to rotate\n", + " (default sets 'backbone' rotatable and 'amide' + 'guanidinium' non-rotatable)\n", + " [-R] index for root\n", + " [-F] check for and use largest non-bonded fragment (default is not to do this)\n", + " [-M] interactive (default is automatic output)\n", + " [-I] string of bonds to inactivate composed of\n", + " of zero-based atom indices eg 5_13_2_10\n", + " will inactivate atoms[5]-atoms[13] bond\n", + " and atoms[2]-atoms[10] bond\n", + " (default is not to inactivate any specific bonds)\n", + " [-Z] inactivate all active torsions\n", + " (default is leave all rotatable active except amide and guanidinium)\n", + " [-g] attach all nonbonded fragments\n", + " [-s] attach all nonbonded singletons:\n", + " NB: sets attach all nonbonded fragments too\n", + " (default is not to do this)\n", + " [-w] assign each ligand atom a unique name: newname is original name plus its index(1-based)" + ] + 
}, + { + "cell_type": "code", + "execution_count": 11, + "id": "35c34131-bda2-43ac-8a44-932d61016e7c", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "set verbose to True\n", + "set ligand_filename to 1AZ8_lig_H.mol2\n", + "set outputfilename to 1AZ8_lig_H.pdbqt\n", + "read 1AZ8_lig_H.mol2\n", + "setting up LPO with mode= automatic and outputfilename= 1AZ8_lig_H.pdbqt\n", + "and check_for_fragments= False\n", + "and bonds_to_inactivate= \n", + "returning 0\n", + "No change in atomic coordinates\n" + ] + } + ], + "source": [ + "!../../bin/prepare_ligand -v -l 1AZ8_lig_H.mol2 -o 1AZ8_lig_H.pdbqt" + ] + }, + { + "cell_type": "markdown", + "id": "0c3d96a3-2f37-4710-b132-b4b160fd76a3", + "metadata": {}, + "source": [ + "#### Method 2: Meeko" + ] + }, + { + "cell_type": "markdown", + "id": "4bcf45df-7781-4243-bc89-881693c03875", + "metadata": {}, + "source": [ + "```\n", + "mol = obutils.load_molecule_from_file('1AZ8_lig_H.mol2')\n", + "\n", + "preparator = MoleculePreparation(merge_hydrogens=True,hydrate=False)\n", + "preparator.prepare(mol)\n", + "\n", + "preparator.write_pdbqt_file('1AZ8_lig_H.pdbqt')\n", + "```" + ] + }, + { + "cell_type": "markdown", + "id": "c2e2b7a0-8e6d-4214-9997-d5cb363937ea", + "metadata": {}, + "source": [ + "#### Method 3: Pybel" + ] + }, + { + "cell_type": "markdown", + "id": "f0141f8b-86cf-4450-95d9-93dc58a34a1b", + "metadata": {}, + "source": [ + "```\n", + "ligand = [m for m in pybel.readfile(filename='1AZ8_lig_H.mol2',format='mol2')][0]\n", + "out=pybel.Outputfile(filename='1AZ8_lig_H.pdbqt',format='pdbqt',overwrite=True)\n", + "out.write(ligand)\n", + "out.close()\n", + "```" + ] + }, + { + "cell_type": "markdown", + "id": "86513681-d4c6-438c-aa98-b3aa7513f830", + "metadata": {}, + "source": [ + "### Box definition" + ] + }, + { + "cell_type": "markdown", + "id": "5c5d6834-ff67-48fa-85c7-e7e264a2438f", + "metadata": {}, + "source": [ + "**_get_box( params )_**\n", + "\n", + "params:\n", 
+ "\n", + " - **selection**: _str , pymol_selection_; the selection that defines the docking box, can be atom, resn, resid, or any other pymol selection\n", + " \n", + " - **extending**: _float_; value to extend the boundaries of the selection\n", + " \n", + " - **software**: _str , 'vina','ledock', 'both'_ ; depending on the selected software, the function returns the box coordinates in vina, ledock, or both formats" + ] + }, + { + "cell_type": "code", + "execution_count": 14, + "id": "2c1b8f34-3443-4656-8c2e-0de373df7aef", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "{'center_x': 31.859049797058105, 'center_y': 13.347449779510498, 'center_z': 17.06589984893799} \n", + " {'size_x': 24.56949806213379, 'size_y': 18.123299598693848, 'size_z': 17.374399185180664}\n" + ] + } + ], + "source": [ + "cmd.load(filename='1AZ8_clean_H.pdb',format='pdb',object='prot')\n", + "cmd.load(filename='1AZ8_lig_H.mol2',format='mol2',object='lig')\n", + "\n", + "center, size= getbox(selection='lig',extending=5.0,software='vina')\n", + "\n", + "cmd.delete('all')\n", + "\n", + "print(center,'\\n',size)" + ] + }, + { + "cell_type": "markdown", + "id": "d9e3cd32-f577-4f76-b7e1-328a2f65c818", + "metadata": {}, + "source": [ + "### Docking" + ] + }, + { + "cell_type": "markdown", + "id": "36d185a4-71c2-4c6a-9a56-0ac8f60b0ee8", + "metadata": {}, + "source": [ + "#### Method 1: vina" + ] + }, + { + "cell_type": "code", + "execution_count": 15, + "id": "732c4d41-a58a-479f-95c5-024a48b4eae1", + "metadata": {}, + "outputs": [], + "source": [ + "v = Vina(sf_name='vina')\n", + "\n", + "v.set_receptor('1AZ8_clean_H.pdbqt')\n", + "\n", + "v.set_ligand_from_file('1AZ8_lig_H.pdbqt')\n", + "\n", + "v.compute_vina_maps(center=[center['center_x'], center['center_y'], center['center_z']], \n", + " box_size=[size['size_x'], size['size_y'], size['size_z']])\n", + "\n", + "'''\n", + "# Score the current pose\n", + "energy = v.score()\n", + "print('Score before
minimization: %.3f (kcal/mol)' % energy[0])\n", + "\n", + "# Minimized locally the current pose\n", + "energy_minimized = v.optimize()\n", + "print('Score after minimization : %.3f (kcal/mol)' % energy_minimized[0])\n", + "v.write_pose('1iep_ligand_minimized.pdbqt', overwrite=True)\n", + "'''\n", + "\n", + "# Dock the ligand\n", + "v.dock(exhaustiveness=10, n_poses=10)\n", + "v.write_poses('1AZ8_lig_vina_out.pdbqt', n_poses=10, overwrite=True)" + ] + }, + { + "cell_type": "markdown", + "id": "26f92461-7734-4722-abef-cad651554c97", + "metadata": {}, + "source": [ + "#### Method 2: smina\n", + "\n", + "Correct usage:\n", + "\n", + " Input:\n", + " -r [ --receptor ] arg rigid part of the receptor (PDBQT)\n", + " --flex arg flexible side chains, if any (PDBQT)\n", + " -l [ --ligand ] arg ligand(s)\n", + " --flexres arg flexible side chains specified by comma\n", + " separated list of chain:resid or\n", + " chain:resid:icode\n", + " --flexdist_ligand arg Ligand to use for flexdist\n", + " --flexdist arg set all side chains within specified distance\n", + " to flexdist_ligand to flexible\n", + "\n", + " Search space (required):\n", + " --center_x arg X coordinate of the center\n", + " --center_y arg Y coordinate of the center\n", + " --center_z arg Z coordinate of the center\n", + " --size_x arg size in the X dimension (Angstroms)\n", + " --size_y arg size in the Y dimension (Angstroms)\n", + " --size_z arg size in the Z dimension (Angstroms)\n", + " --autobox_ligand arg Ligand to use for autobox\n", + " --autobox_add arg Amount of buffer space to add to auto-generated\n", + " box (default +4 on all six sides)\n", + " --no_lig no ligand; for sampling/minimizing flexible\n", + " residues\n", + "\n", + " Scoring and minimization options:\n", + " --scoring arg specify alternative builtin scoring function\n", + " --custom_scoring arg custom scoring function file\n", + " --custom_atoms arg custom atom type parameters file\n", + " --score_only score provided ligand pose\n", + 
" --local_only local search only using autobox (you probably\n", + " want to use --minimize)\n", + " --minimize energy minimization\n", + " --randomize_only generate random poses, attempting to avoid\n", + " clashes\n", + " --minimize_iters arg (=0) number iterations of steepest descent; default\n", + " scales with rotors and usually isn't sufficient\n", + " for convergence\n", + " --accurate_line use accurate line search\n", + " --minimize_early_term Stop minimization before convergence conditions\n", + " are fully met.\n", + " --approximation arg approximation (linear, spline, or exact) to use\n", + " --factor arg approximation factor: higher results in a\n", + " finer-grained approximation\n", + " --force_cap arg max allowed force; lower values more gently\n", + " minimize clashing structures\n", + " --user_grid arg Autodock map file for user grid data based\n", + " calculations\n", + " --user_grid_lambda arg (=-1) Scales user_grid and functional scoring\n", + " --print_terms Print all available terms with default\n", + " parameterizations\n", + " --print_atom_types Print all available atom types\n", + "\n", + " Output (optional):\n", + " -o [ --out ] arg output file name, format taken from file\n", + " extension\n", + " --out_flex arg output file for flexible receptor residues\n", + " --log arg optionally, write log file\n", + " --atom_terms arg optionally write per-atom interaction term\n", + " values\n", + " --atom_term_data embedded per-atom interaction terms in output\n", + " sd data\n", + "\n", + " Misc (optional):\n", + " --cpu arg the number of CPUs to use (the default is to\n", + " try to detect the number of CPUs or, failing\n", + " that, use 1)\n", + " --seed arg explicit random seed\n", + " --exhaustiveness arg (=8) exhaustiveness of the global search (roughly\n", + " proportional to time)\n", + " --num_modes arg (=9) maximum number of binding modes to generate\n", + " --energy_range arg (=3) maximum energy difference between the best\n", + " binding 
mode and the worst one displayed\n", + " (kcal/mol)\n", + " --min_rmsd_filter arg (=1) rmsd value used to filter final poses to remove\n", + " redundancy\n", + " -q [ --quiet ] Suppress output messages\n", + " --addH arg automatically add hydrogens in ligands (on by\n", + " default)\n", + "\n", + " Configuration file (optional):\n", + " --config arg the above options can be put here\n", + "\n", + " Information (optional):\n", + " --help display usage summary\n", + " --help_hidden display usage summary with hidden options\n", + " --version display program version" + ] + }, + { + "cell_type": "markdown", + "id": "800679a2-6785-4c0a-93a2-ac1d396e4c04", + "metadata": {}, + "source": [ + "```\n", + "!../../bin/smina -r 1AZ8_clean_H.pdbqt -l 1AZ8_lig_H.pdbqt --center_x 31.859 --center_y 13.34 --center_z 17.065 --size_x 24.569 --size_y 18.12 --size_z 17.37 --exhaustiveness 8 --num_modes 5\n", + "```" + ] + }, + { + "cell_type": "markdown", + "id": "ce41cf4f-a916-46ea-9f7c-073292abaa4e", + "metadata": {}, + "source": [ + "### Converting files from PDBQT to SDF" + ] + }, + { + "cell_type": "code", + "execution_count": 17, + "id": "1b8121d0-2485-4865-8cd3-10c945aad530", + "metadata": {}, + "outputs": [], + "source": [ + "results = [m for m in pybel.readfile(filename='1AZ8_lig_vina_out.pdbqt',format='pdbqt')]\n", + "out=pybel.Outputfile(filename='1AZ8_lig_vina_out.sdf',format='sdf',overwrite=True)\n", + "for pose in results:\n", + " out.write(pose)\n", + "out.close()" + ] + }, + { + "cell_type": "markdown", + "id": "01b78957-5274-4a2e-bb91-4d83c551a7b4", + "metadata": {}, + "source": [ + "### Docking poses visualization" + ] + }, + { + "cell_type": "code", + "execution_count": 18, + "id": "f95c31b9-ed5b-4dd3-be0f-380c462dd01f", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " VINA RESULT: -7.492 0.000 0.000\n", + " INTER + INTRA: -13.330\n", + " INTER: -12.196\n", + " INTRA: -1.134\n", + " UNBOUND: -1.458\n", + " 11 active 
torsions:\n", + " status: ('A' for Active; 'I' for Inactive)\n", + " 1 A between atoms: N1_2 and C16_22 \n", + " 2 A between atoms: N3_6 and C20_25 \n", + " 3 A between atoms: C4_8 and C10_16 \n", + " 4 A between atoms: O4_10 and C22_26 \n", + " 5 A between atoms: C6_12 and C20_25 \n", + " 6 A between atoms: C9_15 and C11_17 \n", + " 7 A between atoms: C9_15 and C10_16 \n", + " 8 A between atoms: C10_16 and C14_20 \n", + " 9 A between atoms: C11_17 and C12_18 \n", + " 10 A between atoms: C14_20 and C22_26 \n", + " 11 A between atoms: C15_21 and C16_22 \n" + ] + }, + { + "data": { + "application/3dmoljs_load.v0": "
\n", + "text/html": [ + "
\n", + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "view = py3Dmol.view()\n", + "view.removeAllModels()\n", + "view.setViewStyle({'style':'outline','color':'black','width':0.1})\n", + "\n", + "view.addModel(open('1AZ8_clean_H.pdb','r').read(),format='pdb')\n", + "Prot=view.getModel()\n", + "Prot.setStyle({'cartoon':{'arrows':True, 'tubes':True, 'style':'oval', 'color':'white'}})\n", + "view.addSurface(py3Dmol.VDW,{'opacity':0.6,'color':'white'})\n", + "\n", + "\n", + "view.addModel(open('1AZ8_lig_H.mol2','r').read(),format='mol2')\n", + "ref_m = view.getModel()\n", + "ref_m.setStyle({},{'stick':{'colorscheme':'greenCarbon','radius':0.2}})\n", + "\n", + "\n", + "results=Chem.SDMolSupplier('1AZ8_lig_vina_out.sdf')\n", + "\n", + "p=Chem.MolToMolBlock(results[0],False)\n", + "print (results[0].GetProp('REMARK'))\n", + "\n", + "view.addModel(p,'mol')\n", + "x = view.getModel()\n", + "x.setStyle({},{'stick':{'colorscheme':'cyanCarbon','radius':0.2}})\n", + "\n", + "view.zoomTo()\n", + "view.show()" + ] + }, + { + "cell_type": "markdown", + "id": "75f1749a-85cd-4340-9a6d-d8ff6258ce34", + "metadata": {}, + "source": [ + "### Molecular interacions" + ] + }, + { + "cell_type": "code", + "execution_count": 19, + "id": "3ba16cc8-9285-409e-99ff-7df3e849f96d", + "metadata": {}, + "outputs": [], + "source": [ + "fix_protein(filename='1AZ8_clean.pdb',addHs_pH=7.4,try_renumberResidues=True,output='1AZ8_clean_H_fix.pdb')" + ] + }, + { + "cell_type": "code", + "execution_count": 20, + "id": "089b321d-63d1-46f2-96f1-abdd2dfb00f9", + "metadata": {}, + "outputs": [ + { + "data": { + "application/vnd.jupyter.widget-view+json": { + "model_id": "628eef057d7d42d4a83e9cf476151d47", + "version_major": 2, + "version_minor": 0 + }, + "text/plain": [ + " 0%| | 0/10 [00:00\n", + "\n", + "\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + 
" \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + "
ligandUNL1
proteinASP189.ACYS191.AGLY219.A
interactionHBDonorHydrophobicHBDonor
0(12, 0)(16, 0)(11, 0)
1(11, 0)(None, None)(None, None)
2(None, None)(16, 0)(None, None)
3(None, None)(31, 0)(None, None)
4(None, None)(None, None)(23, 0)
5(None, None)(None, None)(None, None)
6(26, 0)(None, None)(None, None)
7(None, None)(None, None)(25, 0)
8(None, None)(None, None)(23, 0)
9(None, None)(None, None)(None, None)
\n", + "" + ], + "text/plain": [ + "ligand UNL1 \n", + "protein ASP189.A CYS191.A GLY219.A\n", + "interaction HBDonor Hydrophobic HBDonor\n", + "0 (12, 0) (16, 0) (11, 0)\n", + "1 (11, 0) (None, None) (None, None)\n", + "2 (None, None) (16, 0) (None, None)\n", + "3 (None, None) (31, 0) (None, None)\n", + "4 (None, None) (None, None) (23, 0)\n", + "5 (None, None) (None, None) (None, None)\n", + "6 (26, 0) (None, None) (None, None)\n", + "7 (None, None) (None, None) (25, 0)\n", + "8 (None, None) (None, None) (23, 0)\n", + "9 (None, None) (None, None) (None, None)" + ] + }, + "execution_count": 20, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "# load protein\n", + "prot = mda.Universe(\"1AZ8_clean_H_fix.pdb\")\n", + "prot = plf.Molecule.from_mda(prot)\n", + "prot.n_residues\n", + "\n", + "# load ligands\n", + "lig_suppl = list(plf.sdf_supplier('1AZ8_lig_vina_out.sdf'))\n", + "# generate fingerprint\n", + "fp = plf.Fingerprint()\n", + "fp.run_from_iterable(lig_suppl, prot)\n", + "results_df = fp.to_dataframe(return_atoms=True)\n", + "results_df" + ] + }, + { + "cell_type": "code", + "execution_count": 21, + "id": "60090cbb-2fca-42f8-a325-173bebc22030", + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "" + ], + "text/plain": [ + "" + ] + }, + "execution_count": 21, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "net = LigNetwork.from_ifp(results_df,lig_suppl[0],kind=\"frame\", frame=0,rotation=270)\n", + "net.display()" + ] + }, + { + "cell_type": "markdown", + "id": "fbab030f-d433-458f-8e75-d36428d250c0", + "metadata": {}, + "source": [ + "## Docking with Ledock" + ] + }, + { + "cell_type": "markdown", + "id": "e92d1738-3440-4d0e-8ae4-24f6fd50ffcf", + "metadata": {}, + "source": [ + "### Box definition" + ] + }, + { + "cell_type": "code", + "execution_count": 22, + "id": "13e73174-9e26-424d-8abd-b8df3f37fc16", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": 
"stream", + "text": [ + "{'minX': 19.57430076599121, 'maxX': 44.143798828125} \n", + " {'minY': 4.285799980163574, 'maxY': 22.409099578857422} \n", + " {'minZ': 8.378700256347656, 'maxZ': 25.75309944152832}\n" + ] + } + ], + "source": [ + "cmd.load(filename='1AZ8_clean_H.pdb',format='pdb',object='prot')\n", + "cmd.load(filename='1AZ8_lig_H.mol2',format='mol2',object='lig')\n", + "\n", + "X,Y,Z= getbox(selection='lig',extending=5.0,software='ledock')\n", + "cmd.delete('all')\n", + "\n", + "print(X,'\\n',Y,'\\n',Z)" + ] + }, + { + "cell_type": "markdown", + "id": "3a2bd5f7-1325-426b-be5d-db29a992ab6b", + "metadata": {}, + "source": [ + "### Ledock parameters" + ] + }, + { + "cell_type": "markdown", + "id": "f03fb247-097b-4914-96cd-14f03d46b5ca", + "metadata": {}, + "source": [ + "**_generate_ledock_file( params )_**\n", + "\n", + "params:\n", + "\n", + "- **receptor**: _str or path-like string_ ; protein file for docking including hydrogens, format must be pdb \n", + "- **x**: _2 element list of floats [ float , float ]_; Xmin and Xmax coordinates of docking box\n", + "- **y**: _2 element list of floats [ float , float ]_; Ymin and Ymax coordinates of docking box\n", + "- **z**: _2 element list of floats [ float , float ]_; Zmin and Zmax coordinates of docking box\n", + "- **n_poses**: _float_ ; number of poses to retrieve from docking \n", + "- **rmsd**: _float_ ; minimum RMSD difference between docking poses \n", + "- **l_list**: _list of n strings or path-like strings [ lig1, lig2, lig3 ...
]_ ; list of ligands or ligand paths to dock \n", + "- **l_list_outfile**: _str or path-like string_ ; filename to save the ligand list, needed for ledock to locate ligands\n", + "- **out**: _str or path-like string_ ; output file for the docking parameters, needed to launch the docking" + ] + }, + { + "cell_type": "code", + "execution_count": 23, + "id": "21078e78-8772-4b50-83c1-3dcd7381b819", + "metadata": {}, + "outputs": [], + "source": [ + "generate_ledock_file(receptor='1AZ8_clean_H.pdb',x=[X['minX'],X['maxX']],\n", + " y=[Y['minY'],Y['maxY']],\n", + " z=[Z['minZ'],Z['maxZ']],\n", + " n_poses=10,\n", + " rmsd=1.0,\n", + " l_list='1AZ8_lig_H.mol2', \n", + " l_list_outfile='ledock_ligand.list1',\n", + " out='dock.in')" + ] + }, + { + "cell_type": "markdown", + "id": "8f724cc7-32a1-47a2-813e-09f5baee3a07", + "metadata": {}, + "source": [ + "### Docking" + ] + }, + { + "cell_type": "code", + "execution_count": 24, + "id": "bc334f42-0d13-4e98-a24f-698adc1d5da9", + "metadata": {}, + "outputs": [], + "source": [ + "!../../bin/ledock_linux_x86 dock.in" + ] + }, + { + "cell_type": "markdown", + "id": "9895b690-5593-45b4-9a6c-1d1ef6a54987", + "metadata": {}, + "source": [ + "### File conversion from DOK to SDF" + ] + }, + { + "cell_type": "markdown", + "id": "d8e49032-3801-43da-afb6-ef7398fe82f9", + "metadata": {}, + "source": [ + "**_dok_to_sdf ( params )_**\n", + "\n", + "params:\n", + "\n", + " - **dok_file**: _str or path-like_ ; dok file from ledock docking\n", + "\n", + " - **output**: _str or path-like_ ; output file for the converted poses, extension must be sdf" + ] + }, + { + "cell_type": "code", + "execution_count": 25, + "id": "73878c34-8346-49ef-a6c7-44733115510f", + "metadata": {}, + "outputs": [], + "source": [ + "dok_to_sdf(dok_file='1AZ8_lig_H.dok',output='1AZ8_lig_ledock_out.sdf')" + ] + }, + { + "cell_type": "markdown", + "id": "48e6dad1-078a-4579-b72e-22c1cf1e48ed", + "metadata": {}, + "source": [ + "### Docking poses visualization" + ] + }, + { +
"cell_type": "code", + "execution_count": 26, + "id": "97fc294d-1d1c-440d-ad35-da4de5d098d1", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " Cluster 1 of Poses: 2 Score: -9.03 kcal/mol\n" + ] + }, + { + "data": { + "application/3dmoljs_load.v0": "
\n


\n
\n", + "text/html": [ + "
\n", + "

\n", + "

\n", + "
\n", + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "view = py3Dmol.view()\n", + "view.removeAllModels()\n", + "view.setViewStyle({'style':'outline','color':'black','width':0.1})\n", + "\n", + "view.addModel(open('1AZ8_clean_H.pdb','r').read(),format='pdb')\n", + "Prot=view.getModel()\n", + "Prot.setStyle({'cartoon':{'arrows':True, 'tubes':True, 'style':'oval', 'color':'white'}})\n", + "view.addSurface(py3Dmol.VDW,{'opacity':0.6,'color':'white'})\n", + "\n", + "\n", + "view.addModel(open('1AZ8_lig_H.mol2','r').read(),format='mol2')\n", + "ref_m = view.getModel()\n", + "ref_m.setStyle({},{'stick':{'colorscheme':'greenCarbon','radius':0.2}})\n", + "\n", + "results=Chem.SDMolSupplier('1AZ8_lig_ledock_out.sdf')\n", + "\n", + "\n", + "p=Chem.MolToMolBlock(results[0])\n", + "print (results[0].GetProp('REMARK'))\n", + "\n", + "view.addModel(p,'mol')\n", + "x = view.getModel()\n", + "x.setStyle({},{'stick':{'colorscheme':'cyanCarbon','radius':0.2}})\n", + "\n", + "view.zoomTo()\n", + "view.show()" + ] + }, + { + "cell_type": "markdown", + "id": "b1548780-48ac-4843-be7e-29ab8de33726", + "metadata": {}, + "source": [ + "### Molecular interactions" + ] + }, + { + "cell_type": "code", + "execution_count": 27, + "id": "c8c42b93-ab73-4fda-817f-373289df549f", + "metadata": {}, + "outputs": [ + { + "data": { + "application/vnd.jupyter.widget-view+json": { + "model_id": "1201edfad8594ac19f02c9140d9ce657", + "version_major": 2, + "version_minor": 0 + }, + "text/plain": [ + " 0%| | 0/8 [00:00\n", + "\n", + "\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " 
\n", + "

8 rows × 33 columns

\n", + "" + ], + "text/plain": [ + "ligand UNL1 \\\n", + "protein ASN143.A ASP189.A \n", + "interaction HBAcceptor Hydrophobic HBDonor Hydrophobic \n", + "0 (3, 12) (21, 9) (None, None) (24, 9) \n", + "1 (1, 12) (None, None) (32, 11) (24, 9) \n", + "2 (3, 12) (21, 9) (32, 10) (24, 9) \n", + "3 (None, None) (None, None) (33, 11) (24, 9) \n", + "4 (3, 12) (21, 9) (None, None) (24, 9) \n", + "5 (3, 12) (None, None) (32, 11) (24, 9) \n", + "6 (None, None) (None, None) (32, 10) (24, 9) \n", + "7 (None, None) (None, None) (28, 10) (None, None) \n", + "\n", + "ligand \\\n", + "protein CYS191.A CYS220.A GLN192.A GLY148.A \n", + "interaction Hydrophobic Hydrophobic Hydrophobic HBDonor Hydrophobic \n", + "0 (0, 2) (7, 9) (4, 2) (28, 6) (21, 5) \n", + "1 (0, 2) (7, 9) (2, 2) (27, 6) (21, 5) \n", + "2 (0, 2) (7, 9) (2, 2) (28, 6) (21, 5) \n", + "3 (0, 2) (7, 9) (2, 2) (27, 6) (21, 5) \n", + "4 (0, 2) (7, 9) (2, 2) (28, 6) (21, 5) \n", + "5 (0, 2) (10, 9) (4, 2) (None, None) (None, None) \n", + "6 (0, 2) (7, 9) (2, 2) (28, 6) (21, 5) \n", + "7 (12, 2) (13, 9) (14, 2) (None, None) (None, None) \n", + "\n", + "ligand ... \\\n", + "protein GLY216.A ... SER214.A THR149.A \n", + "interaction HBAcceptor ... HBDonor Hydrophobic HBDonor \n", + "0 (9, 1) ... (None, None) (None, None) (None, None) \n", + "1 (6, 1) ... (None, None) (None, None) (None, None) \n", + "2 (6, 1) ... (None, None) (None, None) (None, None) \n", + "3 (None, None) ... (None, None) (13, 4) (None, None) \n", + "4 (6, 1) ... (None, None) (None, None) (28, 12) \n", + "5 (None, None) ... (None, None) (2, 4) (None, None) \n", + "6 (None, None) ... (None, None) (None, None) (None, None) \n", + "7 (None, None) ... 
(32, 5) (12, 4) (None, None) \n", + "\n", + "ligand \\\n", + "protein TRP215.A TYR228.A \n", + "interaction HBAcceptor HBDonor Hydrophobic HBDonor \n", + "0 (None, None) (None, None) (0, 2) (None, None) \n", + "1 (None, None) (None, None) (0, 2) (None, None) \n", + "2 (None, None) (None, None) (0, 4) (None, None) \n", + "3 (None, None) (None, None) (0, 2) (None, None) \n", + "4 (None, None) (None, None) (0, 2) (None, None) \n", + "5 (None, None) (None, None) (0, 2) (None, None) \n", + "6 (None, None) (None, None) (0, 4) (None, None) \n", + "7 (3, 1) (30, 0) (0, 2) (27, 19) \n", + "\n", + "ligand \n", + "protein VAL213.A VAL227.A \n", + "interaction Hydrophobic Hydrophobic HBAcceptor \n", + "0 (None, None) (0, 8) (None, None) \n", + "1 (None, None) (0, 8) (None, None) \n", + "2 (None, None) (0, 8) (None, None) \n", + "3 (None, None) (0, 8) (None, None) \n", + "4 (None, None) (0, 8) (None, None) \n", + "5 (None, None) (0, 8) (None, None) \n", + "6 (None, None) (0, 8) (None, None) \n", + "7 (21, 14) (12, 8) (3, 1) \n", + "\n", + "[8 rows x 33 columns]" + ] + }, + "execution_count": 27, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "# load protein\n", + "prot = mda.Universe(\"1AZ8_clean_H_fix.pdb\",guess_bonds=True)\n", + "prot = plf.Molecule.from_mda(prot)\n", + "prot.n_residues\n", + "\n", + "# load ligands\n", + "path = str('1AZ8_lig_ledock_out.sdf')\n", + "lig_suppl = list(plf.sdf_supplier(path))\n", + "# generate fingerprint\n", + "fp = plf.Fingerprint()\n", + "fp.run_from_iterable(lig_suppl, prot)\n", + "results_df = fp.to_dataframe(return_atoms=True)\n", + "results_df" + ] + }, + { + "cell_type": "code", + "execution_count": 28, + "id": "36fd7b17-6821-4b47-a6c0-7c1c5568dc11", + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "" + ], + "text/plain": [ + "" + ] + }, + "execution_count": 28, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "net = 
LigNetwork.from_ifp(results_df,lig_suppl[0],kind=\"frame\", frame=0,rotation=270)\n", + "net.display()" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "AnalysisMD", + "language": "python", + "name": "analysismd" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.7.10" + } + }, + "nbformat": 4, + "nbformat_minor": 5 +} diff --git a/.ipynb_checkpoints/3.-Blind_Docking-checkpoint.ipynb b/.ipynb_checkpoints/3.-Blind_Docking-checkpoint.ipynb new file mode 100644 index 00000000..0ee691a5 --- /dev/null +++ b/.ipynb_checkpoints/3.-Blind_Docking-checkpoint.ipynb @@ -0,0 +1,2970 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "id": "ced177c9-a5d4-432e-bca7-cbc33216ad2d", + "metadata": {}, + "source": [ + "# Blind Docking" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "id": "603c32cc-aee1-40ab-8942-ceb386c3d7ba", + "metadata": {}, + "outputs": [], + "source": [ + "from pymol import cmd\n", + "import py3Dmol\n", + "\n", + "from vina import Vina\n", + "\n", + "import pandas as pd\n", + "import numpy as np\n", + "\n", + "from openbabel import pybel\n", + "\n", + "from rdkit import Chem\n", + "from rdkit.Chem import AllChem,rdFMCS, Draw\n", + "\n", + "from meeko import MoleculePreparation\n", + "from meeko import obutils\n", + "\n", + "import MDAnalysis as mda\n", + "from MDAnalysis.coordinates import PDB\n", + "\n", + "import prolif as plf\n", + "from prolif.plotting.network import LigNetwork\n", + "\n", + "\n", + "import sys, os, random, time\n", + "sys.path.insert(1, 'utilities/')\n", + "\n", + "from multiprocessing import Pool\n", + "\n", + "from utils import fix_protein, getbox, generate_ledock_file, dok_to_sdf\n", + "\n", + "import warnings\n", + "warnings.filterwarnings(\"ignore\")\n", + "\n", + "%config Completer.use_jedi = False" + ] + }, 
+ { + "cell_type": "markdown", + "id": "34311746-4190-49a7-a32b-f4e55cad33d8", + "metadata": {}, + "source": [ + "### Setting the working directory" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "id": "dd617dfe-f742-424d-b19f-54c06ec0ac9c", + "metadata": {}, + "outputs": [], + "source": [ + "os.chdir('test/Blind_Docking/')" + ] + }, + { + "cell_type": "markdown", + "id": "6331c30a-61a9-427a-94a6-cec3c5202237", + "metadata": {}, + "source": [ + "### Loading the system from PDB" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "id": "25cb0479-7f2f-465a-a442-16bb132e08b3", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + " PyMOL not running, entering library mode (experimental)\n" + ] + } + ], + "source": [ + "cmd.fetch(code='1XOZ',type='pdb1')\n", + "cmd.select(name='Prot',selection='polymer.protein')\n", + "cmd.select(name='Lig',selection='organic')\n", + "cmd.save(filename='1XOZ_clean.pdb',format='pdb',selection='Prot')\n", + "cmd.save(filename='1XOZ_lig.mol2',format='mol2',selection='Lig')\n", + "cmd.delete('all')" + ] + }, + { + "cell_type": "markdown", + "id": "d6b23a3f-0e0b-4d3f-a56c-0b9d3f6fffa7", + "metadata": {}, + "source": [ + "### Protein sanitization" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "id": "be753050-0e76-4a8c-91f2-4a3f4056c5ed", + "metadata": {}, + "outputs": [], + "source": [ + "fix_protein(filename='1XOZ_clean.pdb',addHs_pH=7.4,try_renumberResidues=True,output='1XOZ_clean_H.pdb')" + ] + }, + { + "cell_type": "markdown", + "id": "85421ef6-7ee5-439a-abaa-f1d5c80f3265", + "metadata": {}, + "source": [ + "### Ligand sanitization" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "id": "f1359216-c0f1-454d-b876-196223873e2b", + "metadata": {}, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + "RDKit WARNING: [14:26:55] 1XOZ: Warning - no explicit hydrogens in mol2 file but needed for formal charge 
estimation.\n", + "RDKit ERROR: [14:26:55] Can't kekulize mol. Unkekulized atoms: 0 1 2 3 4 5 6 7 8\n", + "RDKit ERROR: \n" + ] + }, + { + "data": { + "image/png": "\n", + "text/plain": [ + "" + ] + }, + "execution_count": 5, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "m=Chem.MolFromMol2File('1XOZ_lig.mol2',False)\n", + "Draw.MolToImage(m)" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "id": "0ed91920-275e-448b-ac7b-9c842ebac458", + "metadata": {}, + "outputs": [], + "source": [ + "mol= [m for m in pybel.readfile(filename='1XOZ_lig.mol2',format='mol2')][0]\n", + "mol.addh()\n", + "out=pybel.Outputfile(filename='1XOZ_lig_H.mol2',format='mol2',overwrite=True)\n", + "out.write(mol)\n", + "out.close()" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "id": "6c3a3a3d-b14a-488c-aa20-b81eda23ff7e", + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "" + ] + }, + "execution_count": 7, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "m=Chem.MolFromMol2File('1XOZ_lig_H.mol2')\n", + "m" + ] + }, + { + "cell_type": "markdown", + "id": "da26704d-ed4f-4efc-a381-b8e7d083140d", + "metadata": {}, + "source": [ + "### System visualization" + ] + }, + { + "cell_type": "code", + "execution_count": 8, + "id": "e5eb26b4-e609-49d3-a4ae-28614d7a6431", + "metadata": {}, + "outputs": [ + { + "data": { + "application/3dmoljs_load.v0": "
\n


\n
\n", + "text/html": [ + "
\n", + "

\n", + "

\n", + "
\n", + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "view = py3Dmol.view()\n", + "view.removeAllModels()\n", + "view.setViewStyle({'style':'outline','color':'black','width':0.1})\n", + "\n", + "view.addModel(open('1XOZ_clean_H.pdb','r').read(),format='pdb')\n", + "Prot=view.getModel()\n", + "Prot.setStyle({'cartoon':{'arrows':True, 'tubes':True, 'style':'oval', 'color':'white'}})\n", + "view.addSurface(py3Dmol.VDW,{'opacity':0.6,'color':'white'})\n", + "\n", + "view.addModel(open('1XOZ_lig_H.mol2','r').read(),format='mol2')\n", + "ref_m = view.getModel()\n", + "ref_m.setStyle({},{'stick':{'colorscheme':'greenCarbon','radius':0.2}})\n", + "\n", + "view.zoomTo()\n", + "view.show()" + ] + }, + { + "cell_type": "markdown", + "id": "e421dda8-f402-4f40-8cf6-772e4e4ccc58", + "metadata": {}, + "source": [ + "## Molecular docking" + ] + }, + { + "cell_type": "markdown", + "id": "ccef7d83-a108-4f3d-8d62-f9438c1ef5bc", + "metadata": {}, + "source": [ + "### Protein preparation" + ] + }, + { + "cell_type": "code", + "execution_count": 9, + "id": "c01457d4-c3be-4c38-867b-8e44ae226a60", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "set verbose to True\n", + "set receptor_filename to 1XOZ_clean_H.pdb\n", + "set repairs to hydrogens\n", + "set outputfilename to 1XOZ_clean_H.pdbqt\n", + "read 1XOZ_clean_H.pdb\n", + "setting up RPO with mode= automatic and outputfilename= 1XOZ_clean_H.pdbqt\n", + "charges_to_add= gasteiger\n", + "delete_single_nonstd_residues= None\n", + "adding gasteiger charges to peptide\n", + "Sorry, there are no Gasteiger parameters available for atom 1XOZ_clean_H:A:GLU858:OXT\n", + "Warning: hydrogens, ['HG1', 'HG', 'HG', 'HG1', 'HG1', 'HG', 'HG', 'HG', 'HG1', 'HG1', 'HG1', 'HG', 'HZ2', 'HH', 'HH', 'HG1', 'HG1', 'HG', 'HG', 'HG1', 'HG', 'HH', 'HG', 'HG', 'HG', 'HG', 'HH', 'HG1', 'HG1', 'HG1', 'HH', 'HG1', 'HG', 'HG1', 'HG1', 'HG1', 'HG', 'HH', 'HG1', 'HG'] , with no 
bonds!\n" + ] + } + ], + "source": [ + "!../../bin/prepare_receptor -v -r 1XOZ_clean_H.pdb -A hydrogens -o 1XOZ_clean_H.pdbqt" + ] + }, + { + "cell_type": "markdown", + "id": "2f42890b-eaa0-4358-9fcf-eb982ae3b07a", + "metadata": {}, + "source": [ + "### Ligand preparation" + ] + }, + { + "cell_type": "markdown", + "id": "e695a701-c83f-492e-9e8c-8ac98dfabc57", + "metadata": {}, + "source": [ + "#### Method 2: pybel" + ] + }, + { + "cell_type": "code", + "execution_count": 10, + "id": "dd98d0ed-123d-4b96-a79e-fee1b697e933", + "metadata": {}, + "outputs": [], + "source": [ + "ligand = [m for m in pybel.readfile(filename='1XOZ_lig_H.mol2',format='mol2')][0]\n", + "out=pybel.Outputfile(filename='1XOZ_lig_H.pdbqt',format='pdbqt',overwrite=True)\n", + "ligand.addh()\n", + "out.write(ligand)\n", + "out.close()" + ] + }, + { + "cell_type": "markdown", + "id": "123ce17b-e006-429d-a9eb-a7a697e55c60", + "metadata": {}, + "source": [ + "### Protein pockets identification" + ] + }, + { + "cell_type": "code", + "execution_count": 11, + "id": "06a66393-f78e-4b4f-b72d-67456ec25e33", + "metadata": {}, + "outputs": [], + "source": [ + "!../../bin/fpocket -f 1XOZ_clean_H.pdb -d > pocket_descriptors.csv" + ] + }, + { + "cell_type": "code", + "execution_count": 12, + "id": "ef2448a5-3395-4f51-93df-beb806b4c847", + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "
\n", + "\n", + "\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " 
\n", + "

22 rows × 46 columns

\n", + "
" + ], + "text/plain": [ + " drug_score volume nb_asph inter_chain apol_asph_proportion \\\n", + "cav_id \n", + "1 0.8741 1053.1558 139 0 0.6331 \n", + "2 0.0452 589.6716 72 0 0.3333 \n", + "3 0.0003 517.6820 37 0 0.3243 \n", + "4 0.0021 390.7529 30 0 0.4667 \n", + "5 0.0009 111.8961 23 0 0.3913 \n", + "6 0.0046 228.2273 34 0 0.4706 \n", + "7 0.0005 223.6712 16 0 0.6250 \n", + "8 0.0004 219.3781 15 0 0.2667 \n", + "9 0.0012 265.2279 33 0 0.6061 \n", + "10 0.0030 634.7429 52 0 0.2500 \n", + "11 0.0017 190.2936 26 0 0.6538 \n", + "12 0.0014 1071.0054 107 0 0.3645 \n", + "13 0.0009 354.0423 33 0 0.5758 \n", + "14 0.0002 305.8420 22 0 0.1818 \n", + "15 0.0003 409.8923 34 0 0.3529 \n", + "16 0.0001 292.2764 16 0 0.1875 \n", + "17 0.0012 203.9455 20 0 0.7500 \n", + "18 0.0008 329.0362 16 0 0.4375 \n", + "19 0.0001 276.7321 16 0 0.3750 \n", + "20 0.0000 246.7497 17 0 0.0588 \n", + "21 0.0002 305.7766 22 0 0.1364 \n", + "22 0.0004 405.1931 24 0 0.2917 \n", + "\n", + " mean_asph_radius as_density mean_asph_solv_acc mean_loc_hyd_dens \\\n", + "cav_id \n", + "1 4.0200 6.8663 0.4647 60.2955 \n", + "2 3.7437 5.5621 0.4716 19.5833 \n", + "3 4.0605 5.1642 0.5894 10.6667 \n", + "4 3.9872 4.2452 0.6120 11.7143 \n", + "5 3.5100 2.0155 0.3504 8.0000 \n", + "6 3.8398 3.2787 0.4623 15.0000 \n", + "7 3.9238 2.4631 0.6783 9.0000 \n", + "8 3.9367 2.9664 0.5179 3.0000 \n", + "9 4.1402 2.1593 0.6400 19.0000 \n", + "10 3.8994 5.4225 0.5173 10.1538 \n", + "11 3.7947 2.5979 0.6718 16.0000 \n", + "12 3.9622 8.3226 0.5096 21.0769 \n", + "13 3.8339 3.4003 0.4685 18.0000 \n", + "14 3.6869 2.9155 0.3809 3.0000 \n", + "15 3.9346 3.8391 0.5044 10.3333 \n", + "16 3.8434 2.9985 0.5507 2.0000 \n", + "17 3.8541 1.6062 0.4941 14.0000 \n", + "18 3.8732 3.4750 0.4447 6.0000 \n", + "19 3.9213 2.6588 0.6055 5.0000 \n", + "20 3.9535 2.8521 0.5774 0.0000 \n", + "21 3.8877 2.4383 0.4918 2.0000 \n", + "22 4.1979 3.5434 0.5977 6.0000 \n", + "\n", + " flex ... 
val trp tyr chain_1_type chain_2_type num_res_chain_1 \\\n", + "cav_id ... \n", + "1 0.0 ... 0 1 2 0 0 324 \n", + "2 0.0 ... 0 1 0 0 0 324 \n", + "3 0.0 ... 0 0 1 0 0 324 \n", + "4 0.0 ... 0 0 0 0 0 324 \n", + "5 0.0 ... 0 0 0 0 0 324 \n", + "6 0.0 ... 0 1 0 0 0 324 \n", + "7 0.0 ... 0 0 0 0 0 324 \n", + "8 0.0 ... 0 1 0 0 0 324 \n", + "9 0.0 ... 0 0 0 0 0 324 \n", + "10 0.0 ... 1 0 0 0 0 324 \n", + "11 0.0 ... 0 0 0 0 0 324 \n", + "12 0.0 ... 1 1 0 0 0 324 \n", + "13 0.0 ... 0 0 0 0 0 324 \n", + "14 0.0 ... 0 0 1 0 0 324 \n", + "15 0.0 ... 0 0 1 0 0 324 \n", + "16 0.0 ... 0 0 0 0 0 324 \n", + "17 0.0 ... 1 0 0 0 0 324 \n", + "18 0.0 ... 0 0 0 0 0 324 \n", + "19 0.0 ... 0 0 0 0 0 324 \n", + "20 0.0 ... 0 0 0 0 0 324 \n", + "21 0.0 ... 0 0 0 0 0 324 \n", + "22 0.0 ... 0 0 0 0 0 324 \n", + "\n", + " num_res_chain_2 lig_het_tag name_chain_1 name_chain_2 \n", + "cav_id \n", + "1 324 NaN A A \n", + "2 324 NaN A A \n", + "3 324 NaN A A \n", + "4 324 NaN A A \n", + "5 324 NaN A A \n", + "6 324 NaN A A \n", + "7 324 NaN A A \n", + "8 324 NaN A A \n", + "9 324 NaN A A \n", + "10 324 NaN A A \n", + "11 324 NaN A A \n", + "12 324 NaN A A \n", + "13 324 NaN A A \n", + "14 324 NaN A A \n", + "15 324 NaN A A \n", + "16 324 NaN A A \n", + "17 324 NaN A A \n", + "18 324 NaN A A \n", + "19 324 NaN A A \n", + "20 324 NaN A A \n", + "21 324 NaN A A \n", + "22 324 NaN A A \n", + "\n", + "[22 rows x 46 columns]" + ] + }, + "execution_count": 12, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "pockets_data=pd.read_csv('pocket_descriptors.csv',sep=' ',index_col=[0])\n", + "pockets_data" + ] + }, + { + "cell_type": "markdown", + "id": "c1811211-650d-4c49-aed0-8fe0a6e7af7f", + "metadata": {}, + "source": [ + "### Pockets visualization" + ] + }, + { + "cell_type": "code", + "execution_count": 13, + "id": "36609e74-92ca-47ef-8baa-6be9bd5e9c93", + "metadata": {}, + "outputs": [ + { + "data": { + "application/3dmoljs_load.v0": "
\n


\n
\n", + "text/html": [ + "
\n", + "

\n", + "

\n", + "
\n", + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "view = py3Dmol.view()\n", + "view.removeAllModels()\n", + "view.setViewStyle({'style':'outline','color':'black','width':0.1})\n", + "\n", + "view.addModel(open('1XOZ_clean_H.pdb','r').read(),'pdb')\n", + "Prot=view.getModel()\n", + "Prot.setStyle({'cartoon':{'arrows':True, 'tubes':True, 'style':'oval', 'color':'white'}})\n", + "#view.addSurface(py3Dmol.VDW,{'opacity':0.6,'color':'white'})\n", + "\n", + "\n", + "view.addModel(open('1XOZ_lig_H.mol2','r').read(),'mol2')\n", + "ref_m = view.getModel()\n", + "ref_m.setStyle({},{'stick':{'colorscheme':'greenCarbon','radius':0.1}})\n", + "\n", + "\n", + "for file in os.listdir(path='1XOZ_clean_H_out/'):\n", + "    if '.pqr' in file:\n", + "        color = [\"#\"+''.join([random.choice('0123456789ABCDEF') for j in range(6)])]\n", + "        view.addModel(open('1XOZ_clean_H_out/'+file,'r').read(),'pqr')\n", + "        x = view.getModel()\n", + "        x.setStyle({},{'sphere':{'color':color[0],'opacity':0.6}}) \n", + "        \n", + "view.zoomTo()\n", + "view.show()" + ] + }, + { + "cell_type": "markdown", + "id": "108eacd9-bca4-4294-b85a-3e92a450e17b", + "metadata": {}, + "source": [ + "### Per-pocket docking box setup (vina)" + ] + }, + { + "cell_type": "markdown", + "id": "023a2e49-522a-4c05-981f-79bd399a10e5", + "metadata": {}, + "source": [ + "Hint: Consider extending the pocket box by 4-5 angstroms ..." 
+ ] + }, + { + "cell_type": "code", + "execution_count": 14, + "id": "51aeefdc-cdc0-49c3-aeb9-bbea41fbb158", + "metadata": {}, + "outputs": [], + "source": [ + "for file in os.listdir('1XOZ_clean_H_out/'):\n", + " if 'pqr' in file:\n", + " pocket_num=int(file.split('_')[0].replace('pocket',''))\n", + " cmd.load(filename='1XOZ_clean_H_out/'+file,format='pqr',object=pocket_num)\n", + " \n", + " center,size=getbox(selection=pocket_num,extending=5.0,software='vina')\n", + " \n", + " pockets_data.loc[pocket_num,'center_x']=center['center_x']\n", + " pockets_data.loc[pocket_num,'center_y']=center['center_y']\n", + " pockets_data.loc[pocket_num,'center_z']=center['center_z']\n", + " \n", + " pockets_data.loc[pocket_num,'size_x']=size['size_x']\n", + " pockets_data.loc[pocket_num,'size_y']=size['size_y']\n", + " pockets_data.loc[pocket_num,'size_z']=size['size_z']\n", + " \n", + " cmd.delete('all')" + ] + }, + { + "cell_type": "code", + "execution_count": 15, + "id": "1d43de75-8369-4921-86b8-0237d740da96", + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "
\n", + "\n", + "\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " 
\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + "
drug_score  volume  nb_asph  inter_chain  apol_asph_proportion  mean_asph_radius  as_density  mean_asph_solv_acc  mean_loc_hyd_dens  flex  ...  num_res_chain_2  lig_het_tag  name_chain_1  name_chain_2  center_x  center_y  center_z  size_x  size_y  size_z
cav_id
1  0.8741  1053.1558  139  0  0.6331  4.0200  6.8663  0.4647  60.2955  0.0  ...  324  NaN  A  A  47.153002  36.187000  15.229000  20.326000  18.056000  24.312000
2  0.0452  589.6716  72  0  0.3333  3.7437  5.5621  0.4716  19.5833  0.0  ...  324  NaN  A  A  39.917500  46.822001  12.002500  18.320999  23.436001  15.865001
3  0.0003  517.6820  37  0  0.3243  4.0605  5.1642  0.5894  10.6667  0.0  ...  324  NaN  A  A  24.391001  19.912000  19.753000  18.076000  18.219999  19.172001
4  0.0021  390.7529  30  0  0.4667  3.9872  4.2452  0.6120  11.7143  0.0  ...  324  NaN  A  A  26.427000  30.581500  35.598000  17.528000  16.167002  16.634003
5  0.0009  111.8961  23  0  0.3913  3.5100  2.0155  0.3504  8.0000  0.0  ...  324  NaN  A  A  33.264500  32.007500  29.854000  13.457001  13.223000  11.883999
6  0.0046  228.2273  34  0  0.4706  3.8398  3.2787  0.4623  15.0000  0.0  ...  324  NaN  A  A  38.223499  29.394000  29.178499  11.913002  12.837999  17.578999
7  0.0005  223.6712  16  0  0.6250  3.9238  2.4631  0.6783  9.0000  0.0  ...  324  NaN  A  A  35.526501  15.045000  26.638000  15.500999  11.122000  12.118000
8  0.0004  219.3781  15  0  0.2667  3.9367  2.9664  0.5179  3.0000  0.0  ...  324  NaN  A  A  28.457500  48.134499  40.513498  14.118999  13.151001  14.659000
9  0.0012  265.2279  33  0  0.6061  4.1402  2.1593  0.6400  19.0000  0.0  ...  324  NaN  A  A  33.537000  17.919500  15.523000  14.968000  13.763000  13.890000
10  0.0030  634.7429  52  0  0.2500  3.8994  5.4225  0.5173  10.1538  0.0  ...  324  NaN  A  A  29.666999  36.083000  3.642500  24.199999  15.599998  19.619000
\n", + "

10 rows × 52 columns

\n", + "
" + ], + "text/plain": [ + " drug_score volume nb_asph inter_chain apol_asph_proportion \\\n", + "cav_id \n", + "1 0.8741 1053.1558 139 0 0.6331 \n", + "2 0.0452 589.6716 72 0 0.3333 \n", + "3 0.0003 517.6820 37 0 0.3243 \n", + "4 0.0021 390.7529 30 0 0.4667 \n", + "5 0.0009 111.8961 23 0 0.3913 \n", + "6 0.0046 228.2273 34 0 0.4706 \n", + "7 0.0005 223.6712 16 0 0.6250 \n", + "8 0.0004 219.3781 15 0 0.2667 \n", + "9 0.0012 265.2279 33 0 0.6061 \n", + "10 0.0030 634.7429 52 0 0.2500 \n", + "\n", + " mean_asph_radius as_density mean_asph_solv_acc mean_loc_hyd_dens \\\n", + "cav_id \n", + "1 4.0200 6.8663 0.4647 60.2955 \n", + "2 3.7437 5.5621 0.4716 19.5833 \n", + "3 4.0605 5.1642 0.5894 10.6667 \n", + "4 3.9872 4.2452 0.6120 11.7143 \n", + "5 3.5100 2.0155 0.3504 8.0000 \n", + "6 3.8398 3.2787 0.4623 15.0000 \n", + "7 3.9238 2.4631 0.6783 9.0000 \n", + "8 3.9367 2.9664 0.5179 3.0000 \n", + "9 4.1402 2.1593 0.6400 19.0000 \n", + "10 3.8994 5.4225 0.5173 10.1538 \n", + "\n", + " flex ... num_res_chain_2 lig_het_tag name_chain_1 name_chain_2 \\\n", + "cav_id ... \n", + "1 0.0 ... 324 NaN A A \n", + "2 0.0 ... 324 NaN A A \n", + "3 0.0 ... 324 NaN A A \n", + "4 0.0 ... 324 NaN A A \n", + "5 0.0 ... 324 NaN A A \n", + "6 0.0 ... 324 NaN A A \n", + "7 0.0 ... 324 NaN A A \n", + "8 0.0 ... 324 NaN A A \n", + "9 0.0 ... 324 NaN A A \n", + "10 0.0 ... 
324 NaN A A \n", + "\n", + " center_x center_y center_z size_x size_y size_z \n", + "cav_id \n", + "1 47.153002 36.187000 15.229000 20.326000 18.056000 24.312000 \n", + "2 39.917500 46.822001 12.002500 18.320999 23.436001 15.865001 \n", + "3 24.391001 19.912000 19.753000 18.076000 18.219999 19.172001 \n", + "4 26.427000 30.581500 35.598000 17.528000 16.167002 16.634003 \n", + "5 33.264500 32.007500 29.854000 13.457001 13.223000 11.883999 \n", + "6 38.223499 29.394000 29.178499 11.913002 12.837999 17.578999 \n", + "7 35.526501 15.045000 26.638000 15.500999 11.122000 12.118000 \n", + "8 28.457500 48.134499 40.513498 14.118999 13.151001 14.659000 \n", + "9 33.537000 17.919500 15.523000 14.968000 13.763000 13.890000 \n", + "10 29.666999 36.083000 3.642500 24.199999 15.599998 19.619000 \n", + "\n", + "[10 rows x 52 columns]" + ] + }, + "execution_count": 15, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "pockets_data.head(10)" + ] + }, + { + "cell_type": "markdown", + "id": "67187e16-f988-4ecd-870f-ea767bb68b19", + "metadata": {}, + "source": [ + "### Docking with Vina" + ] + }, + { + "cell_type": "code", + "execution_count": 16, + "id": "56533b5b-875d-4864-b957-9508e10d421c", + "metadata": {}, + "outputs": [], + "source": [ + "def vina(receptor='',ligand='',center=[0,0,0],size=[0,0,0],exhaustiveness=8,n_poses=10,output=''):\n", + " v = Vina(sf_name='vina')\n", + "\n", + " v.set_receptor(receptor)\n", + "\n", + " v.set_ligand_from_file(ligand)\n", + "\n", + " v.compute_vina_maps(center=center, box_size=size)\n", + "\n", + " v.dock(exhaustiveness=exhaustiveness, n_poses=n_poses)\n", + " v.write_poses(output, n_poses=n_poses, overwrite=True)" + ] + }, + { + "cell_type": "code", + "execution_count": 17, + "id": "263ba8af-c41b-41ba-8ed1-ad977dff82bf", + "metadata": {}, + "outputs": [], + "source": [ + "for pocket in pockets_data.index:\n", + " vina(receptor='1XOZ_clean_H.pdbqt',ligand='1XOZ_lig_H.pdbqt',\n", + " 
center=[pockets_data.loc[pocket,'center_x'],pockets_data.loc[pocket,'center_y'],pockets_data.loc[pocket,'center_z']],\n", + " size=[pockets_data.loc[pocket,'size_x'],pockets_data.loc[pocket,'size_y'],pockets_data.loc[pocket,'size_z']],\n", + " exhaustiveness= 8,\n", + " n_poses=5,\n", + " output='vina_outfiles/'+'1XOZ_vina_pock_'+str(pocket)+'.pdbqt')" + ] + }, + { + "cell_type": "markdown", + "id": "04065a08-ae65-4556-b315-c4399ec16bdc", + "metadata": {}, + "source": [ + "#### File conversion from pdbqt to sdf" + ] + }, + { + "cell_type": "code", + "execution_count": 18, + "id": "ea10d91a-07cb-4eb3-be73-de22eb56cf6b", + "metadata": {}, + "outputs": [], + "source": [ + "for file in os.listdir('vina_outfiles/'):\n", + " if file.endswith('.pdbqt'):\n", + " results = [m for m in pybel.readfile(filename='vina_outfiles/'+file,format='pdbqt')]\n", + " out=pybel.Outputfile(filename='vina_outfiles/'+file.replace('pdbqt','sdf'),format='sdf',overwrite=True)\n", + " for pose in results:\n", + " pose.addh()\n", + " out.write(pose)\n", + " out.close()" + ] + }, + { + "cell_type": "code", + "execution_count": 19, + "id": "cf83212c-cab3-44c5-a586-de87c9611f2b", + "metadata": {}, + "outputs": [], + "source": [ + "all_mols=[]\n", + "for file in os.listdir('vina_outfiles/'):\n", + " if 'sdf' in file:\n", + " mols=Chem.SDMolSupplier('vina_outfiles/'+file)\n", + " for mol in mols:\n", + " all_mols.append(mol)" + ] + }, + { + "cell_type": "code", + "execution_count": 20, + "id": "56af894b-0a0c-4563-b560-e9f784cab4f5", + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "103" + ] + }, + "execution_count": 20, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "len(all_mols)" + ] + }, + { + "cell_type": "code", + "execution_count": 21, + "id": "cf2816fb-4484-4d6f-9d3a-f9e737e6aadd", + "metadata": {}, + "outputs": [ + { + "data": { + "application/3dmoljs_load.v0": "
\n


\n
\n", + "text/html": [ + "
\n", + "


\n", + "
\n", + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "view = py3Dmol.view()\n", + "view.removeAllModels()\n", + "view.setViewStyle({'style':'outline','color':'black','width':0.1})\n", + "\n", + "view.addModel(open('1XOZ_clean_H.pdb','r').read(),'pdb')\n", + "Prot=view.getModel()\n", + "Prot.setStyle({'cartoon':{'arrows':True, 'tubes':True, 'style':'oval', 'color':'white'}})\n", + "view.addSurface(py3Dmol.VDW,{'opacity':0.8,'color':'white'})\n", + "\n", + "\n", + "view.addModel(open('1XOZ_lig_H.mol2','r').read(),'mol2')\n", + "ref_m = view.getModel()\n", + "ref_m.setStyle({},{'stick':{'colorscheme':'greenCarbon','radius':0.1}})\n", + "\n", + "\n", + "for file in os.listdir(path='1XOZ_clean_H_out/'):\n", + " if '.pqr' in file:\n", + " color = [\"#\"+''.join([random.choice('0123456789ABCDEF') for j in range(6)])]\n", + " view.addModel(open('1XOZ_clean_H_out/'+file,'r').read(),'pqr')\n", + " x = view.getModel()\n", + " x.setStyle({},{'sphere':{'color':color[0],'opacity':0.5}})\n", + " \n", + "for mol in all_mols:\n", + " p=Chem.MolToMolBlock(mol)\n", + " color = [\"#\"+''.join([random.choice('0123456789ABCDEF') for j in range(6)])]\n", + " view.addModel(p,'mol')\n", + " z= view.getModel()\n", + " z.setStyle({},{'stick':{'color':color[0],'radius':0.05}})\n", + "\n", + "view.zoomTo()\n", + "view.show()" + ] + }, + { + "cell_type": "markdown", + "id": "2c8447ac-fe09-40e8-b179-50c5e09b0bd9", + "metadata": {}, + "source": [ + "## Docking with Ledock" + ] + }, + { + "cell_type": "markdown", + "id": "94e3a017-c0d2-4bb2-a972-782b9077e9d4", + "metadata": {}, + "source": [ + "### Per-pocket docking box set up (Ledock)" + ] + }, + { + "cell_type": "code", + "execution_count": 22, + "id": "25c0e695-4fd3-48b0-bf98-b2290dae77f6", + "metadata": {}, + "outputs": [], + "source": [ + "for file in os.listdir('1XOZ_clean_H_out/'):\n", + " if 'pqr' in file:\n", + " \n", + " pocket_num=int(file.split('_')[0].replace('pocket',''))\n", + " 
cmd.load(filename='1XOZ_clean_H_out/'+file,format='pqr',object=pocket_num)\n", + " \n", + " X,Y,Z=getbox(selection=pocket_num,extending=5.0,software='ledock')\n", + " \n", + " pockets_data.loc[pocket_num,'minX']=X['minX']\n", + " pockets_data.loc[pocket_num,'maxX']=X['maxX']\n", + " \n", + " pockets_data.loc[pocket_num,'minY']=Y['minY']\n", + " pockets_data.loc[pocket_num,'maxY']=Y['maxY']\n", + " \n", + " pockets_data.loc[pocket_num,'minZ']=Z['minZ']\n", + " pockets_data.loc[pocket_num,'maxZ']=Z['maxZ']\n", + " \n", + " cmd.delete('all')" + ] + }, + { + "cell_type": "code", + "execution_count": 23, + "id": "ea5f9e5f-2ab9-4416-bcbf-0060728f000e", + "metadata": {}, + "outputs": [ + { + "data": { + "text/html": [ + "
\n", + "\n", + "\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " 
\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + "
drug_score  volume  nb_asph  inter_chain  apol_asph_proportion  mean_asph_radius  as_density  mean_asph_solv_acc  mean_loc_hyd_dens  flex  ...  center_z  size_x  size_y  size_z  minX  maxX  minY  maxY  minZ  maxZ
cav_id
1  0.8741  1053.1558  139  0  0.6331  4.0200  6.8663  0.4647  60.2955  0.0  ...  15.229000  20.326000  18.056000  24.312000  36.990002  57.316002  27.159000  45.215000  3.073000  27.385000
2  0.0452  589.6716  72  0  0.3333  3.7437  5.5621  0.4716  19.5833  0.0  ...  12.002500  18.320999  23.436001  15.865001  30.757000  49.077999  35.104000  58.540001  4.070000  19.935000
3  0.0003  517.6820  37  0  0.3243  4.0605  5.1642  0.5894  10.6667  0.0  ...  19.753000  18.076000  18.219999  19.172001  15.353001  33.429001  10.802000  29.021999  10.167000  29.339001
4  0.0021  390.7529  30  0  0.4667  3.9872  4.2452  0.6120  11.7143  0.0  ...  35.598000  17.528000  16.167002  16.634003  17.663000  35.191000  22.497999  38.665001  27.280998  43.915001
5  0.0009  111.8961  23  0  0.3913  3.5100  2.0155  0.3504  8.0000  0.0  ...  29.854000  13.457001  13.223000  11.883999  26.535999  39.993000  25.396000  38.618999  23.912001  35.796000
6  0.0046  228.2273  34  0  0.4706  3.8398  3.2787  0.4623  15.0000  0.0  ...  29.178499  11.913002  12.837999  17.578999  32.266998  44.180000  22.975000  35.813000  20.389000  37.967999
7  0.0005  223.6712  16  0  0.6250  3.9238  2.4631  0.6783  9.0000  0.0  ...  26.638000  15.500999  11.122000  12.118000  27.776001  43.277000  9.484000  20.606000  20.579000  32.697001
8  0.0004  219.3781  15  0  0.2667  3.9367  2.9664  0.5179  3.0000  0.0  ...  40.513498  14.118999  13.151001  14.659000  21.398001  35.517000  41.558998  54.709999  33.183998  47.842999
9  0.0012  265.2279  33  0  0.6061  4.1402  2.1593  0.6400  19.0000  0.0  ...  15.523000  14.968000  13.763000  13.890000  26.052999  41.021000  11.038000  24.801001  8.578000  22.468000
10  0.0030  634.7429  52  0  0.2500  3.8994  5.4225  0.5173  10.1538  0.0  ...  3.642500  24.199999  15.599998  19.619000  17.566999  41.766998  28.283001  43.882999  -6.167000  13.452000
\n", + "

10 rows × 58 columns

\n", + "
" + ], + "text/plain": [ + " drug_score volume nb_asph inter_chain apol_asph_proportion \\\n", + "cav_id \n", + "1 0.8741 1053.1558 139 0 0.6331 \n", + "2 0.0452 589.6716 72 0 0.3333 \n", + "3 0.0003 517.6820 37 0 0.3243 \n", + "4 0.0021 390.7529 30 0 0.4667 \n", + "5 0.0009 111.8961 23 0 0.3913 \n", + "6 0.0046 228.2273 34 0 0.4706 \n", + "7 0.0005 223.6712 16 0 0.6250 \n", + "8 0.0004 219.3781 15 0 0.2667 \n", + "9 0.0012 265.2279 33 0 0.6061 \n", + "10 0.0030 634.7429 52 0 0.2500 \n", + "\n", + " mean_asph_radius as_density mean_asph_solv_acc mean_loc_hyd_dens \\\n", + "cav_id \n", + "1 4.0200 6.8663 0.4647 60.2955 \n", + "2 3.7437 5.5621 0.4716 19.5833 \n", + "3 4.0605 5.1642 0.5894 10.6667 \n", + "4 3.9872 4.2452 0.6120 11.7143 \n", + "5 3.5100 2.0155 0.3504 8.0000 \n", + "6 3.8398 3.2787 0.4623 15.0000 \n", + "7 3.9238 2.4631 0.6783 9.0000 \n", + "8 3.9367 2.9664 0.5179 3.0000 \n", + "9 4.1402 2.1593 0.6400 19.0000 \n", + "10 3.8994 5.4225 0.5173 10.1538 \n", + "\n", + " flex ... center_z size_x size_y size_z minX \\\n", + "cav_id ... \n", + "1 0.0 ... 15.229000 20.326000 18.056000 24.312000 36.990002 \n", + "2 0.0 ... 12.002500 18.320999 23.436001 15.865001 30.757000 \n", + "3 0.0 ... 19.753000 18.076000 18.219999 19.172001 15.353001 \n", + "4 0.0 ... 35.598000 17.528000 16.167002 16.634003 17.663000 \n", + "5 0.0 ... 29.854000 13.457001 13.223000 11.883999 26.535999 \n", + "6 0.0 ... 29.178499 11.913002 12.837999 17.578999 32.266998 \n", + "7 0.0 ... 26.638000 15.500999 11.122000 12.118000 27.776001 \n", + "8 0.0 ... 40.513498 14.118999 13.151001 14.659000 21.398001 \n", + "9 0.0 ... 15.523000 14.968000 13.763000 13.890000 26.052999 \n", + "10 0.0 ... 
3.642500 24.199999 15.599998 19.619000 17.566999 \n", + "\n", + " maxX minY maxY minZ maxZ \n", + "cav_id \n", + "1 57.316002 27.159000 45.215000 3.073000 27.385000 \n", + "2 49.077999 35.104000 58.540001 4.070000 19.935000 \n", + "3 33.429001 10.802000 29.021999 10.167000 29.339001 \n", + "4 35.191000 22.497999 38.665001 27.280998 43.915001 \n", + "5 39.993000 25.396000 38.618999 23.912001 35.796000 \n", + "6 44.180000 22.975000 35.813000 20.389000 37.967999 \n", + "7 43.277000 9.484000 20.606000 20.579000 32.697001 \n", + "8 35.517000 41.558998 54.709999 33.183998 47.842999 \n", + "9 41.021000 11.038000 24.801001 8.578000 22.468000 \n", + "10 41.766998 28.283001 43.882999 -6.167000 13.452000 \n", + "\n", + "[10 rows x 58 columns]" + ] + }, + "execution_count": 23, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "pockets_data.head(10)" + ] + }, + { + "cell_type": "code", + "execution_count": 24, + "id": "bd48e9b8-3160-42b0-a670-62b6d6340994", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ OXT--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ 
HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ OXT--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ OXT--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ H2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ H3--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ H2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ H3--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ OXT--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ H2--------------\n", + "------------Warning: Missing Parameters for Residue: GLU @ H3--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for 
Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n", + "------------Warning: Missing Parameters for Residue: HIS @ HE2--------------\n" + ] + } + ], + "source": [ + "for pocket in pockets_data.index:\n", + " generate_ledock_file(receptor='1XOZ_clean_H.pdb',\n", + " l_list='1XOZ_lig_H.mol2',\n", + " l_list_outfile='ligand.list',\n", + " x=[pockets_data.loc[pocket,'minX'],pockets_data.loc[pocket,'maxX']],\n", + " y=[pockets_data.loc[pocket,'minY'],pockets_data.loc[pocket,'maxY']],\n", + " z=[pockets_data.loc[pocket,'minZ'],pockets_data.loc[pocket,'maxZ']],\n", + " n_poses=5,\n", + " rmsd=1.0,\n", + " out='dock.in')\n", + " \n", + " !../../bin/ledock_linux_x86 dock.in\n", + " \n", + " os.rename('1XOZ_lig_H.dok','ledock_outfiles/1XOZ_ledock_pock_'+str(pocket)+'.dok')" + ] + }, + { + "cell_type": "code", + "execution_count": 25, + "id": "30c01787-075c-4bac-9fa2-97992ebbd6bd", + "metadata": {}, + "outputs": [], + "source": [ + "for file in os.listdir('ledock_outfiles/'):\n", + " if 'dok' in file:\n", + " dok_to_sdf(dok_file='ledock_outfiles/'+file,output='ledock_outfiles/'+file.replace('dok','sdf'))" + ] + }, + { + "cell_type": "code", + "execution_count": 26, + "id": "8595c3f8-70ea-487a-955b-e36d6a1e2d44", + "metadata": {}, + "outputs": [], + "source": [ + "all_mols=[]\n", + "for file in os.listdir('ledock_outfiles/'):\n", + " if 'sdf' in file:\n", + " mols=Chem.SDMolSupplier('ledock_outfiles/'+file)\n", + " for mol in mols:\n", + " all_mols.append(mol)" + ] + }, + { + "cell_type": "code", + "execution_count": 27, + "id": "47b683c2-dc51-4e4c-a987-cad59408b255", + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "75" + ] + }, + "execution_count": 27, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "len(all_mols)" + ] + }, + { + "cell_type": "code", + "execution_count": 28, + "id": 
"8423aa59-690e-4778-9943-e0b6aa722514", + "metadata": {}, + "outputs": [ + { + "data": { + "application/3dmoljs_load.v0": "
\n


\n
\n", + "text/html": [ + "
\n", + "


\n", + "
\n", + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "view = py3Dmol.view()\n", + "view.removeAllModels()\n", + "view.setViewStyle({'style':'outline','color':'black','width':0.1})\n", + "\n", + "view.addModel(open('1XOZ_clean_H.pdb','r').read(),'pdb')\n", + "Prot=view.getModel()\n", + "Prot.setStyle({'cartoon':{'arrows':True, 'tubes':True, 'style':'oval', 'color':'white'}})\n", + "view.addSurface(py3Dmol.VDW,{'opacity':0.8,'color':'white'})\n", + "\n", + "\n", + "view.addModel(open('1XOZ_lig_H.mol2','r').read(),'mol2')\n", + "ref_m = view.getModel()\n", + "ref_m.setStyle({},{'stick':{'colorscheme':'greenCarbon','radius':0.1}})\n", + "\n", + "\n", + "for file in os.listdir(path='1XOZ_clean_H_out/'):\n", + " if '.pqr' in file:\n", + " color = [\"#\"+''.join([random.choice('0123456789ABCDEF') for j in range(6)])]\n", + " view.addModel(open('1XOZ_clean_H_out/'+file,'r').read(),'pqr')\n", + " x = view.getModel()\n", + " x.setStyle({},{'sphere':{'color':color[0],'opacity':0.5}})\n", + " \n", + "for mol in all_mols:\n", + " p=Chem.MolToMolBlock(mol)\n", + " color = [\"#\"+''.join([random.choice('0123456789ABCDEF') for j in range(6)])]\n", + " view.addModel(p,'mol')\n", + " z= view.getModel()\n", + " z.setStyle({},{'stick':{'color':color[0],'radius':0.05}})\n", + "\n", + "view.zoomTo()\n", + "view.show()" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "d66076e0-28bd-4c43-b532-b8a2554f79c6", + "metadata": {}, + "outputs": [], + "source": [] + } + ], + "metadata": { + "kernelspec": { + "display_name": "AnalysisMD", + "language": "python", + "name": "analysismd" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.7.10" + } + }, + "nbformat": 4, + "nbformat_minor": 5 +} diff --git 
a/.ipynb_checkpoints/README-checkpoint.ipynb b/.ipynb_checkpoints/README-checkpoint.ipynb index 46ce395f..7f1554a3 100644 --- a/.ipynb_checkpoints/README-checkpoint.ipynb +++ b/.ipynb_checkpoints/README-checkpoint.ipynb @@ -49,28 +49,28 @@ "\n", "**Jupyter Dock is a set of Jupyter Notebooks for performing molecular docking protocols interactively, as well as visualizing, converting file formats and analyzing the results.**

\n", "\n", - "\n", "**See Jupyter Docks in action in my personal website: [chem-workflows](https://chem-workflows.com/)**

\n", "\n", - "\n", "These notebooks are Python 3 compatible. Each protocol and Jupyter notebook has its own test folder for testing and reproducibility evaluation.\n", "\n", + "For all notebooks, the demonstration includes the use of AutoDock Vina and Ledock. When available, some alternatives are mentioned in the protocol.\n", + "\n", "The notebooks includes whole protocols for:\n", "\n", "**1. Molecular Docking**\n", "> For any new user, this is a good place to start. Jupyter Docks' main stages for molecular docking, as well as all functions, methods and codes are described here along with brief explanations, hints, and warnings.\n", "\n", "**2. Virtual Screening**\n", - ">\n", + "> Interested in docking multiple ligands into a single target site? This is what you require. This protocol covers all steps from ligand preparation to docking pose visualization in the target site of interest.\n", "\n", "**3. Blind Docking**\n", - ">\n", + "> Do you want to dock multiple ligands into the whole target surface and/or its pockets? This protocol demonstrates the entire process of pocket search and the use of the detected pockets as potential organic molecule binding sites.\n", "\n", - "**4. Reverse Docking (Target fishing)**\n", - ">\n", + "**4. Reverse Docking / Target fishing**\n", + "> Interested in docking one or a few molecules into a set of proteins to identify the most promising target(s)? This notebook covers all of the steps required to achieve such a goal in a condensed manner, making the process seem like a walk in the park.\n", "\n", "**5. Docking Analysis**\n", - ">\n", + "> Have you completed your docking experiments with Jupyter Dock or another approach and want to conduct a rational analysis? You've come to the right place. This notebook summarizes the most common docking analysis techniques, including score comparisons, z-score calculation between programs, pose clustering, molecular interactions mapping, and more.\n", "\n", "\n", "Question about usage or troubleshooting? 
Please leave a comment here" @@ -125,6 +125,22 @@ "- [Smina](https://sourceforge.net/projects/smina/)" ] }, + { + "cell_type": "markdown", + "id": "c8e8a7c1-1402-4edf-8db5-a412a2bf8e20", + "metadata": {}, + "source": [ + "## Limitations" + ] + }, + { + "cell_type": "markdown", + "id": "b0cda790-83fd-46c7-b05b-54faa0bc19c5", + "metadata": {}, + "source": [ + "## Examples" + ] + }, { "cell_type": "markdown", "id": "696f484c-7179-4cc7-900f-bbc5b5db3657", diff --git a/1.-Molecular_Docking.ipynb b/1.-Molecular_Docking.ipynb index d59f93bd..d1cb8e03 100644 --- a/1.-Molecular_Docking.ipynb +++ b/1.-Molecular_Docking.ipynb @@ -145,13 +145,13 @@ ] }, { - "cell_type": "code", - "execution_count": 5, - "id": "ae18d2ff-7ece-48d4-99f1-092e398e58ae", + "cell_type": "markdown", + "id": "e1e6f4b6-d96a-458b-9b33-3f4846ff3e68", "metadata": {}, - "outputs": [], "source": [ - "#fix_protein(filename='1AZ8_clean.pdb',addHs_pH=7.4,try_renumberResidues=True,output='1AZ8_clean_H.pdb')" + "```\n", + "fix_protein(filename='1AZ8_clean.pdb',addHs_pH=7.4,try_renumberResidues=True,output='1AZ8_clean_H.pdb')\n", + "```" ] }, { @@ -491,31 +491,18 @@ ] }, { - "cell_type": "code", - "execution_count": 12, - "id": "39c43cc9-dc39-4a67-bc48-8d8149bda190", + "cell_type": "markdown", + "id": "4bcf45df-7781-4243-bc89-881693c03875", "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "\"\\nmol = obutils.load_molecule_from_file('1AZ8_lig_H.mol2')\\n\\npreparator = MoleculePreparation(merge_hydrogens=True,hydrate=False)\\npreparator.prepare(mol)\\n\\npreparator.write_pdbqt_file('1AZ8_lig_H.pdbqt')\\n\"" - ] - }, - "execution_count": 12, - "metadata": {}, - "output_type": "execute_result" - } - ], "source": [ - "'''\n", + "```\n", "mol = obutils.load_molecule_from_file('1AZ8_lig_H.mol2')\n", "\n", "preparator = MoleculePreparation(merge_hydrogens=True,hydrate=False)\n", "preparator.prepare(mol)\n", "\n", "preparator.write_pdbqt_file('1AZ8_lig_H.pdbqt')\n", - "'''" + "```" ] }, { @@ -527,29 
+514,16 @@ ] }, { - "cell_type": "code", - "execution_count": 13, - "id": "7caf04e8-5860-48e2-84e2-3a56aea353c6", + "cell_type": "markdown", + "id": "f0141f8b-86cf-4450-95d9-93dc58a34a1b", "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "\"\\nligand = [m for m in pybel.readfile(filename='1AZ8_lig_H.mol2',format='mol2')][0]\\nout=pybel.Outputfile(filename='1AZ8_lig_H.pdbqt',format='pdbqt',overwrite=True)\\nout.write(ligand)\\nout.close()\\n\"" - ] - }, - "execution_count": 13, - "metadata": {}, - "output_type": "execute_result" - } - ], "source": [ - "'''\n", + "```\n", "ligand = [m for m in pybel.readfile(filename='1AZ8_lig_H.mol2',format='mol2')][0]\n", "out=pybel.Outputfile(filename='1AZ8_lig_H.pdbqt',format='pdbqt',overwrite=True)\n", "out.write(ligand)\n", "out.close()\n", - "'''" + "```" ] }, { @@ -748,26 +722,13 @@ ] }, { - "cell_type": "code", - "execution_count": 16, - "id": "f729fb8c-07af-4591-8b08-4225c066c144", + "cell_type": "markdown", + "id": "800679a2-6785-4c0a-93a2-ac1d396e4c04", "metadata": {}, - "outputs": [ - { - "data": { - "text/plain": [ - "'\\n!../../bin/smina -r 1AZ8_clean_H.pdbqt -l 1AZ8_lig_H.pdbqt --center_x 31.859 --center_y 13.34 --center_z 17.065 --size_x 24.569 --size_y 18.12 --size_z 17.37 --exhaustiveness 8 --num_modes 5\\n'" - ] - }, - "execution_count": 16, - "metadata": {}, - "output_type": "execute_result" - } - ], "source": [ - "'''\n", + "```\n", "!../../bin/smina -r 1AZ8_clean_H.pdbqt -l 1AZ8_lig_H.pdbqt --center_x 31.859 --center_y 13.34 --center_z 17.065 --size_x 24.569 --size_y 18.12 --size_z 17.37 --exhaustiveness 8 --num_modes 5\n", - "'''" + "```" ] }, { @@ -2046,14 +2007,6 @@ "net = LigNetwork.from_ifp(results_df,lig_suppl[0],kind=\"frame\", frame=0,rotation=270)\n", "net.display()" ] - }, - { - "cell_type": "code", - "execution_count": null, - "id": "e8f32c97-7ded-4a09-8419-d4e3bf0154f5", - "metadata": {}, - "outputs": [], - "source": [] } ], "metadata": { diff --git a/3.-Blind_Docking.ipynb 
b/3.-Blind_Docking.ipynb index 0ee691a5..ad836197 100644 --- a/3.-Blind_Docking.ipynb +++ b/3.-Blind_Docking.ipynb @@ -2936,14 +2936,6 @@ "view.zoomTo()\n", "view.show()" ] - }, - { - "cell_type": "code", - "execution_count": null, - "id": "d66076e0-28bd-4c43-b532-b8a2554f79c6", - "metadata": {}, - "outputs": [], - "source": [] } ], "metadata": { diff --git a/README.ipynb b/README.ipynb index 46ce395f..7f1554a3 100644 --- a/README.ipynb +++ b/README.ipynb @@ -49,28 +49,28 @@ "\n", "**Jupyter Dock is a set of Jupyter Notebooks for performing molecular docking protocols interactively, as well as visualizing, converting file formats and analyzing the results.**

\n", "\n", - "\n", "**See Jupyter Docks in action in my personal website: [chem-workflows](https://chem-workflows.com/)**

\n", "\n", - "\n", "These notebooks are Python 3 compatible. Each protocol and Jupyter notebook has its own test folder for testing and reproducibility evaluation.\n", "\n", + "For all notebooks, the demonstration includes the use of AutoDock Vina and LeDock. When available, some alternatives are mentioned in the protocol.\n", + "\n", "The notebooks includes whole protocols for:\n", "\n", "**1. Molecular Docking**\n", "> For any new user, this is a good place to start. Jupyter Docks' main stages for molecular docking, as well as all functions, methods and codes are described here along with brief explanations, hints, and warnings.\n", "\n", "**2. Virtual Screening**\n", - ">\n", + "> Interested in docking multiple ligands into a single target site? This is what you require. This protocol covers all steps from ligand preparation to docking pose visualization in the target site of interest.\n", "\n", "**3. Blind Docking**\n", - ">\n", + "> Do you want to dock multiple ligands into the whole target surface and/or its pockets? This protocol demonstrates the entire process of pocket search and the use of the detected pockets as potential binding sites for organic molecules.\n", "\n", - "**4. Reverse Docking (Target fishing)**\n", - ">\n", + "**4. Reverse Docking / Target fishing**\n", + "> Interested in docking one or a few molecules into a set of proteins to identify the most promising target(s)? This notebook covers all of the steps required to achieve such a goal in a condensed manner, making the process seem like a walk in the park.\n", "\n", "**5. Docking Analysis**\n", - ">\n", + "> Have you completed your docking experiments with Jupyter Dock or another approach and want to conduct a rational analysis? You've come to the right place. This notebook summarizes the most common docking analysis techniques, including score comparisons, z-score calculation between software packages, pose clustering, molecular interaction mapping, and more.\n", "\n", "\n", "Question about usage or troubleshooting? 
Please leave a comment here" @@ -125,6 +125,22 @@ "- [Smina](https://sourceforge.net/projects/smina/)" ] }, + { + "cell_type": "markdown", + "id": "c8e8a7c1-1402-4edf-8db5-a412a2bf8e20", + "metadata": {}, + "source": [ + "## Limitations" + ] + }, + { + "cell_type": "markdown", + "id": "b0cda790-83fd-46c7-b05b-54faa0bc19c5", + "metadata": {}, + "source": [ + "## Examples" + ] + }, { "cell_type": "markdown", "id": "696f484c-7179-4cc7-900f-bbc5b5db3657", diff --git a/README.md b/README.md index 88b5986c..b2900eb2 100644 --- a/README.md +++ b/README.md @@ -25,28 +25,28 @@ **Jupyter Dock is a set of Jupyter Notebooks for performing molecular docking protocols interactively, as well as visualizing, converting file formats and analyzing the results.**

- **See Jupyter Docks in action in my personal website: [chem-workflows](https://chem-workflows.com/)**

- These notebooks are Python 3 compatible. Each protocol and Jupyter notebook has its own test folder for testing and reproducibility evaluation. +For all notebooks, the demonstration includes the use of AutoDock Vina and LeDock. When available, some alternatives are mentioned in the protocol. + The notebooks includes whole protocols for: **1. Molecular Docking** > For any new user, this is a good place to start. Jupyter Docks' main stages for molecular docking, as well as all functions, methods and codes are described here along with brief explanations, hints, and warnings. **2. Virtual Screening** -> +> Interested in docking multiple ligands into a single target site? This is what you require. This protocol covers all steps from ligand preparation to docking pose visualization in the target site of interest. **3. Blind Docking** -> +> Do you want to dock multiple ligands into the whole target surface and/or its pockets? This protocol demonstrates the entire process of pocket search and the use of the detected pockets as potential binding sites for organic molecules. -**4. Reverse Docking (Target fishing)** -> +**4. Reverse Docking / Target fishing** +> Interested in docking one or a few molecules into a set of proteins to identify the most promising target(s)? This notebook covers all of the steps required to achieve such a goal in a condensed manner, making the process seem like a walk in the park. **5. Docking Analysis** -> +> Have you completed your docking experiments with Jupyter Dock or another approach and want to conduct a rational analysis? You've come to the right place. This notebook summarizes the most common docking analysis techniques, including score comparisons, z-score calculation between software packages, pose clustering, molecular interaction mapping, and more. Question about usage or troubleshooting? Please leave a comment here @@ -88,6 +88,10 @@ Jupyter Dock is reliant on a variety of academic software. 
The Jupyter Dock.yaml - [Meeko](https://pypi.org/project/meeko/) - [Smina](https://sourceforge.net/projects/smina/) +## Limitations + +## Examples + ## Citation If you use these notebooks, please credit this repository and the required tools as follows: