Skip to content

Using data from PDBePISA with the convenience of Python and Jupyter.

License

Notifications You must be signed in to change notification settings

fomightez/pdbepisa-binder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

75 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdbepisa-binder

Analysis of PDBePISA-related data via active Jupyter sessions provided via MyBinder.org. Adapt the demonstrations to analyze your favorite structures.

Binder

tl;dr:
Click any launch binder badge on this page to use the demonstrations inside your browser.


pdbepisa-binder: Jupyter notebook environment for analysis of PDBePISA-related data.

A launchable, working Jupyter-based environment that has a collection of demonstrations of analysis of PDBePISA-related data served via MyBinder.org.

You can also easily adapt the demonstrations to analyze your favorite structures.

Meant to be self-contained and ready-to-go. No installations or copying of notebooks is necessary if launch binder is clicked. Everything will just work. Of course, static versions of the notebooks can also be used. I recommend rendering the static versions by placing the URLs into the nbviewer. The views provided by nbviewer look best and Github's rendering often times out (your mileage may vary).

PDBePISA itself is accessible online here.

Usage

This repository is set up to allow analysis of PDBePISA-related data after pressing the launch binder button above or below. The target use case is where you want to be thorough in your analyses with the add of compuatation or analyze multiple structures. You shouldn't need to install anything.

In the notebooks that can be launched, I have added some examples illustrating how to get data and process ir easily with Python and convert to other forms. Alternatively, the notebooks with most of resources can be viewed statically and nbviewer is recommended for that as discussed above.

Attributions

Users of PDBePISA-sourced data should probably cite the following, as directed here:

Clarifying Software Attribution: I, Wayne, am not involved with PDBePISA in any way. I simply set up this repository to make analysis of the data easier without installation headaches. See the links above and below for the materials by those who established and maintain PDBePISA, and the related site jsPISA.

I, Wayne, did share Jupyter/Python-based utilities for use with the data available from PDBePISA; these are available here and utilized in the notebooks in this repository to process data and allow easily converting the results to other forms.

Related items by others

There is also jsPISA, which is supposedly an improved user interface; however, I don't see a way to use access as an API, several PDB entries I put in gave the error that they did not exist, and when I tried with 4fgf that shows a very informative interface table and PDBePISA, I was not able to see equivalent at jsPISA. jsPISA is maintained by CCP4 here.

Louis has a script that gets HTML (or is it XML?) for a lot of PDB indentifiers. It allows for interrupted/resumed download.

AlphaFold-Multimer Local Interaction Score (AFM-LIS) calculates the Local Interaction Score (LIS) and the Local Interaction Area (LIA) which may be useful in conjunction / comparison with PDBePISA data.

Related items by me

  • My pdbepisa-utilities sub-repo for the associated scripts.

  • My pdbsum-utilities sub-repo has a number of scripts, although I note the interface handling ones only deal with chains of protein-protein interfaces. These scripts are demonstrated in sessions that can be launched by pressing the launch binder button at my repo pdbsum-binder.

  • My repo pdbsum-binder demonstrates scripts from my pdbsum-utilities sub-repo that enable handling data from the PDBsum with Jupyter/Python. Importantly, data from that site will only summarize interface surface area between protein chains of a structure. It does detail protein and nucleic acid residue-residue contacts but only graphically, and so I haven't found/developed a way to extract the data from there into Pandas dataframes yet.

  • See here for a listing of resources in a similar vein yet targeted to macromolecular structure data. In particular, see cl_demo-binder for the companion set to this one.

Technical notes

This repository is set up to make use of the binder service offered by MyBinder.org. See their site for more information about Binder.

I borrrowed the 'warning' highlight/introductory text about notebooks at the top of the included notebook from Tim Sherratt's notebook here.

Click launch binder below to start using the demonstrations.

Binder