BioDataHack

Team J-SCAMP's project for BioDataHack. Repurposing drugs and reclassifying diseases through unsupervised machine learning.

Prerequisites

Versions used for development:

python (v3.4)
numpy (v1.11.3)
pandas (v0.19.2)
matplotlib (v2.0.0)
sklearn (v0.18.1)
opentargets (v3.1.0) (http://opentargets.readthedocs.io/en/stable/)

Running

Running main.py pulls disease-gene data from OpenTargets (unless that dataset is already in the current working directory) and produces two plots: 'pca.png', which shows Principal Components 1 and 2 with diseases as red points and drugs as blue points, and 'kmeans.png', in which the four most likely clusters are coloured. The drug-gene dataset (and the disease-gene dataset, if it is not to be downloaded) should be present in the current working directory.
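As a rough illustration of the two steps behind the plots, the sketch below projects a matrix onto its first two principal components and then groups the projected points into four clusters with k-means. It is not the code in main.py: the function name reduce_and_cluster, the random placeholder matrix, and the choice to cluster on the projected coordinates rather than the raw data are all assumptions made for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA


def reduce_and_cluster(matrix, n_clusters=4, seed=0):
    """Project rows onto PC1/PC2, then cluster the projected points.

    Hypothetical helper: main.py may organise these steps differently.
    """
    # Keep only the first two principal components (the axes of pca.png).
    coords = PCA(n_components=2).fit_transform(matrix)
    # Assign each row to one of n_clusters groups (the colours of kmeans.png).
    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    labels = model.fit_predict(coords)
    return coords, labels


# Placeholder input: 40 entities (diseases or drugs) by 25 genes of random
# scores. Real input would come from the disease-gene and drug-gene datasets.
rng = np.random.RandomState(0)
toy_matrix = rng.rand(40, 25)
coords, labels = reduce_and_cluster(toy_matrix)
```

The columns coords[:, 0] and coords[:, 1] could then be passed to a matplotlib scatter plot, coloured either by entity type or by the values in labels.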

Parameters

By default main.py performs dimension reduction with Principal Component Analysis, but Singular-Value Decomposition or t-SNE can be used instead by setting clusterType to 'gene-svd' or 'gene-tsne'. Note that changing these options requires editing main.py. Similarly, if disease-gene information needs to be pulled from OpenTargets, the script pulls 500 diseases by default; change the maxDiseases variable to pull more or fewer.
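One way a clusterType switch of this kind could be wired up is sketched below with scikit-learn estimators. This is an assumption about structure, not a copy of main.py: the helper make_reducer is hypothetical, and 'gene-pca' is an assumed name for the default setting (only 'gene-svd' and 'gene-tsne' are documented here).

```python
from sklearn.decomposition import PCA, TruncatedSVD
from sklearn.manifold import TSNE


def make_reducer(cluster_type, n_components=2):
    """Return an unfitted dimension-reduction estimator for cluster_type.

    Hypothetical helper; 'gene-pca' is an assumed name for the default.
    """
    if cluster_type == 'gene-pca':
        return PCA(n_components=n_components)
    if cluster_type == 'gene-svd':
        return TruncatedSVD(n_components=n_components)
    if cluster_type == 'gene-tsne':
        return TSNE(n_components=n_components)
    raise ValueError('Unknown clusterType: %r' % (cluster_type,))


clusterType = 'gene-svd'
reducer = make_reducer(clusterType)
```

All three estimators expose fit_transform, so the rest of a script could call reducer.fit_transform(matrix) regardless of which method was chosen.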

