Skip to content

Team J-SCAMP's project for BioDataHack. Repurposing drugs through unsupervised machine learning.

Notifications You must be signed in to change notification settings

mattravenhall/BioDataHack

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BioDataHack

Team J-SCAMP's project for BioDataHack. Repurposing drugs and reclassifying diseases through unsupervised machine learning.

Prerequisites

Versions used for development:

Running

Running main.py will pull down a disease-gene data from OpenTargets (unless that dataset is in the current working directory) and return two plots. Those include 'pca.png', which shows Principal Components 1 and 2 where diseases are red points and drugs are blue, and 'kmeans.png', where the four most likely clusters are colourised. The drug-gene (& disease-gene) dataset should present in the current working directory.

Parameters

By default main.py will perform dimension reduction with Principal Component Analysis, but Singular-Value Decomposition and t-SNE can be performed if clusterType is set to 'gene-svd' or 'gene-tsne'. Nb. these options will require editing main.py. Similarly, if disease-gene information needs to be pulled from OpenTargets the script will pull down 500 diseases by default, with more or less being pulled by changing the maxDiseases variable.

Example Output

About

Team J-SCAMP's project for BioDataHack. Repurposing drugs through unsupervised machine learning.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages