dhimmel/entrez-gene

Creating user-friendly Entrez Gene datasets for humans

DOI: 10.5281/zenodo.45524

Entrez Gene is the NCBI database of gene-specific information. It provides "tracked, unique identifiers for genes" and reports "information associated with those identifiers for unrestricted public use" [source]. We use Entrez Gene as the primary gene vocabulary for our drug repurposing research.

This repository creates user-friendly datasets from Entrez Gene. We currently focus on human genes only.
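
As a rough illustration of what restricting to human genes can look like, the sketch below filters a tab-separated table in the style of NCBI's gene_info file down to rows whose taxonomy ID is 9606 (Homo sapiens). It is not taken from process.ipynb: the function name read_human_genes is hypothetical, the inline sample rows are illustrative rather than an actual download, and only a few of the columns found in the real file are shown.

```python
import csv
import io

HUMAN_TAX_ID = "9606"  # NCBI Taxonomy ID for Homo sapiens

# Illustrative rows in a gene_info-like layout (assumption: the real file
# has many more columns and is far larger; this is not downloaded data).
SAMPLE = (
    "#tax_id\tGeneID\tSymbol\ttype_of_gene\n"
    "9606\t1\tA1BG\tprotein-coding\n"
    "9606\t2\tA2M\tprotein-coding\n"
    "10090\t11287\tPzp\tprotein-coding\n"
)


def read_human_genes(handle):
    """Return the human rows of a tab-separated gene table as dicts.

    Hypothetical helper for illustration; the notebook may do this differently.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    return [
        {
            "GeneID": int(row["GeneID"]),
            "Symbol": row["Symbol"],
            "type_of_gene": row["type_of_gene"],
        }
        for row in reader
        if row["#tax_id"] == HUMAN_TAX_ID
    ]


human_genes = read_human_genes(io.StringIO(SAMPLE))
for gene in human_genes:
    print(gene["GeneID"], gene["Symbol"], gene["type_of_gene"])
```

Filtering on the taxonomy ID rather than on gene symbols keeps the selection unambiguous, since the same symbol can appear in more than one species.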

The Python notebook process.ipynb executes the analysis. Files downloaded from external locations are stored in the download directory. The following generated datasets reside in the data directory: