Drugbank Scraper

Run

See Pipenv & Virtual Environments guide to create a virtual environment and activate the virtual environment
Install graphviz brew install graphviz
Install requirements with pip install -r requirements.txt.
Create PostgreSQL database.
Create .env file with cp .env.template .env and fill environment variables.

Spiders

Drug

Run scrapy crawl drug to run drug spider and populate database. This will scrape data, create and populate database tables. Final data will be in drugbank schema. This will:

Scrape following data:
- DrugBank ID
- SMILES string
- Gene name
- Actions and alternative identifiers of every target.
Save scraped data into the previously created PostgreSQL database.

Development

See the virtual environment step above.
Install requirements with pip install -r requirements_dev.txt.
Run pre-commit install to install pre-commit hooks. This repo is already set up to use some pre-commit hooks for code quality purposes. Configuration file is available here. More information about pre-commit is available on their website.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github		.github
drugbank		drugbank
static		static
.env.template		.env.template
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
README.md		README.md
requirements.txt		requirements.txt
requirements_dev.txt		requirements_dev.txt
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github

.github

drugbank

drugbank

static

static

.env.template

.env.template

.gitignore

.gitignore

.isort.cfg

.isort.cfg

.pre-commit-config.yaml

.pre-commit-config.yaml

.python-version

.python-version

README.md

README.md

requirements.txt

requirements.txt

requirements_dev.txt

requirements_dev.txt

scrapy.cfg

scrapy.cfg

Repository files navigation

Drugbank Scraper

Run

Spiders

Drug

Development

About

Releases 2

Sponsor this project

Contributors 2

Languages

aliavni/drugbank-scraper

Folders and files

Latest commit

History

Repository files navigation

Drugbank Scraper

Run

Spiders

Drug

Development

About

Resources

Stars

Watchers

Forks

Sponsor this project

Languages