Installations

This is a FastAPI project which uses strawberry to query an elasticsearch database.

Live API

This API is live at - http://annoq.org/api-v2/

API Endpoints

/graphql - Graphql endpoint of the API where all the queries can be made through strawberry graphiQL.
/annotations - Returns a json with the annotation tree which has field names for strawberry queries.
/download/{folder}/{name} - Downloads a text file using the download path returned in the download graphql query.

Installations

Before you begin, make sure you have the following installed:

Python: Install Python 3.9 or later. You can download it from python.org.
Docker Desktop: Install Docker Desktop. You can download it from docker-desktop.

Project Setup

Clone this repository.

git clone https://github.com/USCbiostats/annoq-api-v2.git
cd annoq-api-v2

Create a python virtual environment and activate it.

python3 -m venv venv
source venv/bin/activate

Install the dependencies

pip install -r requirements.txt

Make sure that the Docker Desktop is running. Build the Docker image and start the container.

docker-compose up --build

Once the image and containers are made, the containers can be started from Docker Desktop or using the following command

docker-compose up

The fastAPI application would be running on http://0.0.0.0:8000 and the elasticsearch instance would be on http://0.0.0.0:9200

Sample Elasticsearch Data Setup

Follow the https://github.com/USCbiostats/annoq-database repository and use the sample_data folder to setup the sample data for elasticsearch

Dynamic Snps class generation

Annoq has 500+ attributes, so the strawberry type for it had to be generated dynamically as it would not make sense to manually write 500 fields. Since the class is already present in this repository there is no need to run the following code again, but just for knowledge:

First a json schema was generated which takes the mapping for the elasticsearch database and creates a schema for a pydantic Baseclass. After this scripts/class_generators/class_schema.json was generated. The python file of the pydantic Baseclass - models/Snps.py is generated using datamodel-codegen.

All of this can be done using the bash script and running the following command -

scripts/class_generators/generate_model.sh

Make sure that the above scripts has permissions, if not run

chmod +x scripts/class_generators/generate_model.sh

Cron job setup

Change the cron_job.sh file change downloads in line 1 to the absolute path of the download folder in this repo after cloning and then run the following command which clears the download folder once a week at midnight.

chmod +x cron_job.sh
./cron_job.sh

To run the project

uvicorn src.main:app --reload

Testing

To run the tests on the code use the following command

python -m pytest test

Name		Name	Last commit message	Last commit date
Latest commit History 124 Commits
.vscode		.vscode
data		data
demo		demo
sample_data		sample_data
scripts/class_generators		scripts/class_generators
src		src
test		test
.env-example		.env-example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
cron_job.sh		cron_job.sh
docker-compose.yaml		docker-compose.yaml
dynamic_data_es.sh		dynamic_data_es.sh
get_data_from_api.sh		get_data_from_api.sh
log.ini		log.ini
playground.ipynb		playground.ipynb
process_downloaded_json_files.py		process_downloaded_json_files.py
query.txt		query.txt
requirements.txt		requirements.txt

USCbiostats/annoq-api-v2

Folders and files

Latest commit

History

Repository files navigation

Live API

API Endpoints

Installations

Project Setup

Sample Elasticsearch Data Setup

Dynamic Snps class generation

Cron job setup

To run the project

Testing

About

Resources

Stars

Watchers

Forks

Languages