GenoAssist

An all-encompassing bioinformatics tool for genome assembly and annotation projects

1. About

One of the challenges computational biologists face during genome assembly projects is choosing among the many available assemblers. This is time-consuming because each assembler exposes its own set of parameters that the user needs to learn. Even once those parameters are understood, several assemblers still have to be run and their statistics compared to identify the best assembly. GenoAssist helps by centralizing the assemblers, their parameters, running environments, and results reporting in a single place.

2. Installation

You can either use go get (the package is added under $GOPATH/):

$ go get -u github.com/genoassist/genoassist

Or clone the repository:

$ git clone https://github.com/genoassist/genoassist

Build the main.go file:
```
$ go build main.go
```

If you are missing packages, run go mod vendor to collect the necessary packages.

3. GenoAssist usage

GenoAssist only requires a YAML file that contains the configuration it should use to run its processes. A template can be found in this repository. For convenience, here's an example specification:

assemblers:
  megahit:
    kmers: "27"
  abyss:
    kmers: "27"
genoassist:
  assemblers: ['abyss','megahit','flye']
  inputFilePath: "/test/raw_sequences.fastq"
  outputPath: "/test/output"
  threads: 2
  prep: true
  qualityControl: true
  fileType: "fasta"

Notes:

All paths used with GenoAssist have to be absolute paths (a Docker requirement)
The accepted assembler values are:

'abyss'
'megahit'
'flye'

The accepted file types are:

FASTA
FASTQ
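The rules in these notes can be checked before a run is started. The following is a minimal sketch in Go, not GenoAssist's own code: the Settings struct, the Validate function, and the lookup tables are hypothetical names introduced for illustration, and it assumes that file types are matched case-insensitively and that paths are Unix-style, as they are inside Docker containers.

```go
package main

import (
	"fmt"
	"path"
	"strings"
)

// Settings mirrors some keys under "genoassist" in the example YAML.
// The struct and its field names are hypothetical, not GenoAssist's types.
type Settings struct {
	Assemblers    []string
	InputFilePath string
	OutputPath    string
	FileType      string
}

// Accepted values, taken from the notes above.
var (
	knownAssemblers = map[string]bool{"abyss": true, "megahit": true, "flye": true}
	knownFileTypes  = map[string]bool{"fasta": true, "fastq": true}
)

// Validate returns one message per rule that the settings violate.
func Validate(s Settings) []string {
	var problems []string
	for _, a := range s.Assemblers {
		if !knownAssemblers[a] {
			problems = append(problems, fmt.Sprintf("unknown assembler %q", a))
		}
	}
	// path.IsAbs treats "/" as the separator on every platform.
	for _, p := range []string{s.InputFilePath, s.OutputPath} {
		if !path.IsAbs(p) {
			problems = append(problems, fmt.Sprintf("path %q is not absolute", p))
		}
	}
	// Assumption: "FASTA" and "fasta" are treated as the same value.
	if !knownFileTypes[strings.ToLower(s.FileType)] {
		problems = append(problems, fmt.Sprintf("unknown file type %q", s.FileType))
	}
	return problems
}

func main() {
	s := Settings{
		Assemblers:    []string{"abyss", "megahit", "flye"},
		InputFilePath: "/test/raw_sequences.fastq",
		OutputPath:    "/test/output",
		FileType:      "fasta",
	}
	for _, p := range Validate(s) {
		fmt.Println("config problem:", p)
	}
}
```

Collecting every problem instead of stopping at the first one lets a user fix the whole configuration in a single pass.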

Installing Docker images through GenoAssist

If you are encountering problems with Docker, make sure that:

The Docker daemon is running in the background
You have the necessary Docker images. GenoAssist can install them for you: specify prep: true under genoassist in the YAML configuration, and it will install the images for the containers it runs.
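Both checks can also be scripted. The sketch below is one possible approach in Go, assuming the docker CLI is on the PATH and that "docker info" and "docker image inspect" exit with a non-zero status when the daemon is unreachable or the image is absent. The function names and the image name "example/assembler" are placeholders, not names taken from GenoAssist.

```go
package main

import (
	"fmt"
	"os/exec"
)

// runner executes a command and reports whether it exited successfully.
// Passing it in as a value lets the checks be exercised without Docker.
type runner func(name string, args ...string) bool

// execRunner runs the command on the host through os/exec.
func execRunner(name string, args ...string) bool {
	return exec.Command(name, args...).Run() == nil
}

// DaemonRunning reports whether "docker info" succeeds.
func DaemonRunning(run runner) bool {
	return run("docker", "info")
}

// MissingImages returns the images for which "docker image inspect" fails.
func MissingImages(run runner, images []string) []string {
	var missing []string
	for _, img := range images {
		if !run("docker", "image", "inspect", img) {
			missing = append(missing, img)
		}
	}
	return missing
}

func main() {
	if !DaemonRunning(execRunner) {
		fmt.Println("Docker daemon is not reachable; start it and retry")
		return
	}
	// "example/assembler" is a placeholder, not an image GenoAssist uses.
	fmt.Println("missing images:", MissingImages(execRunner, []string{"example/assembler"}))
}
```

If any images are reported missing, setting prep: true in the configuration is the route the README describes for installing them.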

Sample assembly result visualization

(Figure: see example_viz.png in the repository.)

4. Architecture

The overall model follows a primary/replica architecture. The primary is what users interact with: users specify the files containing the contigs and the type of reads they have, e.g., Illumina. The primary takes the user's input and schedules assembly, parsing of results, and reporting, in that order.
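The ordering guarantee described above can be illustrated with a small Go sketch. It models only the sequencing of the three phases; how the primary hands work to replicas is not described here, so that part is left out. Stage, RunInOrder, and ErrNoContigs are hypothetical names, not identifiers from the GenoAssist code base.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrNoContigs is a hypothetical failure used to show how a run stops early.
var ErrNoContigs = errors.New("no contigs produced")

// Stage is one unit of work that the primary schedules.
type Stage struct {
	Name string
	Run  func() error
}

// RunInOrder executes the stages sequentially and stops at the first
// failure. It returns the names of the stages that completed.
func RunInOrder(stages []Stage) ([]string, error) {
	var done []string
	for _, st := range stages {
		if err := st.Run(); err != nil {
			return done, fmt.Errorf("%s failed: %w", st.Name, err)
		}
		done = append(done, st.Name)
	}
	return done, nil
}

func main() {
	// The stage bodies are stubs; real stages would run containers,
	// parse assembler output, and write a report.
	stages := []Stage{
		{Name: "assembly", Run: func() error { return nil }},
		{Name: "parsing", Run: func() error { return nil }},
		{Name: "reporting", Run: func() error { return nil }},
	}
	done, err := RunInOrder(stages)
	fmt.Println("completed:", done, "error:", err)
}
```

Stopping at the first failure means reporting never runs on results that were not parsed, which is one reasonable reading of "in that order".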

5. Maintainers

Tayab Soomro
Flaviu Vadan

Feel free to contact any of the maintainers if you would like to be an active maintainer and contributor to GenoAssist! If you would like to contribute only, you are encouraged to grab an issue and submit a pull request with proposed changes for review!

6. Feedback and bug reports

Submit feedback and bug reports by using the Issues section of the repository.
