Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[New pipeline] RepeatMaskMyGenome #33

Open
Juke34 opened this issue May 18, 2020 · 0 comments
Open

[New pipeline] RepeatMaskMyGenome #33

Juke34 opened this issue May 18, 2020 · 0 comments

Comments

@Juke34
Copy link
Collaborator

Juke34 commented May 18, 2020

See #17 for the general picture.

Maybe can be merge with the DeNovoRepeatLib pipeline (see #32).

The purpose of RepeatMaskMyGenome is to repeat mask a genome based on a repeat library (made de novo or provided within a lib e.g dfam or RepBase).
Having this pipeline could help to move easily to other annotation tool than MAKER if needed.

This pipeline consist in mains 3 steps:

  • split the genome into chunks (overlap or not? we can see in MAKER code to see how they do.)
  • Mask the chunks
  • Merge the annotation of the different chunks (if overlapping chunks we need to find a way to resolve the merge in a good way)

Input:

  • Path to a lib (fasta file is de-novo a name if from DFam or RepBase)
  • genome
  • param for the split size

Output: single gff file + stats

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant