Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUG: Indexing of Spur_5.0 assembly fails #436

Open
Maarten-vd-Sande opened this issue Jul 23, 2020 · 7 comments
Open

BUG: Indexing of Spur_5.0 assembly fails #436

Maarten-vd-Sande opened this issue Jul 23, 2020 · 7 comments
Labels
bug Something isn't working wontfix This will not be worked on

Comments

@Maarten-vd-Sande
Copy link
Member

Describe the bug
When making an index for assembly Spur_5.0 rule trackhub_index fails:
NC_001453.1 is not found in chromosome sizes file

To Reproduce
I ran the atac-seq pipeline with many samples, but I guess this should be enough to reproduce:

sample              assembly
GSM2546188   Spur_5.0

Expected behavior
No failure! Is this a genomepy+annotation thingy? Or is it on our side?

@Maarten-vd-Sande Maarten-vd-Sande added the bug Something isn't working label Jul 23, 2020
@siebrenf
Copy link
Member

This happens with immature genomes/annotations.

I suggest sticking with the older assembly for a while longer, but we could make the function "skip groups with errors" using flag -allErrors. What would you prefer?

@Maarten-vd-Sande
Copy link
Member Author

What would that do? I guess in my case I would prefer it to ignore the annotation, since the alignment is more important than the annotation visualization in the ucsc trackhub

@siebrenf
Copy link
Member

Since the rule is trackhub_index, I guess that any genes on that contig are unsearchable on UCSC.
So perhaps its fine to ignore the error then...

@Maarten-vd-Sande
Copy link
Member Author

I don't know if that is what you proposed. What would -allErrors do?

@siebrenf
Copy link
Member

-allErrors - skip groups with errors rather than aborting.
      Useful for getting infomation about as many errors as possible.

I think it means it just skips the lines in the annotation that cause an error.

@siebrenf siebrenf added the wontfix This will not be worked on label Aug 3, 2020
@siebrenf
Copy link
Member

siebrenf commented Aug 3, 2020

Spur_5.0 has different contig names in the genome fasta and the annotation. NCBI will have to fix this, or you must manually change them :(

@Maarten-vd-Sande
Copy link
Member Author

I would like to keep this one open (at least for now), until I have a solution for myself

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

2 participants