Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inaccurate coordinates in MAF file #27

Open
smallfade opened this issue May 12, 2021 · 2 comments
Open

Inaccurate coordinates in MAF file #27

smallfade opened this issue May 12, 2021 · 2 comments

Comments

@smallfade
Copy link

smallfade commented May 12, 2021

After running SibeliaZ between two fasta files, I found the coordinate is different from the original fasta file, and the sequence did not match back to the reference or query file. Both input files contain hard-masking regions. Did the output remove the hard-masking region length from original position? How could I get the same coordiantes as in the original file? Otherwise, it would be difficult for the postprocessing, when the reference is an established genome with annotations.

@iminkin
Copy link
Collaborator

iminkin commented May 12, 2021

Hi, that is very interesting. Could you please share the input and the parameters you used?

@smallfade
Copy link
Author

Hi, that is very interesting. Could you please share the input and the parameters you used?

I did not change the default parameters.
sibeliaz -t 40 -o out_dir GRCm38.primary_assembly.genome.fa xxxx_strain_assembly.fa

I'm sorry, it may not be a problem with SibeliaZ. The output I checked is from mafFilter, which might be the culprit here. The original MAF from SibeliaZ does not have the "inaccurate" coordiantes. But I wonder how to process the output from SibeliaZ to get VCF file for counting the differences. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants