Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extracting insertion/deletion data from aligned region of interest #281

Open
jsromanowski opened this issue Feb 25, 2024 · 5 comments
Open

Comments

@jsromanowski
Copy link

jsromanowski commented Feb 25, 2024

Hello IGV team,

I am currently analyzing amplicons from an insect genomic DNA sample edited with CRISPR/Cas9. I am aware IGV offers insertion and deletion data by selecting a specific base pair of your aligned read(s). Is it possible to access and export this data for a highlighted region of interest from IGV?

Side note: I am aware this data is contained within .bam files, however extracting the information in a clean way is not so simple and can get a bit messy (i.e. with mpileup offered by bcftools). I would also like to add I do not intend to use this data in a clinical or therapeutic context, but instead am using it to analyze gene editing in insects - so although I'm aware this web app is not particularly intended for this purpose, the ability to this data from IGV would be extremely useful and sufficient for my application.

Thanks,

Joe

@jrobinso
Copy link
Contributor

Extracting the data with samtools is pretty trivial if you know the region

samtools view chr1:100-200

What information are you looking for, and in what format?

@jsromanowski
Copy link
Author

jsromanowski commented Feb 27, 2024

With samtools view, I can see the alignment results (mismatches, insertions, deletions) and count the sum of any one of these (or all) over a given range (i.e. chr1:100-200), however I would lose base pair resolution of this information (i.e. if my region is chr1:100-200, I will not be able to see indel counts for chr1:100, chr1:101, chr1:102, and so on). By manually clicking a position of my aligned read on IGV web app, I can obtain this information, but it is not scalable for many alignment files - is there a faster, higher-throughput way to do this via IGV or some package offered by IGV?

Output format of this information is not particular important , but something simple such as columns for reference position, insertions, and deletions would be ideal.

Thanks for the quick response!

@jrobinso
Copy link
Contributor

I would find it really surprising if there isn't a simple tool to do this, but I assume you have looked. This isn't the sort of thing we design IGV for, which is a visual tool, but leave this open we will consider it for the future.

@jrobinso
Copy link
Contributor

BTW, on a tangent here, but where would you expect an insertion between positions 10 and 11 to be counted?

@jsromanowski
Copy link
Author

Great question - I am exploring variant calling programs to count these. Out of curiosity, how does IGV do this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants