Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mm10 vs Grcm38(Gencode) #143

Open
pdellorusso opened this issue Jan 4, 2019 · 1 comment
Open

mm10 vs Grcm38(Gencode) #143

pdellorusso opened this issue Jan 4, 2019 · 1 comment

Comments

@pdellorusso
Copy link

This is a question, not an issue, but I am curious about whether there is a specific reason to use mm10 over the GRCm38 (https://www.gencodegenes.org/mouse/) primary assembly available from Gencode?

Is this the standard mouse genome assembly to use for all Encode standardized pipelines?

@strattan
Copy link

strattan commented Jan 7, 2019

@pdellorusso Thanks for your question. The GRCm38 build ENCODE uses is based on what GRC calls the "latest major release", which is at the "GRCm38" tab here: https://www.ncbi.nlm.nih.gov/grc/mouse

We do not apply the periodic patches GRC applies, which is up to p6 at this time.

The mm10 ENCODE uses for mapping has chromosome names in "UCSC format" (like "chr1"), and includes autosomes, both sex chromosomes, M, and the unplaced and unlocalized scaffolds. Downstream analysis may choose to use any subset of those mappings but the mapping is always to the same reference.

For transcript annotations, we have used GENCODE M4 https://www.gencodegenes.org/mouse/release_M4.html. We anticipate upgrading to a more recent GENCODE build this year, but the ENCODE RNA working group have not decided on exactly which build or what that timeline is. When we do decide, we will make an announcement on https://www.encodeproject.org/

I hope that's helpful!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants