Switching from HGNC to EntrezID #17

Open
amyecampbell opened this issue May 19, 2017 · 0 comments

amyecampbell commented May 19, 2017

Once the updated pipeline is working, we should switch feature names to Entrez IDs. Currently, CuratedOvarianData handles non-specific probe sets (probes that map to multiple genes) by listing all possible genes separated by ///. For now, we are dropping these probe sets when we include AACES. One concern, however, is that some single HGNC gene names in the CuratedOvarian and Mayo sets are different aliases of the HTA HGNC names in AACES, so the same gene can fail to match by name. Mapping to Entrez IDs would likely give a more complete mapping, and therefore a greater number of common genes across the six datasets.
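
To make the alias concern concrete, here is a minimal Python sketch of the idea, not the pipeline's actual code. The function names (`to_entrez`, `common_genes`), the toy symbol-to-Entrez table, and the two tiny example datasets are all assumptions for illustration; a real table would have to come from an HGNC or NCBI gene annotation source.

```python
AMBIGUOUS_SEP = "///"  # separator used for probe sets that map to several genes

# Toy alias table, illustrative only. A real mapping would be loaded from an
# annotation source and would cover every symbol and alias.
SYMBOL_TO_ENTREZ = {
    "BRCA1": "672",
    "TP53": "7157",
    "KMT2A": "4297",
    "MLL": "4297",  # older alias of KMT2A, same Entrez ID
}


def to_entrez(feature_names, symbol_to_entrez):
    """Map gene symbols to Entrez IDs, dropping ambiguous or unmapped names."""
    mapped = {}
    for name in feature_names:
        if AMBIGUOUS_SEP in name:
            continue  # non-specific probe set: dropped, as in the current approach
        entrez = symbol_to_entrez.get(name.strip())
        if entrez is not None:
            mapped[name] = entrez
    return mapped


def common_genes(datasets, symbol_to_entrez):
    """Return the Entrez IDs shared by every dataset."""
    id_sets = [
        set(to_entrez(names, symbol_to_entrez).values())
        for names in datasets.values()
    ]
    return set.intersection(*id_sets) if id_sets else set()


# Hypothetical feature lists standing in for two of the datasets.
datasets = {
    "curated": ["BRCA1", "TP53", "MLL", "BRCA1///TP53"],
    "aaces": ["BRCA1", "TP53", "KMT2A"],
}

by_symbol = set(datasets["curated"]) & set(datasets["aaces"])
by_entrez = common_genes(datasets, SYMBOL_TO_ENTREZ)

print(sorted(by_symbol))  # overlap when matching on gene names
print(sorted(by_entrez))  # overlap when matching on Entrez IDs
```

In this toy case, matching by name misses the gene that appears as MLL in one set and KMT2A in the other, while matching by Entrez ID recovers it. Whether the gain is meaningful on the real data would need to be checked once the pipeline is running.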
