Make a consistent way to deal with complicated column formats #31

Open
mikegloudemans opened this issue May 4, 2020 · 2 comments
Labels
code-base (Maintaining reasonable and intuitive code), enhancement (New feature or request), munge (Pertaining to the "munge" module), usability (Making tools more intuitive for the user)

@mikegloudemans
Owner

Probably the best way to do this is to let attributes such as "rsid_index", "or_index", or "effect_allele_index" be specified as objects with the following attributes:

  • split_char (list of strings) [characters on which to split the column entry, could be more than one]
  • ignore_chars (list of strings) [characters to remove completely]
  • split_index (integer) [1-based index of the field to grab after splitting on split_char]

This is already done for a few traits using an rsid_split object, but only in an ad-hoc way that is not generalized to other attribute columns.

It should be possible to do this for ANY column, including ref and alt.
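A minimal sketch of how such a column spec might be applied to a single entry. The function name `extract_field`, the dict layout, and the example entry are hypothetical and not taken from the existing munge code. It also assumes that ignore_chars are removed before splitting, which the proposal does not specify.

```python
import re


def extract_field(entry, spec):
    """Pull one sub-field out of a raw column entry according to `spec`.

    `spec` is a hypothetical dict with the three proposed attributes:
    split_char, ignore_chars, and split_index.
    """
    # Assumption: ignored strings are stripped before splitting.
    for chars in spec.get("ignore_chars", []):
        entry = entry.replace(chars, "")

    split_chars = spec.get("split_char", [])
    if not split_chars:
        # Nothing to split on; return the cleaned entry unchanged.
        return entry

    # Split on any of the listed separators, escaping regex metacharacters.
    pattern = "|".join(re.escape(c) for c in split_chars)
    fields = re.split(pattern, entry)

    # split_index is 1-based, as described in the proposal.
    return fields[spec.get("split_index", 1) - 1]


# Made-up entry format, for illustration only.
entry = "chr1:12345_A_G"
ref_spec = {"split_char": [":", "_"], "ignore_chars": ["chr"], "split_index": 3}
alt_spec = {"split_char": [":", "_"], "ignore_chars": ["chr"], "split_index": 4}

ref_allele = extract_field(entry, ref_spec)
alt_allele = extract_field(entry, alt_spec)
```

Because the same helper handles every column, the existing rsid_split special case could in principle be expressed as just another spec of this shape.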

@mikegloudemans added the enhancement, munge, usability, and code-base labels on May 4, 2020
@mikegloudemans
Owner Author

See also: #32

@mikegloudemans
Owner Author

mikegloudemans commented May 4, 2020

Pull ref and alt alleles from traits with unconventional formats:

  • Multiple-Traits_Gurdasani_2019
  • Osteoarthritis_Zeggini_2012
  • Sarcoidosis_Rivera_2016

@mikegloudemans added this to the v2 milestone on May 4, 2020