Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

externally-defined criteria for keeping/dropping records #57

Open
nhoffman opened this issue Jan 5, 2017 · 0 comments
Open

externally-defined criteria for keeping/dropping records #57

nhoffman opened this issue Jan 5, 2017 · 0 comments
Assignees

Comments

@nhoffman
Copy link
Member

nhoffman commented Jan 5, 2017

Let's add an option 'filter_outliers --filter-functions' that will allow the user to specify a file containing python code for some (optional) functions, each with the same signature: func(seq, info) where seq is a single sequence record, and info represents annotation (a dict) for the corresponding sequence. One or more of the following functions may be defined:

  • keep_before(seq, info) - keep if True
  • drop_before(seq, info) - drop if True

These override other criteria that are applied before outlier detection (so that we have more fine-grained control of what sequences are considered for outlier detection).

  • keep_after(seq, info) - keep if True
  • drop_after(seq, info) - drop if True

These override the results of outlier detection.

Comments? Questions?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants