bin-table operations #116

Open
5 tasks
Tracked by #92
agalitsyna opened this issue Jun 15, 2022 · 1 comment


agalitsyna commented Jun 15, 2022

  • definitions
  • functions
    • regions_to_bins to assign binspans to a set of bp spans
    • align_tables to ensure that two interval tables are aligned (see #37, "Find a way to ensure alignment of two interval tables")
    • adjust_view_to_bintable to synchronize view start/end with bin start/end. A viewframe is frequently defined at bp resolution while the bins are much larger, so it is not clear which region a bin that straddles a view boundary should be assigned to. This tool would provide a standard for making that decision.
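
To make the last two proposals concrete, here is a rough, library-free sketch of what snapping a view to bin boundaries could look like. It is only an illustration, not an agreed design: the function names `snap_view_to_bins` and `regions_to_binspans`, the `mode` parameter, and the assumption of fixed-size bins that start at 0 on every chromosome are all hypothetical, and the shorter last bin of a chromosome is ignored.

```python
def snap_view_to_bins(view, binsize, mode="outer"):
    """Snap (chrom, start, end) regions to multiples of binsize.

    Hypothetical helper: assumes fixed-size bins starting at 0.
    mode="outer" expands each region to cover every bin it touches;
    mode="inner" shrinks it to the bins that lie fully inside it.
    """
    snapped = []
    for chrom, start, end in view:
        if mode == "outer":
            new_start = (start // binsize) * binsize   # round start down
            new_end = -(-end // binsize) * binsize     # round end up
        elif mode == "inner":
            new_start = -(-start // binsize) * binsize  # round start up
            new_end = (end // binsize) * binsize        # round end down
        else:
            raise ValueError("mode must be 'outer' or 'inner'")
        snapped.append((chrom, new_start, new_end))
    return snapped


def regions_to_binspans(view, binsize):
    """Return (chrom, lo, hi): half-open per-chromosome bin offsets
    of the bins touched by each bp span (same fixed-bin assumption)."""
    return [
        (chrom, start // binsize, -(-end // binsize))
        for chrom, start, end in view
    ]


view = [("chr1", 1500, 10200)]
outer = snap_view_to_bins(view, 1000, mode="outer")  # [("chr1", 1000, 11000)]
inner = snap_view_to_bins(view, 1000, mode="inner")  # [("chr1", 2000, 10000)]
spans = regions_to_binspans(view, 1000)              # [("chr1", 1, 11)]
```

Note that with mode="inner" a region smaller than one bin can end up with start >= end, so whichever convention is chosen has to say what happens in that case.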
gfudenberg mentioned this issue Jun 15, 2022

agalitsyna commented Jun 15, 2022

Here is what we already have in bioframe that might be useful for the discussion.

  • Filtering a bin table by a viewframe has at least three bioframe-ish solutions:
  1. bioframe.select with pd.concat: a bin that overlaps multiple regions is output multiple times, once per region.
  2. bioframe.overlap with DataFrame.dropna: does not verify that the viewframe is sorted and non-overlapping.
  3. bioframe.assign_regions with DataFrame.dropna: the most reasonable way. A bin that overlaps multiple regions is assigned to the one with the largest overlap.
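
The difference between options 1 and 3 is easiest to see on a bin that straddles two regions. The sketch below reimplements both rules in plain Python on tuples; it does not call bioframe and is not how bioframe implements them. The function names are made up for the example, and breaking ties in favor of the first region in the view is an assumption.

```python
def select_bins_per_region(bins, view):
    """Option 1 in spirit: collect the bins overlapping each region,
    region by region, then concatenate. Straddling bins repeat."""
    out = []
    for v_chrom, v_start, v_end, name in view:
        for chrom, start, end in bins:
            if chrom == v_chrom and start < v_end and end > v_start:
                out.append((chrom, start, end, name))
    return out


def assign_bins_to_view(bins, view):
    """Option 3 in spirit: give each bin to the region it overlaps most
    and drop bins that overlap no region.
    Assumption: on a tie, the first region in the view wins."""
    assigned = []
    for chrom, start, end in bins:
        best_name, best_overlap = None, 0
        for v_chrom, v_start, v_end, name in view:
            if v_chrom != chrom:
                continue
            overlap = min(end, v_end) - max(start, v_start)
            if overlap > best_overlap:
                best_name, best_overlap = name, overlap
        if best_name is not None:
            assigned.append((chrom, start, end, best_name))
    return assigned


bins = [("chr1", i, i + 1000) for i in range(0, 4000, 1000)]
view = [("chr1", 0, 1300, "A"), ("chr1", 1300, 3000, "B")]

selected = select_bins_per_region(bins, view)  # bin 1000-2000 appears for A and B
assigned = assign_bins_to_view(bins, view)     # bin 1000-2000 goes to B only
```

Here the bin 1000-2000 overlaps region A by 300 bp and region B by 700 bp, so the per-region selection reports it twice while the largest-overlap rule assigns it to B once. The bin 3000-4000 overlaps neither region and is dropped by both.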

nvictus changed the title from "binops" to "bin-table operations" on Apr 3, 2023