Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

0.5.0 roadmap #92

Closed
11 of 27 tasks
gfudenberg opened this issue Aug 26, 2021 · 1 comment
Closed
11 of 27 tasks

0.5.0 roadmap #92

gfudenberg opened this issue Aug 26, 2021 · 1 comment

Comments

@gfudenberg
Copy link
Member

gfudenberg commented Aug 26, 2021

interval operations

io

  • move encode client to sandbox, move parquet to sandbox
  • delete read_ucsc_mrnafile
  • sandbox parse_gtf_attributes().
  • add docstrings for: read_tabix, read_pairix, read_bam, etc.
  • develop tutorial for fetchers, fileops, & io
  • refactor read_gapfile into fetch_centromeres
  • tests for io

Tutorials and Docs

  • add a 'how do I' for finding strand-specific upstream/downstream genomic features

Data

  • populate the data folder with chromsizes for yeast, celegans, mouse, drosophila, ...

Future

  • behavior of on=[] #91
    • add on=[] option for subtract, complement.
    • add on=[] for closest(). challenge is that filtering is done inside the function. consider passing a user-defined filtering function. This could replace ignore_upstream, ignore_overlaps flags.
  • consider replacing the repeated get_default_colnames() and _verify_columns() in each ops with either a decorator or a function that does both.

io

  • either refactor parse_gtf_attributes as read_gtf() with an option to parse, or wrap an existing tool for gtf parsing into a pandas dataframe.

  • new frontend function for locally stored genome assemblies Local sources of truth for common genome assembly metadata #107

    • tutorial entry
    • change default for filter_chroms
    • add a test for the data (e.g. download from original site and check that names & lengths match)
  • allow extract_centromeres to accept a path to a file

  • io.to_bed function to save bedframe into bed (incl. resorting columns, dropping those not compatible with the bed format, and formatting the comment line) io.to_bed() #72

bin ops

Tutorials and Docs

  • add tutorial for finding strand-specific upstream/downstream genomic features
@nvictus nvictus changed the title 0.4.0 roadmap 0.5.0 roadmap Apr 3, 2023
@nvictus
Copy link
Member

nvictus commented Apr 3, 2023

Closing in favor of project

@nvictus nvictus closed this as completed Apr 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants