Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: BWA mem2 index - remove prefix parameter and determine prefix from output #515

Merged
merged 11 commits into from Aug 18, 2022

Conversation

christopher-schroeder
Copy link
Contributor

Description

Remove the param prefix and detect prefix by output

QC

For all wrappers added by this PR, I made sure that

  • there is a test case which covers any introduced changes,
  • input: and output: file paths in the resulting rule can be changed arbitrarily,
  • either the wrapper can only use a single core, or the example rule contains a threads: x statement with x being a reasonable default,
  • rule names in the test case are in snake_case and somehow tell what the rule is about or match the tools purpose or name (e.g., map_reads for a step that maps reads),
  • all environment.yaml specifications follow the respective best practices,
  • wherever possible, command line arguments are inferred and set automatically (e.g. based on file extensions in input: or output:),
  • all fields of the example rules in the Snakefiles and their entries are explained via comments (input:/output:/params: etc.),
  • stderr and/or stdout are logged correctly (log:), depending on the wrapped tool,
  • temporary files are either written to a unique hidden folder in the working directory, or (better) stored where the Python function tempfile.gettempdir() points to (see here; this also means that using any Python tempfile default behavior works),
  • the meta.yaml contains a link to the documentation of the respective tool or command,
  • Snakefiles pass the linting (snakemake --lint),
  • Snakefiles are formatted with snakefmt,
  • Python wrapper scripts are formatted with black.

bio/bwa-mem2/index/wrapper.py Outdated Show resolved Hide resolved
Copy link

@tedil tedil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@fgvieira
Copy link
Collaborator

fgvieira commented Jul 18, 2022

Wouldn't it be cleaner to infer the prefix with os.path.commonprefix()?

Btw, since you are fixing the bwa-mem2 wrapper, do you think you could look into issue #522? It seems a straightforward fix...
Thanks!

@fgvieira fgvieira requested a review from tedil July 18, 2022 18:32
@fgvieira
Copy link
Collaborator

Like mentioned on issue #494, maybe we can define a function to infer the prefixes and add it to snakemake-wrapper-utils. Someting like:

def infer_prefix(files, suffixes, strict = True):
    prefixes = []
    suffixes = set(suffixes)
    
    for file in files:
        for suffix in suffixes:
            if file.endswith(suffix):
                prefixes.append(file[:-len(suffix)])

    if len(prefixes) != len(files) and strict:
        raise ValueError("All files must have a valid suffix.")
    if len(set(prefixes)) != 1:
        raise ValueError("All files must share common prefix.")

    return prefixes[0]

Copy link
Contributor

@johanneskoester johanneskoester left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work, some suggestions below. In general, I like the idea of @fgvieira to move the suffix handling code into snakemake-wrapper-utils, but to get things going I am fine with committing it here for now and moving it there later.

bio/bwa-mem2/index/wrapper.py Outdated Show resolved Hide resolved
bio/bwa-mem2/index/wrapper.py Outdated Show resolved Hide resolved
Co-authored-by: Johannes Köster <johannes.koester@uni-due.de>
Co-authored-by: Filipe G. Vieira <fgarrettvieira@gmail.com>
@johanneskoester johanneskoester merged commit ab8d4ad into master Aug 18, 2022
@johanneskoester johanneskoester deleted the bwamem2index branch August 18, 2022 09:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants