Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix: RSEM-calculate-expression wrapper with FASTQ-files as input + In…
…tegration of test case (#970) <!-- Ensure that the PR title follows conventional commit style (<type>: <description>)--> <!-- Possible types are here: https://github.com/commitizen/conventional-commit-types/blob/master/index.json --> ### Description - Replaced deprecated argument for alignment inputs with appropriate argument (`--bam`-> `--alignments`) - Fixed wrong argument for FASTQ-input: Deleted `input_bam = False` - Integrated test-case for FASTQ-input - Fixed environment-yaml-file: bowtie is needed for FASTQ-file input - updated meta.yaml ### Notes The original provided test-case from the authors for BAM-file inputs includes only an empty BAM-file (except of the header). My test case also includes reads, which do not map to the reference sequence (since the provided reference sequence is too short anyways). So as I understand, the purpose of these tests is just to ensure, whether the pipeline is able to run through. However, to make it more robust additional tests with real working examples would be needed. Co-authored-by: Küchler, Oliver <oliver.kuechler@bih-charite.de> Co-authored-by: Johannes Köster <johannes.koester@uni-due.de>
- Loading branch information
1 parent
26271ac
commit ea9c897
Showing
14 changed files
with
50 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -4,3 +4,4 @@ channels: | |
- nodefaults | ||
dependencies: | ||
- rsem =1.3.3 | ||
- bowtie =1.3.1 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
rule calculate_expression: | ||
input: | ||
# input.bam or input.fq_one must be specified (and if input.fq_one, optionally input.fq_two if paired-end) | ||
fq_one = "fastq/a_R1.fastq", | ||
fq_two = "fastq/a_R2.fastq", | ||
# Index files created by rsem-prepare-reference | ||
reference=multiext("index/reference", ".grp", ".ti", ".transcripts.fa", ".seq", ".idx.fa", ".n2g.idx.fa"), | ||
# reference_bowtie: Additionally needed for FASTQ input; Index files created (by bowtie-build) from the reference transcriptome | ||
reference_bowtie=multiext("index/reference", ".1.ebwt", ".2.ebwt", ".3.ebwt", ".4.ebwt", ".rev.1.ebwt", ".rev.2.ebwt"), | ||
output: | ||
# genes_results must end in .genes.results; this suffix is stripped and passed to rsem as an output name prefix | ||
# this file contains per-gene quantification data for the sample | ||
genes_results="output/a.genes.results", | ||
# isoforms_results must end in .isoforms.results and otherwise have the same prefix as genes_results | ||
# this file contains per-transcript quantification data for the sample | ||
isoforms_results="output/a.isoforms.results", | ||
params: | ||
# optional, specify if sequencing is paired-end | ||
paired_end=True, | ||
# additional optional parameters to pass to rsem, for example, | ||
extra="--seed 42", | ||
log: | ||
"logs/rsem/calculate_expression/a.log", | ||
threads: 2 | ||
wrapper: | ||
"master/bio/rsem/calculate-expression" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
@test:sequence:a/1 | ||
AATAAAGTAAAAGCATGTTCCAGTTTAAGATCAATGCCAAATTTCTGAACTGGGGCATAATACTTACTCTGCATCTTTTACAACACTTCTAAGAGTTAATCACATAGATCGGAAGAGCACACGTC | ||
+ | ||
/<BBBF<FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
@test:sequence:a/2 | ||
TTATTTCATTTTCGTACAAGGTCAAATTCTAGTTACGGTTTAAAGACTTGACCCCGTATTATGAATGAGACGTAGAAAATGTTGTGAAGATTCTCAATTAGTGTATCTAGCCTTCTCGTGTGCAG | ||
+ | ||
BBBBBFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFBFB |
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters