Bump star version #5529

matthdsm · 2024-04-26T11:57:57Z

bump star version
drop mulled container
fix tests
arriba: pytest > nf-test

PR checklist

Closes #XXX

MatthiasZepper

I have no objections to the current implementation, but if you are adding a separate input for the index anyway, you could make it optional and add a fallback in the genomegenerate module that determines NUM_BASES with bash?
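
A minimal sketch of what the first step of such a fallback could look like: getting the genome length straight from the FASTA when no index is supplied. It assumes an uncompressed FASTA and that grep, tr and wc are available in the container. The function name genome_length_from_fasta is hypothetical and not part of the module.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not taken from the PR.
# Prints the total number of sequence characters in an uncompressed FASTA.
genome_length_from_fasta() {
    local fasta="$1"
    # Drop header lines, strip line endings, count what is left.
    # The final tr removes the padding some wc implementations add.
    grep -v '^>' "$fasta" | tr -d '\r\n' | wc -c | tr -d ' '
}
```

The result could then feed the same log2-based formula that the module currently applies to the summed lengths in the .fai file.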

MatthiasZepper · 2024-04-26T12:38:11Z

modules/nf-core/star/genomegenerate/main.nf

-    tuple val(meta), path(fasta)
-    tuple val(meta2), path(gtf)
+    tuple val(meta) , path(fasta)
+    tuple val(meta2), path(fai)


Why did you not make this a tuple val(meta) , path(fasta), path(fai) input?

I'm following the example of a bunch of other modules, AFAIK the guidelines aren't adamant on this. (Can't find where it's specified though)

I didn't mean to suggest that your approach is against the rules.

To me, it just seemed easier to keep the Fasta and its associated index in one channel rather than in two separate ones (which might get out of sync and become disordered).

But if other modules handle it with separate channels, I am fine with that, too. It does not require some possibly confusing .map() operation to pass it to the tool.

Oh yeah, I agree about that. TBH, when this gets merged, the first thing I'll do is patch the module and make one tuple of all the args, but that's just for me.

Until we have a proper way to deal with it in an automatic way, I'd rather we keep each single reference file separated, otherwise it might get complex and messy.
But I agree that I don't like this too much

To me, an index is just the table of contents for the reference, so conceptually not a separate entity. I think there is simply too little use for it on its own.

Personally, I find it convenient if it is automatically attached to the source file, like 42basepairs does, but I see why this opinionated approach is discouraged for a module. Thus, I think you were correct to separate the channels.

MatthiasZepper · 2024-04-26T12:51:03Z

modules/nf-core/star/genomegenerate/main.nf

        END_VERSIONS
        """
    } else {
        """
-        samtools faidx $fasta
-        NUM_BASES=`gawk '{sum = sum + \$2}END{if ((log(sum)/log(2))/2 - 1 > 14) {printf "%.0f", 14} else {printf "%.0f", (log(sum)/log(2))/2 - 1}}' ${fasta}.fai`
+        NUM_BASES=`awk '{sum = sum + \$2}END{if ((log(sum)/log(2))/2 - 1 > 14) {printf "%.0f", 14} else {printf "%.0f", (log(sum)/log(2))/2 - 1}}' ${fai}`


Indeed, there is no need for a specific awk implementation, since nothing in this code would require gawk specifically.

I'm going to replace it with your bash script: awk doesn't publish any version info, and it breaks the conda tests.
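
The bash script referred to here is not shown in the thread. As a rough sketch only, the same value could be derived without awk using bash integer arithmetic. It assumes a standard .fai file with the sequence length in column 2. The name num_bases_from_fai is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not the code merged in the PR.
# Approximates min(14, round(log2(genome_length)/2 - 1)) with integers only.
num_bases_from_fai() {
    local fai="$1" total=0 name len rest
    # Column 2 of a .fai file holds the length of each sequence.
    while IFS=$'\t' read -r name len rest; do
        total=$(( total + len ))
    done < "$fai"

    # floor(log2(total)) by repeated halving.
    local log2=0 n=$total
    while (( n > 1 )); do
        n=$(( n >> 1 ))
        log2=$(( log2 + 1 ))
    done

    # Rounding log2(total)/2 - 1 to the nearest integer picks the largest n
    # with 2^(2n+1) <= total, i.e. (floor(log2(total)) - 1) / 2.
    local bases=$(( (log2 - 1) / 2 ))
    if (( bases > 14 )); then
        bases=14
    fi
    echo "$bases"
}
```

For degenerate inputs (a total length of 1, or totals sitting exactly on a rounding boundary) this integer version can differ from the floating-point awk expression, so it should be read as an illustration rather than a drop-in replacement.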

MatthiasZepper · 2024-04-26T12:52:05Z

modules/nf-core/star/genomegenerate/main.nf

@@ -61,8 +59,6 @@ process STAR_GENOMEGENERATE {
        cat <<-END_VERSIONS > versions.yml
        "${task.process}":
            star: \$(STAR --version | sed -e "s/STAR_//g")
-            samtools: \$(echo \$(samtools --version 2>&1) | sed 's/^.*samtools //; s/Using.*\$//')
-            gawk: \$(echo \$(gawk --version 2>&1) | sed 's/^.*GNU Awk //; s/, .*\$//')


However, if you keep using awk, you also need to publish its version information.
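
For illustration, this is how the sed expression from the removed gawk line reduces a version banner to a bare version number. The banner below is an assumed example of what gawk prints, not output captured from the container. In the module itself, `$` is written as `\$` because the script sits inside a Nextflow string.

```shell
#!/usr/bin/env bash
# Assumed sample of the first line of `gawk --version`; real output may differ.
banner='GNU Awk 5.1.0, API: 3.0 (GNU MPFR 4.1.0, GNU MP 6.2.1)'

# Strip everything up to and including "GNU Awk ", then everything from the
# first ", " onwards, leaving only the version number.
gawk_version=$(echo "$banner" | sed 's/^.*GNU Awk //; s/, .*$//')

echo "gawk: ${gawk_version}"
```

An awk implementation without a comparable version flag gives this pattern nothing to parse, which is the problem raised above.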

MatthiasZepper · 2024-04-26T12:53:53Z

modules/nf-core/star/genomegenerate/meta.yml

@@ -28,6 +28,14 @@ input:
      description: |
        Groovy Map containing reference information
        e.g. [ id:'test' ]
+  - fai:


See above, I would not define a separate channel for the index input, but rather extend the tuple.

adamrtalbot

This should be two separate PRs (one for Arriba, one for STAR), but hey ho.

Code looks fine. I'm a little concerned that it isn't clear why the Docker volume mounts are needed, what they achieve, how they work, etc. Future developers will need to know why they exist.

adamrtalbot · 2024-05-01T10:52:23Z

modules/nf-core/star/align/tests/nextflow.arriba.config

@@ -11,4 +11,4 @@ process {
 }

 // Fix chown issue for the output star folder
-docker.runOptions = '--platform=linux/amd64 -u $(id -u):$(id -g)'
+docker.runOptions = '--platform=linux/amd64 -u $(id -u):$(id -g) -e "HOME=${HOME}" -v /etc/passwd:/etc/passwd:ro -v /etc/shadow:/etc/shadow:ro -v /etc/group:/etc/group:ro -v $HOME:$HOME'


Oof. I'm not a fan of injecting extra config into a test, it makes it deviate from a real use case.

At the very least, can we get some comments on why this is here and what it achieves?
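
The thread does not say why the extra options were added, beyond the existing "chown issue" comment. Going only by what the Docker CLI flags do in general, the option string breaks down as follows. This is a reader's sketch, not the author's explanation.

```shell
#!/usr/bin/env bash
# Same options as in the diff, one per line, annotated with what each
# docker run flag does in general (not why this test needs it).
opts=(
    --platform=linux/amd64            # request the amd64 variant of the image
    -u "$(id -u):$(id -g)"            # run the container as the calling host user and group
    -e "HOME=${HOME}"                 # pass the host HOME path into the container environment
    -v /etc/passwd:/etc/passwd:ro     # read-only bind mount of the host user database
    -v /etc/shadow:/etc/shadow:ro     # read-only bind mount of the host shadow file
    -v /etc/group:/etc/group:ro       # read-only bind mount of the host group database
    -v "$HOME:$HOME"                  # bind mount the host home directory at the same path
)

# Print the options as a single line, as they would appear in runOptions.
printf '%s\n' "${opts[*]}"
```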

matthdsm and others added 3 commits April 26, 2024 11:57

bump STAR, drop mulled container, fix tests

8f1f908

Merge branch 'master' into bump/star

088a73f

prettier

af9ba09

matthdsm marked this pull request as ready for review April 26, 2024 12:05

matthdsm requested review from JoseEspinosa, KevinMenden, ggabernet, grst, fmalmeida, RHReynolds, apeltzer, Vivian-chen16, maxulysse, Joaodemeirelles, drpatelh, praveenraj2018 and a team as code owners April 26, 2024 12:05

linting

c737eb5

MatthiasZepper reviewed Apr 26, 2024

matthdsm added 3 commits April 29, 2024 09:05

fix arriba tests

981bee7

arriba: fix permission denied error

85e7439

try to fix tests

8d39589

maxulysse reviewed Apr 29, 2024

Update modules/nf-core/arriba/arriba/tests/main.nf.test

d0e0a57

Co-authored-by: Maxime U Garcia <max.u.garcia@gmail.com>

adamrtalbot reviewed May 1, 2024
