Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

after skip_diagnostics: true modification. next strain fails #1035

Open
Valentin-Bio-zz opened this issue Dec 1, 2022 · 3 comments
Open
Labels
bug Something isn't working

Comments

@Valentin-Bio-zz
Copy link

Hello, I'm trying to run a next strain analysis using gisaid data. (export augur needed data from gisaid). After exporting the data (fasta and metadata files). I made my own configuration file (attaching here in txt format but it is in iml on my computer).
my_config.txt

The analysis is removing all my samples (7 samples actually) because they fails on the diagnostic

@Valentin-Bio-zz Valentin-Bio-zz added the bug Something isn't working label Dec 1, 2022
@joverlee521
Copy link
Contributor

I'm transferring this issue to the nextstrain/ncov repository because the skip_diagnostics configuration parameter is specific to the ncov workflow.

@joverlee521 joverlee521 transferred this issue from nextstrain/augur Dec 5, 2022
@joverlee521
Copy link
Contributor

Hi @Valentin-Bio,

I did a dry run of the build using your config with:

nextstrain build . --configfile my-ncov-analyses/my_config.yaml --dry-run

I do not see the diagnostic rule in the list of jobs, which means your config file should be properly set to skip the diagnostic rule.

Can you provide some more details to help us diagnose the issue?

  1. Can you share the command you are using to run your build?
  2. Could your samples be filtered out due to the other filter params you have set in your config file? Can you check the results/everything/filtered_log.tsv file to confirm the filter reasons?

@huddlej
Copy link
Contributor

huddlej commented Dec 14, 2022

@Valentin-Bio Another possibility is that Snakemake is pulling in older intermediate files into your analysis that were generated before you updated the skip_diagnostics setting in the config file. Older versions of Snakemake won't re-run the workflow based on parameter changes like this, so your workflow would appear to be running as you expected and still produce the wrong output.

One way around this issue is to force the workflow to run from scratch by adding the --forceall argument to the snakemake or nextstrain build . command you're using.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
No open projects
Status: In Progress
Development

No branches or pull requests

3 participants