-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[False ->dir.makeDir.Success] #53
Comments
Hi Sabeen, First question - are you running this an a local server or one with a job queue? Second - The output you sent me is what is expected when you do a 'print' run - this will let you know which jobs need to be run (at the start, this is all of them). To get the pipe to actually run, you need to add '--style run' to the reddog command ('print' is the default). If you haven't done this, it could be the problem... There are other possible reasons, but we should walk through this if you want to get it working locally. cheers, |
Hi David, I understand the error code 127 means that it cannot find an executable as it's not ins PATH but I can't figure out which executable it needs to find ! Hopefully there is a simple solution to this. Thanks |
What queuing system are you using, SLURM or PBS (qsub)? |
I don't think I'm using any queing system. I've installed it on my local server and trying to run it but it isn't working. I think there is a setting that needs to be changed/fixed and I don't know what it is, I'm running RHEL 7 |
Ah, then the solution may be really easy... in the RedDog_config file move down until you find the following: and change "distributed" to False Change the number of "procs" to fit within the number of processors available for RedDog on your local server - I do advise not all of them! Make sure you don't remove the comma on either line when you edit them. |
Hi David, I get a slightly different error but it's still exit code 127 (see attached file. This is what lscpu gives me.. just to let you know what kind of system I'm trying to run this on. |
Can you send me the config file (sorry, been doing my six month report for PhD) |
yes of course.. no worries, I had to save it as a txt file as this system was not allowing me to attach a python (~.py) file |
Had a read through the config file you sent me and the changes for "distributed" and "procs" were the old settings - looks like the changes weren't saved. Have another go at editing the config file and check the changes are saved before trying the run again... BTW if it does run properly you won't see any messages after hitting 'y' to start the run. The commands will be sent to the command line silently (as long as it keeps working) |
Hi David, It's still nt working. |
Hi Sabeen, Well at least it did get to checkBam which means RedDog is working, just not all the stages yet... Are there BAM files for the two isolates in the output bam folder? If so, what size are they, and what size are the original sequence files? There may be a delay problem between stages (i.e. between the mapping of reads and checking that the BAM was produced) - the files may not be immediately available to RedDog so it thinks they haven't been made. One way to check this is to restart RedDog (with the same command) and see if it gets through the 'checkBam' step. David |
No BAM files in the output bam folder. |
Have a look in the log folder (in the folder where reddog is installed): are there any 'log' files? |
RedDog-pipeline_log.log |
As I thought - running on a local server provides no output on errors... but the log file does provide a way to find out what is going wrong with the mapping. Run the following command and report what happens: This is the smaller set of the two and should take about ten to twenty minutes to map, if it works. |
Ok so it first gave a readable output (see attached) then a whole bunch of gobbledygook and froze the terminal. When I logged back into the machine again and did "top" there was no activity at all. it did produce 2 bam files in the folder: /root/Documents/RedDog_test_output/temp/MS5862459-ID-wgs96-020-CD630/
What next ? |
I removed the larger sequence pair from that folder and am re-running the pipeline: [root@tcmc_sandbox2 RedDog_v1b10_3]# rubra RedDog --config test_config --style run RedDog V1beta.10.3 - phylogeny run Copyright (c) 2016 David Edwards, Bernie Pope, Kat Holt Mapping: Bowtie2 V2.2.9 Output folder: Start Pipeline? (y/n) y Starting pipeline... |
ok that gave me the same error as before but I re-ran the previous step one more time: here is the command I ran: [root@tcmc_sandbox2 RedDog_v1b10_3]# bowtie2 --sensitive-local -x /root/Documents/RedDog_test_output/temp/GCF_000009205.1_ASM920v1_genomic -1 /home/jspinler/RedDog_Test/MS5862459-ID-wgs96-020-CD630_1.fastq.gz -2 /home/jspinler/RedDog_Test/MS5862459-ID-wgs96-020-CD630_2.fastq.gz -X 2000 | samtools view -ubS - | samtools sort - -o /root/Documents/RedDog_test_output/temp/MS5862459-ID-wgs96-020-CD630/MS5862459-ID-wgs96-020-CD630.bam *********output begin I didn't see any changes in any of the folders. |
Can you instead try the test set as found in the reddog wiki? - I know these reads work, so makes trouble shooting the pipeline easier. Then we can look at read sets that then 'break' the pipeline. |
OK SO I ran the test data sets and I literally got the exact same issues. Same error message that ~.bam file does not exist. Then I ran the bowtie on just one seq: *****Command I ran this above command again and directed the output to a log file (attached). Also, here is the pipelog (attached) |
also when I ran it the second time it first output this: |
First of all, this is actually useful as I think I know what is going on... Bowtie2 is working and the bam is being produced, BUT it looks like samtools is not woking, and its at this step in the command the pipe looks to be failing. Can you check which version of samtools you are using? The pipe currently uses SAMtools v1.3.1. As to the reference, reddog only maps to a single reference file, but that reference can be a single genbank or fasta file, or a concatenated set of genbank or fasta files, so you can concatenate CP00038 and CP00039 (genome and plasmid) into a single file then map to that... |
OK after a bit of digging and updating my Samtools to v1.3.1 I managed to runt he whole test dataset through. My system couldnt find bcftools (I fixed that) then it couldn't find vcfutils.pl (I fixed that as well).. then when I finally ran it again it didn't give me any errors. Right now I'm running the original test dataset (jspinler's) through the pipeline and following it on "top" I see that bowtie is only using 1 core for each instance of bowtie even though other users on the system as able to run bowtie faster. I have 2 CPUs with 8 cores each. right now my "proc" is set to 24.. do you think I can increase that so that bowtie runs faster ? Thanks for ALL your help ! We really do appreciate it. |
Hi Sabeen, |
Hi,
I'm very new at this and not sure what it is that I'm doing wrong.
I think I've installed all the dependencies correctly but when I try to run a small test dataset through the pipeline it's failing.
I'm attaching the output for you. I'm sure it's because the pipeline is failing to make an output dir.
it keeps giving thew error:
Job needs update: Missing file /root/Documents/RedDog_test_output/temp/~.Success
at every stage.
Please help.
test_run1.txt
Thanks,
Sabeen.
The text was updated successfully, but these errors were encountered: