how to make RedDog fast? #64

Open
MostafaYA opened this issue Jul 19, 2019 · 2 comments

Comments

@MostafaYA

Hi,
Are there any tips for running the RedDog pipeline faster? It takes a long time even on only 2 samples.
Do I understand correctly that each sample is processed on only one CPU, even if more CPUs are available? Would you recommend parallelizing with the "parallel" command?

Thanks

@quocviet0908

quocviet0908 commented Jul 21, 2019

Hi there,
From what I've read in the manual, you should only run 1 sample/1 pipeline at any given time.
You can increase the number of CPUs to boost performance (I think) by editing the config file; the manual already shows how.

@d-j-e
Collaborator

d-j-e commented Jul 25, 2019

Hi MostafaYA,

You don't say what system you are running on. RedDog is designed to run on a distributed system, where lots of jobs are sent out. And yes, we make do with one core per read set (sample), as many of our data sets number in the hundreds or thousands. You could tinker with the commands in the config file to use more than one core for certain steps, though you would have to add the extra CPUs to the command as well... A lot of the shorter steps don't really need to be parallelised. Happy to help if you want to give it a try, though I am no 'parallel' expert.
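
As a rough illustration of the "tinker with the commands" idea, here is a minimal Python sketch that inserts a thread flag into a command string. The `stage` dictionary, its key name, and the command text are made up for the example and are not copied from RedDog's config file; `add_threads` is a hypothetical helper. The only real options assumed are bowtie2's `-p` (threads) flag. Whatever cores the command uses would still need to be requested from the scheduler to match, which this sketch does not cover.

```python
# Hypothetical stage entry: key name and command are illustrative only,
# not taken from RedDog's actual config file.
stage = {
    "command": "bowtie2 --sensitive-local -x %ref -1 %seq1 -2 %seq2 | samtools view -ubS - > %out",
}


def add_threads(command: str, tool: str, flag: str, threads: int) -> str:
    """Insert `flag threads` right after the first occurrence of `tool`."""
    tokens = command.split(" ")
    if tool not in tokens:
        raise ValueError(f"{tool!r} not found in command")
    i = tokens.index(tool)
    return " ".join(tokens[: i + 1] + [flag, str(threads)] + tokens[i + 1 :])


# Ask bowtie2 for 4 threads; the rest of the command is left untouched.
threaded = add_threads(stage["command"], "bowtie2", "-p", 4)
print(threaded)
```

Only the long-running steps (typically read mapping) are likely to benefit; as noted above, the shorter steps don't really need it.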


3 participants