Skip to content
This repository has been archived by the owner on Mar 16, 2022. It is now read-only.

fc_ovlp_filter adding support for las files rather only preads_lasfiles.fofn #611

Open
mictadlo opened this issue Feb 26, 2018 · 2 comments

Comments

@mictadlo
Copy link

Hi,
Could you please add support to fc_ovlp_filter for las files rather only preads_lasfiles.fofn? At the moment I do it with the following bash script.

#!/bin/bash

rm fc_ovlp_filter_cmds.txt
for filename in $(find . -type f -name "assemblyDB.*.las"); do
   fn=$(basename $filename)
   echo $fn
   no=$(ls $fn | sed 's/assemblyDB.//' | sed 's/.las//')
   echo $no
   $(ls $fn > ${fn}.fofn)
   
   echo "fc_ovlp_filter --db assemblyDB --max_diff 100 --max_cov 100 --min_cov 1 --bestn 10 --n_core 1 --fofn ${fn}.fofn --min_len 6973 --out-fn preads.${no}.ovl" >> fc_ovlp_filter_cmds.txt
done

The above solution allows me to distrubute the task on multiple nodes.

Can I just do cat preads.*.ovl > preads.ovl or do I have to sort the file?

Thank you in advance.

Michal

@pb-cdunn
Copy link

Your solution is fine. The order of preads.ovl does not matter, aside from reproducibility of results.

But are you sure that step needs to be split and parallelized? What is your resource constraint? We would split that ourselves if we were aware of a problem.

@mictadlo
Copy link
Author

I think it used a lot of memory and took a while to run.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants