Why not use dxfuse throughout the entire regenie workflow #33
Comments
Hello,
Thank you for your response. I'd like to suggest that for Parts C to G of the regenie_workflow, dxfuse seems more convenient. It might let us skip the -iin arguments and simply refer to files under /mnt/project in our commands. I'm not sure I'm correct, though; perhaps dxfuse has downsides that I'm not aware of?
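To make the two options concrete, here is a minimal sketch that only builds and prints the two kinds of `dx run app-swiss-army-knife` calls being compared; it does not submit anything. The file path `/Bulk/example/chr1.bgen` and the helper names are hypothetical, and the regenie command is deliberately abbreviated (a real step 2 call needs more flags, such as the phenotype and prediction files).

```python
import shlex

APP = "app-swiss-army-knife"


def sak_with_iin(project_paths, cmd):
    """Build a `dx run` call that passes each file via -iin.

    Files passed this way are staged into the job's working directory,
    so the command refers to them by bare file name.
    """
    args = ["dx", "run", APP]
    args += [f"-iin={path}" for path in project_paths]
    args.append(f"-icmd={cmd}")
    return args


def sak_with_mount(cmd):
    """Build a `dx run` call with no -iin; the command reads from /mnt/project."""
    return ["dx", "run", APP, f"-icmd={cmd}"]


# Hypothetical project path, used for illustration only.
bgen = "/Bulk/example/chr1.bgen"

# Abbreviated regenie commands: remaining step 2 flags are omitted here.
iin_call = sak_with_iin([bgen], "regenie --step 2 --bgen chr1.bgen --out chr1")
mount_call = sak_with_mount(
    f"regenie --step 2 --bgen /mnt/project{bgen} --out chr1"
)

print(shlex.join(iin_call))
print(shlex.join(mount_call))
```

The only difference is where the command finds the file: by name in the working directory after -iin staging, or by its full project path under /mnt/project.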
@oklempir-cf Do you have experience with using -iin arguments versus dxfuse in Swiss Army Knife?
Yes, @iamyingzhou, I think you are right. Using dxfuse can also be a viable working solution for specific parts of this pipeline. I consider it a functional alternative to -iin, and it may be even more useful in some cases, especially for larger files that can be read sequentially. (In my experience dxfuse needs non-random read access: it might fail when a program reads in random order, "jumping from one place to another" in the file being processed.) There might be other reasons to avoid dxfuse and use -iin instead; see the section on limitations at https://github.com/dnanexus/dxfuse. So for advanced users who can use dxfuse efficiently, dxfuse might well be the better choice.
Now, why I would prefer the -iin option and go with it first, IMO:
Finally, to complicate things even more :), have a look at "dx-mount-all-inputs" (or something similarly named) in SAK, which is a kind of hybrid of the two solutions above (if I understand and remember it correctly; it has been a while since I used it in my work).
Ondrej
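As an illustration of the sequential access pattern mentioned above, here is a small sketch that reads a file front to back in fixed-size chunks and never seeks. It is not taken from the workflow; the function name is hypothetical, and the demo runs on a local temporary file, whereas on a worker the path would be something under /mnt/project.

```python
import hashlib
import os
import tempfile


def sha256_sequential(path, chunk_size=1 << 20):
    """Hash a file by reading it strictly front to back, one chunk at a time."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Each read continues where the previous one ended; no seek() calls.
        while chunk := fh.read(chunk_size):
            digest.update(chunk)
    return digest.hexdigest()


# Demo on a local temporary file standing in for a mounted project file.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"example bytes" * 1000)
demo_path = tmp.name

demo_digest = sha256_sequential(demo_path, chunk_size=4096)
os.remove(demo_path)
print(demo_digest)
```

A tool that instead jumps around inside the file, for example by following an index, is the kind of access described above as potentially problematic on a dxfuse mount.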
Thank you for your insightful suggestions and detailed explanation.
Thanks for your wonderful work.
I've noticed that running the regenie workflow with dxfuse is quite convenient and can significantly reduce the time spent downloading data, especially in step 2. Why doesn't our example code fully incorporate dxfuse?