Update tests to use new version of Sailfish #1

rob-p · 2015-09-04T17:41:24Z

Hi guys,

I've made the changes necessary to use the new version of Sailfish (v 0.7.0 and greater) with these benchmarks. There should be substantial improvements in both speed and accuracy, but the interface with the program has changed a bit. Let me know if you have any questions or want any more information.

Best,
Rob

Update Sailfish version

Any line starting with '#' is a header. The estimated number of reads is now in column 4 instead of 7.

Since sailfish includes the entire FASTA header as the name of the target, we have to process these output files for the rest of the pipeline to work.

Add code to fix the target names

-k argument no longer required for sailfish

mqbssppe · 2015-09-08T10:26:51Z

Dear Rob,

we'll try the most recent version of sailfish in our benchmarks. Thank you for letting us know.

Best

Panos

rob-p · 2015-09-08T14:28:30Z

Hi Panos,

Thanks! Please keep me posted on how it performs in your tests; I'll be very interested to know. As you can see, some things about the interface have changed and my pull request should deal with all of them. However, if you run into any issues (as I've not run all of the tests with these changes), please let me know.

Best,
Rob

mqbssppe · 2015-10-01T17:05:53Z

Hi Rob,

finally I've found some time to run the new version of sailfish. It seems that your modifications are quite cool, since they are improving sailfish performance. The main difference I observed is that now sailfish exhibits a significantly larger mapping rate (approximately 95%) compared to version 0.6.3 which mapped almost 63% (as we report in the manuscript).

The new results are shown in the attached *pdf files

simulation criteria (corresponds to Fig.2 of the manuscript)
simCriteriaUpdatedwithSailfish0.7.3.pdf

run-times (corresponds to Fig.3 of the manuscript)
sim-spanki-times-updatedWithSailfish0.7.3.pdf

Thank you for letting us know for this updated version.

Best

Panos

rob-p · 2015-10-02T15:14:28Z

Hi Panos,

Thanks so much for testing this and reporting it here! The big change between Sailfish v0.6.3 and versions >= v0.7 is the move from simple k-mer counting to making use of quasi-alignments produced by RapMap. This improves the speed, as quasi-alignments, which make use of a suffix array, get rid of the need to hash all k-mers independently. It also improves the memory usage as well since the suffix array acts as a fairly compact representation of the transcriptome. This has re-invigorated development of Sailfish — we're up to version 0.7.6 now — and led us to back-port some useful features from Salmon (e.g. optional posterior Gibbs sampling --useGSOpt and Variational Bayesian EM optimization --useVBOpt) and even implement some new ones (e.g. bootstrap sampling).

Interestingly, we find that the Variational Bayesian EM seems to perform better in our testing than the "standard" EM algorithm originally adopted by Sailfish and used by Kallisto. Thus, we're very excited by your even better natural gradient-based approach to optimizing the variational objective (since our optimization step is already a minor slice of the overall runtime, we're primarily interested in the improved accuracy)!

Thanks!
Rob

rob-p added 7 commits September 4, 2015 08:15

Update set-path-variable.sh

51b8eed

Update Sailfish version

Remove deprecated SF bias correction

1f05c48

Change to new Sailfish library format string

651f0bb

Convert to new SF output format

a469004

Any line starting with '#' is a header. The estimated number of reads is now in column 4 instead of 7.

Create processSailfishNames.py

90f646a

Since sailfish includes the entire FASTA header as the name of the target, we have to process these output files for the rest of the pipeline to work.

Update commands.sh

ea3216b

Add code to fix the target names

Update generate-known-fa-files.sh

6223620

-k argument no longer required for sailfish

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update tests to use new version of Sailfish #1

Update tests to use new version of Sailfish #1

rob-p commented Sep 4, 2015

mqbssppe commented Sep 8, 2015

rob-p commented Sep 8, 2015

mqbssppe commented Oct 1, 2015

rob-p commented Oct 2, 2015

Update tests to use new version of Sailfish #1

Are you sure you want to change the base?

Update tests to use new version of Sailfish #1

Conversation

rob-p commented Sep 4, 2015

mqbssppe commented Sep 8, 2015

rob-p commented Sep 8, 2015

mqbssppe commented Oct 1, 2015

rob-p commented Oct 2, 2015