Skip to content

Releases: torognes/vsearch

VSEARCH 2.28.1

26 Apr 13:20
d98284e
Compare
Choose a tag to compare

The sintax command has been improved in several ways in this version of vsearch. Please note that several details of this algorithm is not clearly described in the preprint, and the implementation in vsearch differs from that in usearch.

The former vsearch version did not always choose the most common taxonomic entity over the 100 bootstraps among the database sequences with the highest amount of word similarity to the query. Instead, if several sequences had an equal similarity with the query, the sequence encountered in the earliest bootstrap was chosen. The confidence level was calculated based on this sequence compared to the selected sequences from the other 99 bootstraps. This could lead to a suboptimal choice with a low confidence. In the new version, the most common of the sequences with the highest amount of word similarity across the 100 bootstraps will be selected, and ties will be broken randomly.

Another problem with the old implementation was that if several sequences had the same amount of word similarity, the shortest one in the reference database would be chosen, and if they were equally long, the earliest in the database file would be chosen. A new option called sintax_random has now been introduced. This option will randomly select one of the sequences with the highest number of shared words with the query, without considering their length or position. This avoids a bias towards shorter reference sequences. This option is strongly recommended and will probably soon be the default.

Furthermore, a ninth taxonomic rank, strain (letter t), is now recognized. The speed of the sintax command has also been significantly improved at least in some cases. Run vsearch with the randseed option and 1 thread to ensure reproducibility of the random choices in the algorithm.

VSEARCH 2.27.1

06 Apr 15:46
8586fae
Compare
Choose a tag to compare

This version fixes the weak_id option and makes searches report weak hits in some cases. It also updates the names of the compression libraries to libz.so.1 and libbz2.so.1 on Linux to make them work on common Linux distributions without installing additional packages. README.md has been updated with information about compression libraries on Windows.

VSEARCH 2.27.0

19 Jan 09:25
0907b6f
Compare
Choose a tag to compare

The usearch_global and search_exact commands now support FASTQ files as well as FASTA files as input. This version of vsearch includes clarifications and updates to the manual. Some code has been refactored. Generic Dockerfiles for major Linux distributions have been included. Some warnings from compilers and other tools have been eliminated. The release for Windows will also include DLL's for the two compression libraries.

VSEARCH 2.26.1

25 Nov 13:22
480c7f2
Compare
Choose a tag to compare

Enable the maxseqlength and minseqlength options for the chimera detection commands. When the usearch_global or search_exact commands are used, OTU tables will include samples and OTUs with no matches.

The previous release 2.26.0 was removed because the version number had not been updated from 2.25.0 in the enclosed source code. To avoid confusion the previous release has been removed and replaced by this version 2.26.1.

VSEARCH 2.25.0

10 Nov 17:33
ddf6073
Compare
Choose a tag to compare

Allow a given percentage of mismatches, specified with the chimeras_diff_pct option, between chimeras and parents for the experimental chimeras_denovo command.

VSEARCH 2.24.0

26 Oct 15:40
8db8277
Compare
Choose a tag to compare

Update documentation. Improve code. Allow up to 20 parents for the undocumented and experimental chimeras_denovo command. Fix compilation warnings for sha1.c. Compile for release (not debug) by default.

VSEARCH 2.23.0

07 Jul 11:20
e6e7db3
Compare
Choose a tag to compare

Update documentation. Add citation file. Modernize and improve code. Fix several minor bugs. Fix compilation with GCC 13. Print stats after fastq_mergepairs to log file instead of stderr. Handle sizein option correctly with dbmatched option for usearch_global. Allow maxseqlength option for makeudb_usearch. Fix memory allocation problem with chimera detection. Add lengthout and xlength options. Increase precision for eeout option. Add warning about sintax algorithm, random seed and multiple threads. Refactor chimera detection code. Add undocumented experimental long_chimeras_denovo command. Fix segfault with clustering. Add more references.

VSEARCH 2.22.1

19 Sep 09:11
Compare
Choose a tag to compare

Add the derep_smallmem command for dereplication with less memory usage. Remove compiler warnings.

VSEARCH 2.21.2

12 Sep 16:13
Compare
Choose a tag to compare

Fix problems with the --lcaout option.

VSEARCH 2.21.1

18 Jan 17:35
Compare
Choose a tag to compare

Fix dereplication with empty input file.