Skip to content
@daac-tools

daac-tools

Pinned

  1. daachorse daachorse Public

    🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

    Rust 190 12

  2. vaporetto vaporetto Public

    🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

    Rust 218 10

  3. crawdad crawdad Public

    🦞 Rust library of natural language dictionaries using character-wise double-array tries.

    Rust 27 2

  4. vibrato vibrato Public

    🎤 vibrato: Viterbi-based accelerated tokenizer

    Rust 303 14

  5. rucrf rucrf Public

    Conditional Random Fields implemented in pure Rust

    Rust 6 2

  6. trie-match trie-match Public

    Fast match expression optimized for string comparison

    Rust 31

Repositories

Showing 10 of 13 repositories
  • vaporetto Public

    🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

    Rust 218 Apache-2.0 10 0 4 Updated May 31, 2024
  • vibrato Public

    🎤 vibrato: Viterbi-based accelerated tokenizer

    Rust 303 Apache-2.0 14 5 0 Updated May 30, 2024
  • daachorse Public

    🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

    Rust 190 Apache-2.0 12 1 1 Updated May 30, 2024
  • trie-match Public

    Fast match expression optimized for string comparison

    Rust 31 Apache-2.0 0 0 0 Updated Jan 29, 2024
  • python-vibrato Public

    Viterbi-based accelerated tokenizer (Python wrapper)

    Rust 34 Apache-2.0 1 0 0 Updated Sep 5, 2023
  • python-vaporetto Public

    🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.

    Rust 20 Apache-2.0 1 0 0 Updated Sep 5, 2023
  • vaporetto-models Public

    Tokenization models and training scripts for Vaporetto fast tokenizer

    Rust 0 Apache-2.0 0 0 0 Updated May 30, 2023
  • crawdad Public

    🦞 Rust library of natural language dictionaries using character-wise double-array tries.

    Rust 27 Apache-2.0 2 0 0 Updated Feb 20, 2023
  • include-bytes-zstd Public

    Includes a file with zstd compression in Rust

    Rust 9 Apache-2.0 0 0 0 Updated Feb 17, 2023
  • guidelines Public

    Guidelines for daac-tools community

    0 0 0 0 Updated Feb 16, 2023

Top languages

Loading…

Most used topics

Loading…