Skip to content
@daac-tools

daac-tools

Pinned

  1. daachorse daachorse Public

    🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

    Rust 189 12

  2. vaporetto vaporetto Public

    🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

    Rust 215 10

  3. crawdad crawdad Public

    🦞 Rust library of natural language dictionaries using character-wise double-array tries.

    Rust 26 2

  4. vibrato vibrato Public

    🎤 vibrato: Viterbi-based accelerated tokenizer

    Rust 299 14

  5. rucrf rucrf Public

    Conditional Random Fields implemented in pure Rust

    Rust 6 2

  6. trie-match trie-match Public

    Fast match expression optimized for string comparison

    Rust 31

Repositories

Showing 10 of 13 repositories
  • vaporetto Public

    🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

    Rust 215 Apache-2.0 10 0 3 Updated Apr 15, 2024
  • vibrato Public

    🎤 vibrato: Viterbi-based accelerated tokenizer

    Rust 299 Apache-2.0 14 5 0 Updated Feb 19, 2024
  • trie-match Public

    Fast match expression optimized for string comparison

    Rust 31 Apache-2.0 0 0 0 Updated Jan 29, 2024
  • python-vibrato Public

    Viterbi-based accelerated tokenizer (Python wrapper)

    Rust 34 Apache-2.0 1 0 0 Updated Sep 5, 2023
  • python-vaporetto Public

    🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.

    Rust 20 Apache-2.0 1 0 0 Updated Sep 5, 2023
  • daachorse Public

    🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

    Rust 189 Apache-2.0 12 1 0 Updated Aug 27, 2023
  • vaporetto-models Public

    Tokenization models and training scripts for Vaporetto fast tokenizer

    Rust 0 Apache-2.0 0 0 0 Updated May 30, 2023
  • crawdad Public

    🦞 Rust library of natural language dictionaries using character-wise double-array tries.

    Rust 26 Apache-2.0 2 0 0 Updated Feb 20, 2023
  • include-bytes-zstd Public

    Includes a file with zstd compression in Rust

    Rust 9 Apache-2.0 0 0 0 Updated Feb 17, 2023
  • guidelines Public

    Guidelines for daac-tools community

    0 0 0 0 Updated Feb 16, 2023

Top languages

Loading…

Most used topics

Loading…