kbatsuren
Pinned

  1. sigmorphon/2024TokenST Public

    SIGMORPHON 2024 Shared Task on Subword Tokenization

    2

  2. CogNet Public

    CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates

    40 9

  3. MorphyNet Public

    MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)

    32 12

  4. wiktra Public

    Wiktra: a Python tool for Wiktionary's transliteration modules, covering 514 languages and 102 different scripts (orthographies)

    Lua 25 5

  5. sigmorphon/2022SegmentationST Public

    SIGMORPHON 2022 Shared Task on Morpheme Segmentation

    Jupyter Notebook 23 13

  6. unimorph/umLabeller Public

    Inspection tool for characterizing the semantic compositionality of subword tokenization in English

    Python 3