Skip to content
This repository has been archived by the owner on Jul 30, 2022. It is now read-only.

Releases: martijndeb/haxe-linguistics

Butter, bread and green cheese

16 Aug 15:11
Compare
Choose a tag to compare
Pre-release

This release adds Frisian as a language. It's being treated as a secend class citizen for now whilst English, Dutch and German remain to have the primary focus.

Also new in this release are the separation of token filtering from the tokenizers, and thus all tokenizers must implement the new ITokenFilter. One such example is the new StopwordTokenFilter, which uses the updated stopwords lists in languages.

You can now use the new BasicStringBuilder to convert a token list back to a string

Smack my Bayes up

08 Aug 14:16
Compare
Choose a tag to compare
Smack my Bayes up Pre-release
Pre-release

Adds support for Dictionaries, which allow you to extract unique words from a text and keep a count of them.
Adds support for calculation Levenshtein distance in strings.
Adds support for Naive Bayes classification.

The Basics

07 Aug 18:33
Compare
Choose a tag to compare
The Basics Pre-release
Pre-release

First release created and submitted as haxelib.
Contains a basic tokenizer supporting English and Dutch as languages.
Provides complete test coverage for this release.