Releases: INGEOTEC/microtc
Version - 2.4.10
Version - 2.4.9
No unique order in TFIDF.counter2weight
Version - 2.4.8
fit method in TFIDF
Version - 2.4.7
This version includes a parameter to disable the unit-vector normalization.
Version - 2.4.6
This version improves the performance of SparseMatrix
Version - 2.4.5
This version fixes the bugs related to the non-breaking space and the single quotation mark.
Version - 2.4.4
This version adds a parameter to disable text transformations. This is useful when another process has already applied the text transformations.
Version - 2.4.3
This version includes a parameter to create the bag-of-words model with a maximum number of terms. For example, the following code creates a model with at most 1024 terms (tokens) using q-grams (2, 3, 4) and words.
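from microtc import TextModel

# -1 selects words; 2, 3, and 4 select q-grams of those sizes
tm = TextModel(token_list=[-1, 2, 3, 4],
               token_max_filter=1024,  # cap on the number of terms
               max_dimension=True)     # enable the maximum-dimension mode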
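The idea of capping the vocabulary can be illustrated without the library. The following is a minimal sketch in plain Python, not microtc's implementation: the helper names `tokenize` and `build_vocabulary` are hypothetical, and it assumes that terms are ranked by document frequency, which may differ from the criterion microtc actually applies.

```python
from collections import Counter


def tokenize(text, q_sizes=(2, 3, 4)):
    """Return word tokens plus character q-grams (hypothetical helper)."""
    text = text.lower()
    tokens = text.split()  # word tokens
    for q in q_sizes:
        # every substring of length q, including those spanning spaces
        tokens.extend(text[i:i + q] for i in range(len(text) - q + 1))
    return tokens


def build_vocabulary(docs, max_terms):
    """Map at most max_terms tokens to ids, ranked by document frequency.

    Assumption: the ranking criterion is illustrative only.
    """
    doc_freq = Counter()
    for doc in docs:
        # count each token once per document
        doc_freq.update(set(tokenize(doc)))
    ranked = doc_freq.most_common(max_terms)
    return {token: idx for idx, (token, _) in enumerate(ranked)}


docs = ["good morning", "good night", "morning coffee"]
vocab = build_vocabulary(docs, max_terms=8)
print(len(vocab))
```

Tokens outside the retained vocabulary would simply be ignored when a document is turned into a vector, so the resulting representation never has more than `max_terms` dimensions.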