This repo contains my text classification algorithms.
Inputs:
- Text articles from a given pre-defined topic
- Labels indicating which articles belong to the topic and which do not
Outputs:
- A logistic regression model based on tf-idf features that predicts, for each article, whether the article is appropriate for the topic
- N-grams with their corresponding parameter weights
- Confidence values for each prediction
- Test results from parameter tuning
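
The inputs and outputs listed above can be illustrated with a minimal sketch. This is not the code in this repo: it is a standard-library-only toy that assumes word uni- and bigrams, smoothed idf, L2-regularised logistic regression trained by gradient descent, and a small made-up dataset. All function names, parameter values, and example articles are hypothetical.

```python
import math
from collections import Counter

def ngrams(text, n_max=2):
    """Return all word n-grams of length 1..n_max from lowercased text."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for n in range(1, n_max + 1)
            for i in range(len(words) - n + 1)]

def fit_idf(docs, n_max=2):
    """Learn smoothed idf values for every n-gram seen in the training docs."""
    df = Counter(g for d in docs for g in set(ngrams(d, n_max)))
    return {g: math.log((1 + len(docs)) / (1 + c)) + 1 for g, c in df.items()}

def tfidf(doc, idf, n_max=2):
    """Map one document to an L2-normalised sparse tf-idf vector (a dict)."""
    tf = Counter(g for g in ngrams(doc, n_max) if g in idf)
    vec = {g: c * idf[g] for g, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {g: v / norm for g, v in vec.items()}

def predict_proba(x, w, b):
    """Confidence that the article belongs to the topic."""
    z = b + sum(w.get(g, 0.0) * v for g, v in x.items())
    return 1.0 / (1.0 + math.exp(-z))

def train(X, y, l2=0.01, lr=1.0, epochs=300):
    """Fit logistic regression by batch gradient descent with an L2 penalty."""
    w, b, n = {}, 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = Counter(), 0.0
        for x, label in zip(X, y):
            err = predict_proba(x, w, b) - label
            grad_b += err
            for g, v in x.items():
                grad_w[g] += err * v
        for g, gw in grad_w.items():
            w[g] = w.get(g, 0.0) - lr * (gw / n + l2 * w.get(g, 0.0))
        b -= lr * grad_b / n
    return w, b

# Made-up toy data: 1 = belongs to the topic (football), 0 = does not.
train_docs = [
    "the team won the football match",
    "the striker scored a late goal",
    "football fans cheered the goal",
    "the central bank raised interest rates",
    "stock markets fell after the rates decision",
    "the bank reported quarterly earnings",
]
train_labels = [1, 1, 1, 0, 0, 0]
test_docs = ["a late goal decided the football match",
             "interest rates and the stock markets"]
test_labels = [1, 0]

idf = fit_idf(train_docs)
X_train = [tfidf(d, idf) for d in train_docs]
X_test = [tfidf(d, idf) for d in test_docs]

# Parameter tuning: held-out accuracy for a few L2 strengths.
tuning = {}
for l2 in (0.001, 0.01, 0.1):
    w, b = train(X_train, train_labels, l2=l2)
    hits = sum((predict_proba(x, w, b) >= 0.5) == bool(t)
               for x, t in zip(X_test, test_labels))
    tuning[l2] = hits / len(test_docs)

# Final model, per-article confidences, and the most on-topic n-grams.
weights, bias = train(X_train, train_labels, l2=0.01)
confidences = [predict_proba(x, weights, bias) for x in X_test]
top_ngrams = sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:5]

print("tuning results:", tuning)
print("confidences:", [round(c, 3) for c in confidences])
print("top n-grams:", top_ngrams)
```

Here `weights` holds the n-grams with their parameter weights, `confidences` holds one confidence value per test article, and `tuning` holds the test results from parameter tuning. In practice a library such as scikit-learn would likely replace the hand-written tf-idf and training loops.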