Skip to content
#

20newsgroup

Here are 23 public repositories matching this topic...

We created a topic modeling pipeline to evaluate different topic modeling algorithms, including their performance on short and long text, preprocessed and not preprocessed datasets, and with different embedding models. Finally, we summarized the results and suggested how to choose algorithms based on the task.

  • Updated Aug 26, 2022
  • Jupyter Notebook

FEMDA: Robust classification with Flexible Discriminant Analysis in heterogeneous data. Flexible EM-Inspired Discriminant Analysis is a robust supervised classification algorithm that performs well in noisy and contaminated datasets.

  • Updated Sep 6, 2022
  • Python

This project offers advanced techniques in text preprocessing, word embeddings, and text classification. Explore methods like Word2Vec and GloVe, and master Multinomial Naive Bayes for accurate predictions. Dive into the world of text clustering and conquer challenges like unbalanced data.

  • Updated Oct 31, 2023
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the 20newsgroup topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the 20newsgroup topic, visit your repo's landing page and select "manage topics."

Learn more