20newsgroup

Here are 23 public repositories matching this topic...

deekshakoul / Graph-based-text-classification

Project work for the E0-334 Deep Learning for Natural Language Processing course at IISc, Bengaluru. We proposed a graph-based model for text classification.

deep-learning text-classification cnn r8 20newsgroup

Updated Sep 28, 2021
Python

filipefilardi / text-mining

Generic corpus-cleaning script built with the tm package

machine-learning text-mining corpora corpus-data 20newsgroup

Updated Nov 10, 2020
R

VeereshElango / tsne-visualizations

This repository contains notebooks that explore the t-SNE algorithm by applying it to various datasets

python pca dimensionality-reduction tsne tsne-plot 20newsgroup 20-newsgroup-dataset

Updated Apr 5, 2020
Jupyter Notebook

rahup97 / 20Newsgroups-classifier

Naive Bayes classifier and Boolean retrieval on the 20Newsgroups dataset, both written from scratch. Extremely lightweight and produces decent results. Classification using word embeddings is currently in progress.

python machine-learning information-retrieval cpp keras naive-bayes-classifier glove 20newsgroup

Updated Jul 18, 2018
Python

tyrannorrec / CS6120-NLP-Final-Project

Classifies human-generated and machine-generated text using (1) a single-score threshold classifier and (2) a neural network classifier, both based on perplexities and probability scores generated from n-grams. Best results are 77% for the single-score classifier and 80% for the ANN classifier.

python tensorflow numpy pandas nltk ngram neuralnetwork 20newsgroup gpt2 ablation-study

Updated May 24, 2023
Jupyter Notebook

Purushothaman-natarajan / NLP-TEXT-PROCESSING

This project offers advanced techniques in text preprocessing, word embeddings, and text classification. Explore methods like Word2Vec and GloVe, and master Multinomial Naive Bayes for accurate predictions. Dive into the world of text clustering and conquer challenges like unbalanced data.

python data-science machine-learning natural-language-processing data-mining text-classification word2vec text-processing glove-embeddings text-clustering 20newsgroup k-fold-cross-validation

Updated Oct 31, 2023
Jupyter Notebook

rvitorgomes / kmeans-20news

K-means and SOM clustering for the 20newsgroup dataset

machine-learning scikit-learn som conda kmeans preprocessing tokenization scikit 20newsgroup

Updated Jun 6, 2018
Jupyter Notebook

sagahansson / lt2212-v20-a2

Assignment 2 – Dimensionality reduction and text classification: converted news text into a machine-readable representation, reduced the dimensions of the text representation, and trained classifiers to decide which of 20 newsgroups a sample belongs to.

text-classification scikit-learn nltk dimensionality-reduction 20newsgroup

Updated Jun 26, 2020
Python

Jonadler1 / Topic-Modeling-Techniques-

NLP Topic Modeling Techniques (LDA, LSA & BERTopic)

nlp topic-modeling latent-dirichlet-allocation nlp-machine-learning latent-semantic-analysis ted-talks 20newsgroup bertopic

Updated Nov 6, 2022
HTML

iremkaraoglu / Logistic-vs-NaiveBayes

python machine-learning naive-bayes jupyter-notebook logistic-regression 20newsgroup

Updated Apr 8, 2019
Jupyter Notebook

Vaibhav-Khera / Naive-Bayes_Text-Classification

Implements a Naive Bayes text classifier for the 20newsgroups dataset

text-classification naive-bayes-classifier multinomial-naive-bayes gaussian-naive-bayes-implementation 20newsgroup

Updated May 25, 2020
Jupyter Notebook

Andrewwango / femda

FEMDA: Robust classification with Flexible Discriminant Analysis in heterogeneous data. Flexible EM-Inspired Discriminant Analysis is a robust supervised classification algorithm that performs well in noisy and contaminated datasets.

machine-learning classification em-algorithm quadratic-discriminant-analysis linear-discriminant-analysis fashion-mnist discriminant-analysis 20newsgroup robust-estimation robust-statistics

Updated Sep 6, 2022
Python

screddy1313 / Language-modelling

This project generates sentences using n-grams

ngrams language-modelling textgeneration 20newsgroup perplexity log-probabilty

Updated Dec 18, 2019
Jupyter Notebook

Gokultcr / NLP-20newsgroup-data

news nlp-machine-learning nltk-library 20newsgroup

Updated Nov 17, 2021
Jupyter Notebook

Soumyajain29 / Graph-Based-Text-Classification

This repository contains code for our project work for the E0-334 Deep Learning for Natural Language Processing course at IISc, Bengaluru. We proposed a graph-based model for text classification.

deep-learning graph graph-algorithms text-classification movie-reviews r8 20newsgroup