Skip to content

Implementation of a news headline classification model on Bengali News Headline Dataset over PyTorch.

Notifications You must be signed in to change notification settings

Mahamudul-Hasan-Rafi/Bengali-News-Headline-Classification

Repository files navigation

Bangla News Headlines Categorization Using Long Short-Term Memory (LSTM): Project Overview

Created a tool that can categorizes the Bengali news headlines into six category (National, Politics, International, Sports, Amusement, IT) using deep recurrent neural network.
A dataset of 0.13 Million news headlines is used which has been forked from Eftekhar-Hossain's Git Repository.
News headlines are from different Bengali online news portals such as Dainik Jugantor, Dainik Ittefaq, Dainik Kaler Kontho and so on.

Word embeeding feature represtations technique (Bengali Glove 300d) from sagorbrur Repository is used for extracting the semantic meaning of the words.

A deep learning model has been built by using a 1 layer Unidirectional Recurrent Neural Network LSTM.

Finally, the model performance is evaluated using various evaluation measures such as confusion matrix, accuracy , precision, recall and f1-score.

Resources Used

Developement Environment : Jupyter Notebook
Python Version : 3.8
Framework and Packages : PyTorch, Scikit-Learn, Pandas, Numpy, Matplotlib, Seaborn

About

Implementation of a news headline classification model on Bengali News Headline Dataset over PyTorch.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published