Fake News Detector

This is our submission for Pravega'21, a hackathon conducted by IISc Bangalore.

A Brief Overview

After the COVID-19 outbreak, we have seen a huge amount of information being disseminated. With the current usage of social media platforms, consumers are creating and sharing more information than ever before, some of which is misleading and has no relevance to reality. The target of this project is the automated classification of a text article as misinformation or disinformation. In this presentation, we'll describe our approach to reaching the solution.

What is special in our approach?

Most related works focus on improving prediction quality by adding extra features. However, these features are not always available; for instance, some articles may not contain images. Using social media information is also problematic, because it is easy to create a new account on these platforms and fool the detection system. That's why we chose to focus on the article body.

Brief overview of our work

Web Development flow

The project has an Express server running on Node.js, with a multipage frontend rendered using EJS. The pages are styled with vanilla CSS and Bootstrap. The Node.js backend is connected to the machine learning model through the child_process module, which invokes the model via command-line utilities.
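To make the command-line handoff concrete, here is a minimal sketch of what the Python side of such a bridge could look like. It assumes the server passes the article text as the first command-line argument and reads one JSON line from stdout. The function names `classify` and `respond`, the JSON fields, and the keyword rule are all hypothetical placeholders, not the project's actual interface or model.

```python
import json


def classify(text: str) -> dict:
    """Placeholder for the real model call (hypothetical keyword rule)."""
    # Count a few made-up trigger words so the sketch runs end to end.
    hits = sum(text.lower().count(w) for w in ("shocking", "miracle", "hoax"))
    score = min(1.0, hits / 3)
    return {"label": "fake" if score >= 0.5 else "real", "score": score}


def respond(argv: list[str]) -> str:
    """Build the JSON line the script would print for the given argv."""
    # argv[0] is the script name; argv[1] is assumed to hold the article text.
    if len(argv) < 2:
        return json.dumps({"error": "no article text given"})
    return json.dumps(classify(argv[1]))


# In a real script, the last step would be: print(respond(sys.argv))
```

Printing a single JSON line keeps the contract between the two processes simple: the caller only has to collect stdout and parse it once the child process exits.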

Machine Learning Framework

Tokenizer

We used spaCy to segment the sentences into words, punctuation, etc. This is done according to rules specific to each language. The vocabulary is built according to how often each word occurs in the corpus.
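The frequency-based vocabulary step can be sketched as follows. To keep the example dependency-free, a small regex tokenizer stands in for spaCy; it is passed as a parameter so a spaCy-based tokenizer could be swapped in. The names `simple_tokenize`, `build_vocab`, `encode`, the `min_freq` cutoff, and the `<pad>`/`<unk>` special tokens are assumptions for illustration, not taken from the project.

```python
import re
from collections import Counter
from typing import Callable, Iterable


def simple_tokenize(text: str) -> list[str]:
    # Stand-in for spaCy: lowercase, then split into word and punctuation tokens.
    return re.findall(r"\w+|[^\w\s]", text.lower())


def build_vocab(
    texts: Iterable[str],
    tokenize: Callable[[str], list[str]] = simple_tokenize,
    min_freq: int = 1,
    specials: tuple[str, ...] = ("<pad>", "<unk>"),
) -> dict[str, int]:
    """Map each sufficiently frequent token to an integer id."""
    counts = Counter(tok for text in texts for tok in tokenize(text))
    vocab = {tok: i for i, tok in enumerate(specials)}
    # Most frequent tokens get the lowest ids; ties are broken alphabetically.
    for tok, freq in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        if freq >= min_freq:
            vocab[tok] = len(vocab)
    return vocab


def encode(
    text: str,
    vocab: dict[str, int],
    tokenize: Callable[[str], list[str]] = simple_tokenize,
) -> list[int]:
    """Convert text to token ids, mapping unknown tokens to <unk>."""
    unk = vocab["<unk>"]
    return [vocab.get(tok, unk) for tok in tokenize(text)]
```

Dropping rare tokens with `min_freq` keeps the vocabulary small, at the cost of mapping those tokens to the shared `<unk>` id.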

Embeddings

GloVe embeddings are used to convert the corpus into embeddings. GloVe is trained on aggregated global word-word co-occurrence statistics from a corpus, and the resulting representations showcase interesting linear substructures of the word vector space.
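As a rough illustration of how pretrained GloVe vectors might be consumed, the sketch below parses the plain-text GloVe format (one token per line, followed by its space-separated vector components) and builds an embedding matrix indexed by token id. The helper names `load_glove`, `cosine`, and `embedding_matrix` are hypothetical, and the project's own loading code may differ.

```python
import math
from typing import Iterable


def load_glove(lines: Iterable[str]) -> dict[str, list[float]]:
    """Parse GloVe text-format lines into a token -> vector mapping."""
    vectors = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(u: list[float], v: list[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def embedding_matrix(
    vocab: dict[str, int], vectors: dict[str, list[float]], dim: int
) -> list[list[float]]:
    """Row i holds the vector for the token with id i; unknown tokens stay zero."""
    matrix = [[0.0] * dim for _ in range(len(vocab))]
    for tok, idx in vocab.items():
        if tok in vectors:
            matrix[idx] = list(vectors[tok])
    return matrix
```

With a real file, `load_glove` could be fed an open file handle directly, since iterating over a text file yields its lines.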

Developer Instructions

You need to have the following things installed

Once you are done with that, follow the instructions below.

Clone the repository

> git clone https://github.com/sudip-mondal-2002/fake-news-detector.git
> cd fake-news-detector

Install the node dependencies

> yarn install

OR

> npm install

Install the python dependencies

> pip install -r requirements.txt

Start the server

> yarn run start

OR

> npm start

Now open http://localhost:3000 in your browser.

Notebooks created during the project

LSTM using Tensorflow
LSTM using Pytorch

OR

Check out the GitHub Gists

Bibliography

Fakenews Dataset from Kaggle
GloVe Embeddings, Stanford University (2014)
Recurrent Neural Networks, Lipton et al. (2015)
Long Short-Term Memory, Greff et al. (2015)
