Skip to content

felixpeters/retweet-prediction

Repository files navigation

Retweet Prediction Research Project

The aim of this project is to make accurate predictions of the eventual retweet cascade size using NLP transfer learning approaches.

Getting started

Tweet data for model training is not published yet, but AWS S3 storage bucket will be opened for public access shortly.

This project is optimized for running on the Paperspace Gradient platform, but scripts are generally able to run in any environment containing the required packages. See the accompanying Dockerfile for details about environment setup. You might need to adjust paths for model and data storage.

Structure

This repository is structured as follows:

  • The root folder contains configuration files (Dockerfile, Makefile) and scripts to reproduce experiments
  • The retpred folder contains the corresponding module which implements basic utilities for data processing and model training

Contact

Contact me via GitHub, Twitter or LinkedIn.

About

Predict retweet cascade sizes based on NLP transfer learning approaches.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published