Skip to content

IanGross/RedditDataCollection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

RedditDataCollection

Collection of reddit submissions from various computer science subreddits to find trends in the computer science domain

File Directory Structure

Contains python file data collection, general metadata file, and sample generated datasets

Run Instructions

  1. Install the following programs/plugins
  1. Clone this repository (git clone https://github.com/IanGross/RedditDataCollection/)
  2. Create a reddit account and setup and Script App with OAuth2
  1. Copy the values for user_agent, client_id and client_secret of your application
  2. Open the Reddit_Data_Collection.py file and insert the values in the appropriate fields of "reddit = praw.Reddit()" (located right after import statements.
  3. Optional
  • Go to the website: https://www.unixtimestamp.com/ and get a utc start and end value. Insert those values under start_utc and end_utc
  • Add and/or remove subreddit names in the subreddit_list list
  1. Run the file with: python Reddit_Data_Collection.py

About

Collection of reddit submissions from various computer science subreddits

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages