Skip to content

Twitter bot that uses an improved word frequency algorithm based on gradient heuristics for extractive summarization

Notifications You must be signed in to change notification settings

Tritium-Glass/Twitter-Trend-Summarization-Using-Improved-Word-Frequency

Repository files navigation

Twitter Bot For Summarizing Articles Based on Twitter Trends

Created as Final Year Project for B. Tech in Computer Engineering

Working of the software:

  • the Twitter interface interface takes the top few trending topics from Twitter, cleans them, and sends them to Web Scraping module
  • the Web Scraping module searches the web for articles related to that topic that are most recent and are from trusted sources and then forwards them to the text summarization module
  • the text summarization module uses a modified word frequency method to get the gist of the news article in fewer than 240 characters and then sends it to the Twitter interface module
  • a shortened hyperlink to the article is appended to the short gist, and the final result is posted on Twitter


For selecting the most appropriate extractive summarization method, the following algorithms were compared:

  1. Word Frequency
  2. Term Frequency - Inverse Document Frequency
  3. TextRank
  4. Latent Semantic Analysis
  5. Improved Word Frequency (we modified the word frequency algorithm)