Skip to content

luizfranca/twitter-information-retrieval

Repository files navigation

Twitter Information Retrieval Framework

The project code is under the Twitter-Information-Retrieval-Project directory

The report is under Paper directory

###To run this code the following libs are needed:

  • stanford-corenlp-3.7.0-models.jar

This project comes with two data example for the indexes and the twittes. They are tweetsEN and tweetsEN20 for the tweets and Indexes and Indexes20 for the indexes

###There are test classes for all the main modules as follows:

  1. TestTokenizer.java
  2. TestStopWordsRemover.java
  3. TestEnglishLemmatisation.java
  4. TestPreprocessEnglish.java
  5. TestVocabulary.java
  6. TestCrawler.java
  7. TestIndexer.java
  8. TestSearch.java

Link to the todo list