Requirements

  • python3
  • perl
  • nltk (for the Stanford POS tagger)
  • java (for the Stanford tools)
  • zsh
  • task datasets (see below)
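
As a quick sanity check before running anything, the command-line tools listed above can be looked up on `PATH`. This is a minimal sketch, not part of the repository: the helper name `missing_tools` is hypothetical, and it only covers the executables (nltk is a Python package and the datasets are files, so neither is checked here).

```shell
# Print the name of every given tool that cannot be found on PATH.
# `missing_tools` is a hypothetical helper, not something this repository ships.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

missing=$(missing_tools python3 perl java zsh)
if [ -n "$missing" ]; then
  echo "Missing tools:"
  echo "$missing"
else
  echo "All required tools found."
fi
```

If nothing is reported missing, nltk can be checked separately with `python3 -c 'import nltk'`.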

Links to tasks/data sets

Please note that the ACE corpora are not free.

Usage

Download Stanford CoreNLP and the Stanford POS tagger

cd common
wget http://nlp.stanford.edu/software/stanford-corenlp-full-2015-04-20.zip
wget http://nlp.stanford.edu/software/stanford-postagger-2015-04-20.zip
unzip stanford-corenlp-full-2015-04-20.zip
unzip stanford-postagger-2015-04-20.zip
cd ..

Copy and convert each corpus

Before running the commands below, set the environment variables they reference to your corpus directories, or replace each variable with the directory path directly.
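
One way to set the variables used in the commands that follow is sketched here. The variable names come from those commands; every path is a placeholder assumption and must be replaced with the location of your own copies.

```shell
# Placeholder paths (assumptions): point these at your own corpus copies.
export ACE2004_DIR="$HOME/corpora/ace2004"
export ACE2005_DIR="$HOME/corpora/ace2005"
export SEMEVAL_TRAIN_DIR="$HOME/corpora/semeval2010/train"
export SEMEVAL_TEST_DIR="$HOME/corpora/semeval2010/test"

# Warn about any directory that does not exist, so typos surface early.
for dir in "$ACE2004_DIR" "$ACE2005_DIR" "$SEMEVAL_TRAIN_DIR" "$SEMEVAL_TEST_DIR"; do
  [ -d "$dir" ] || echo "warning: $dir not found" >&2
done
```

The exports last only for the current shell session, so run the copy and conversion commands in the same session.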

ACE 2004

cp -r ${ACE2004_DIR}/*/english ace2004/
cd ace2004
zsh run.zsh
cd ..

ACE 2005

cp -r ${ACE2005_DIR}/*/English ace2005/
cd ace2005
zsh run.zsh
cd ..

SemEval 2010 Task 8

cp ${SEMEVAL_TRAIN_DIR}/TRAIN_FILE.TXT semeval-2010/
cp ${SEMEVAL_TEST_DIR}/TEST_FILE.txt semeval-2010/
cd semeval-2010/
zsh run.zsh
cd ..