LSTM-ER

Implementation of End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures.

Requirements

  • Fedora Core 22
  • clang++ 3.4
  • boost 1.57
  • yaml-cpp 0.5.1
  • ICU4C 54.1

This code may work in other Linux environments, but we have not tested them. For convenience, this package includes snapshot versions of clab/cnn (https://github.com/clab/cnn) and eigen (http://eigen.tuxfamily.org/). These are distributed under their original licenses.
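Before compiling, it can help to confirm that the command-line tools used in the steps below are on the PATH. The following is a minimal sketch; `check_tool` is a hypothetical helper, not part of this package, and it only checks for presence, not for the versions listed above.

```shell
# Hypothetical helper (not part of LSTM-ER): report whether a tool is on PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1"
  fi
}

# Tools invoked by the compilation and download steps in this README.
for tool in clang++ cmake make wget; do
  check_tool "$tool"
done
```

Libraries such as boost, yaml-cpp, and ICU4C are not covered by this check; cmake should report them if they cannot be found.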

Usage

Compilation

tar xzf cnn.tar.gz
tar xzf eigen.tar.gz
mkdir build
cd build
cmake .. -DEIGEN3_INCLUDE_DIR=eigen -DCMAKE_CXX_COMPILER=/usr/bin/clang++
make
cd ..

Creation of directories

mkdir dict models

Preparation of the pretrained embeddings

cd dict/
wget http://tti-coin.jp/data/wikipedia200.bin
cd ..

Preparation of the data sets

See data/README.md.

Obtaining pretrained models

These models were trained in the environment listed above (on an Intel CPU). If you use a different environment, please retrain the models to avoid incompatibility problems.

ACE 2005 (Relation extraction)

cd models/
wget http://tti-coin.jp/data/ace2005-test.txt.gz
gunzip ace2005-test.txt.gz
cd ..

SemEval 2010 Task 8 (Relation classification)

cd models/
wget http://tti-coin.jp/data/semeval-test.txt.gz
gunzip semeval-test.txt.gz
cd ..

Testing models

Prediction results will be written as *.pred.ann in the test corpus directory.

ACE 2005 (Relation extraction)

build/relation/RelationExtraction --test -y yaml/parameter-ace2005.yaml

SemEval 2010 Task 8 (Relation classification)

build/relation/RelationExtraction --test -y yaml/parameter-semeval-2010.yaml
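After a test run, one quick sanity check is to count the *.pred.ann files that were written. The sketch below assumes you know the path of your test corpus directory; `count_predictions` is a hypothetical helper, not part of this package.

```shell
# Hypothetical helper (not part of LSTM-ER): count prediction files
# (*.pred.ann) under a directory, including its subdirectories.
count_predictions() {
  find "$1" -type f -name '*.pred.ann' | wc -l | tr -d ' '
}

# Example call, with a placeholder path to replace by your own corpus directory:
#   count_predictions path/to/test/corpus
```

A count of 0 after a run suggests that the corpus path in the YAML file does not point where you expect.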

Training models

ACE 2005 (Relation extraction)

build/relation/RelationExtraction --train -y yaml/parameter-ace2005.yaml

SemEval 2010 Task 8 (Relation classification)

build/relation/RelationExtraction --train -y yaml/parameter-semeval-2010.yaml

Notes

YAML files for ACE2004 are not included. To run on ACE2004, modify yaml/parameter-ace2005.yaml accordingly.
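One way to do this without losing the ACE 2005 settings is to work on a copy of the parameter file. The sketch below is only an assumption about a convenient workflow: `derive_config` is a hypothetical helper and yaml/parameter-ace2004.yaml is a hypothetical file name. Which entries need to change depends on your ACE2004 data layout and is not shown here.

```shell
# Hypothetical helper (not part of LSTM-ER): copy a parameter file to a new
# name, refusing to overwrite an existing file.
derive_config() {
  src=$1
  dst=$2
  if [ ! -f "$src" ]; then
    echo "no such file: $src" >&2
    return 1
  fi
  if [ -e "$dst" ]; then
    echo "already exists: $dst" >&2
    return 1
  fi
  cp "$src" "$dst"
}

# Example (the target file name is an assumption):
#   derive_config yaml/parameter-ace2005.yaml yaml/parameter-ace2004.yaml
#   # edit yaml/parameter-ace2004.yaml by hand, then:
#   build/relation/RelationExtraction --train -y yaml/parameter-ace2004.yaml
```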

Scores may differ from those reported in our paper because of differences between environments.

Please cite our ACL 2016 paper, End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures, when using this software.