Skip to content

Latest commit

 

History

History
63 lines (51 loc) · 2.41 KB

sick.md

File metadata and controls

63 lines (51 loc) · 2.41 KB

Running the RTE pipeline on SICK.

First, ensure that you have downloaded C&C parser and EasyCCG parser and wrote their installation locations in the files en/candc_location.txt and en/easyccg_location.txt, respectively.

Second, you need to download the SICK dataset and VerbOcean by running the following script:

./en/download_dependencies.sh

You can evaluate the end-to-end system performance of a certain list of semantic templates on the test split of SICK by doing:

./en/eacl2017exp.sh 10 test en/semantic_templates_en_event.yaml

This script will coordinate the tokenization, syntactic parsing (with C&C and EasyCCG), semantic parsing and theorem proving (with Coq) using 10 processes. Syntactic and semantic parsing results will be written in parsed directory. Entailment judgements and an HTML graphical representation of semantic composition (and constructed theorem) will be written in results directory. You can see a summary of performance by doing:

cat results/score.txt

and you should see something similar to this:

Correct parsing: 0.9748 (4803/4927)
Accuracy: 0.8313 (4096/4927)
Recall: 0.6265
Precision: 0.9688
F1 score: 0.7608
Gold_correct_total: 2134
System_answer_total: 1380
System_correct_total: 1337
----------------------------------------------------------------
                            system                              
     |        |     yes |      no | unknown |   error |   total 
----------------------------------------------------------------
     |     yes|     838 |       6 |     565 |       5 |    1414 
gold |      no|       3 |     499 |     176 |      42 |     720 
     | unknown|      22 |      12 |    2682 |      77 |    2793 
     |   total|     863 |     517 |    3423 |     124 |    4927 
----------------------------------------------------------------

If you want to see the results (syntactic/semantic parses, entailment judgements and HTML visualizations) but do not wish to run the software, you can uncompress the file en/sick_intermediate_results.tgz by doing:

tar xvzf en/sick_intermediate_results.tgz

which will create the plain/, parsed/ and results/ directories.