SemEval-2018 Task 1 Affect in Tweets Evaluation Script

Evaluation Script for SemEval-2018 Task 1: Affect in Tweets.

1. Evaluation Script

The evaluation script evaluate.py computes the official performance metrics by comparing the given predictions against the gold-standard scores.

1.1. Prerequisites

1.2. Usage

python evaluate.py <task_type> <file-predictions> <file-gold> 

The parameter <task_type> can take three possible values:

  • 1 for regression (EI-reg and V-reg tasks)
  • 2 for ordinal classification (EI-oc and V-oc tasks)
  • 3 for multi-label classification (E-c tasks).
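
For reference, the reported metrics are Pearson correlation for the regression and ordinal classification tasks and a Jaccard-based accuracy for multi-label classification. The sketch below is only an illustration of how those quantities are computed; it is not the official script, and file parsing is omitted:

# Rough sketch of the metrics reported by evaluate.py (illustration only)
import numpy as np
from scipy.stats import pearsonr

def pearson(gold_scores, pred_scores):
    # Pearson correlation between gold and predicted intensity scores
    return pearsonr(gold_scores, pred_scores)[0]

def jaccard_accuracy(gold_labels, pred_labels):
    # average per-tweet Jaccard similarity between gold and predicted label sets
    scores = []
    for gold, pred in zip(gold_labels, pred_labels):
        union = gold | pred
        scores.append(len(gold & pred) / float(len(union)) if union else 1.0)
    return float(np.mean(scores))

print(pearson([0.2, 0.5, 0.9], [0.1, 0.6, 0.8]))           # regression / ordinal tasks
print(jaccard_accuracy([{"anger", "fear"}], [{"anger"}]))   # multi-label task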

1.3. Format

Each input file must follow the format specified on the competition's website.

If you want to use the script for format checking, evaluate your predictions against themselves:

python evaluate.py 1 EI-reg_en_fear_pred.txt EI-reg_en_fear_pred.txt
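
A quick sanity check can also be scripted directly. The sketch below assumes the tab-separated layout used by the files in the examples later in this README (one header line, four fields per row for the regression tasks); the competition's website remains the authoritative specification:

# Hedged sketch of a stand-alone format check (field count only)
def check_format(path, expected_fields=4):
    with open(path, encoding="utf-8") as f:
        next(f)  # skip the header line
        for line_number, line in enumerate(f, start=2):
            fields = line.rstrip("\n").split("\t")
            if len(fields) != expected_fields:
                print("line %d: expected %d fields, got %d"
                      % (line_number, expected_fields, len(fields)))

check_format("EI-reg_en_fear_pred.txt")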

2. Weka Baseline System

We have implemented a WEKA package called AffectiveTweets to be used as a baseline system. The package calculates multiple features from a tweet and can be combined with the many machine learning methods implemented in WEKA. Installation instructions are given on the project's webpage. Make sure to install the latest version (1.0.1) as well as the LibLINEAR package before running the examples below.

2.1. Data to Arff

The tweets_to_arff.py script converts the task data into WEKA's ARFF format.

Usage

python tweets_to_arff.py <data_type> <input_file> <output_file>

The parameter <data_type> can take three possible values:

  • 1 for regression data (EI-reg, V-reg)
  • 2 for ordinal classification data (EI-oc, V-oc)
  • 3 for multi-label emotion classification data (E-c).
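
For the regression data (data_type 1), the conversion essentially turns the tab-separated task file into an ARFF file with the tweet stored as a string attribute and the intensity score as a numeric class. The sketch below only illustrates that idea; the attribute names and column layout are assumptions, so use tweets_to_arff.py itself for real experiments:

# Rough sketch of a TSV-to-ARFF conversion for the regression data
def tsv_to_arff(input_file, output_file):
    with open(input_file, encoding="utf-8") as fin, open(output_file, "w", encoding="utf-8") as fout:
        fout.write("@relation tweets\n\n")
        fout.write("@attribute id string\n")
        fout.write("@attribute tweet string\n")
        fout.write("@attribute dimension string\n")
        fout.write("@attribute score numeric\n\n")
        fout.write("@data\n")
        next(fin)  # skip the header line
        for line in fin:
            if not line.strip():
                continue
            tweet_id, tweet, dimension, score = line.rstrip("\n").split("\t")
            tweet = tweet.replace("'", "\\'")  # escape quotes inside ARFF string values
            fout.write("'%s','%s','%s',%s\n" % (tweet_id, tweet, dimension, score))

tsv_to_arff("EI-reg-En-anger-train.txt", "EI-reg-En-anger-train.arff")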

2.2. Convert Weka Predictions into the Task's Submission Format

The fix_weka_output.py script converts WEKA predictions into the official submission format.

Usage

python fix_weka_output.py <data_type> <original_test_data> <weka_predictions> <output_file>

The parameter <data_type> can take two possible values:

  • 1 for regression data (EI-reg, V-reg)
  • 2 for ordinal classification data (EI-oc, V-oc)

Note: the current version of this script can only convert predictions made for the regression and ordinal classification tasks.
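
Conceptually, the script pairs each row of the original test file with the corresponding WEKA prediction and writes the test rows back out with the predicted value in place of the placeholder score. The sketch below only approximates that idea; in particular, the position of the predicted value in WEKA's tab-separated prediction output is an assumption, so use fix_weka_output.py for real submissions:

# Hedged sketch: merge WEKA predictions back into the task's submission format
def weka_to_submission(test_file, weka_predictions, output_file):
    with open(weka_predictions, encoding="utf-8") as f:
        next(f)  # skip WEKA's header row
        # assumption: the predicted value is the third tab-separated field
        preds = [line.rstrip("\n").split("\t")[2] for line in f if line.strip()]
    with open(test_file, encoding="utf-8") as fin, open(output_file, "w", encoding="utf-8") as fout:
        fout.write(next(fin))  # copy the header line
        for line, pred in zip(fin, preds):
            if not line.strip():
                continue
            fields = line.rstrip("\n").split("\t")
            fields[-1] = pred  # replace the placeholder score with the prediction
            fout.write("\t".join(fields) + "\n")

weka_to_submission("2018-EI-reg-En-anger-dev.txt",
                   "EI-reg-En-anger-weka-predictions.csv",
                   "EI-reg_en_anger_pred.txt")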

2.3. Examples

SVM Regression on EI-reg-En Anger

In this example we will train an SVM regression model (from LibLINEAR) on EI-reg-En-anger-train using unigrams as features and deploy it on the corresponding development set.

  1. Convert the training and dev sets into ARFF format:
python tweets_to_arff.py 1 EI-reg-En-anger-train.txt EI-reg-En-anger-train.arff
python tweets_to_arff.py 1 2018-EI-reg-En-anger-dev.txt 2018-EI-reg-En-anger-dev.arff
  2. Train the classifier using WEKA and save the predictions made on the dev set as a CSV file:
java -Xmx4G -cp $WEKA_FOLDER/weka.jar weka.Run weka.classifiers.meta.FilteredClassifier -t EI-reg-En-anger-train.arff -T 2018-EI-reg-En-anger-dev.arff -classifications "weka.classifiers.evaluation.output.prediction.CSV -use-tab -p first-last -file EI-reg-En-anger-weka-predictions.csv" -F "weka.filters.MultiFilter -F \"weka.filters.unsupervised.attribute.TweetToSparseFeatureVector -E 5 -D 3 -I 0 -F -M 2 -G 0 -taggerFile $HOME/wekafiles/packages/AffectiveTweets/resources/model.20120919 -wordClustFile $HOME/wekafiles/packages/AffectiveTweets/resources/50mpaths2.txt.gz -Q 1 -stemmer weka.core.stemmers.NullStemmer -stopwords-handler \\\"weka.core.stopwords.Null \\\" -I 2 -U -tokenizer \\\"weka.core.tokenizers.TweetNLPTokenizer \\\"\" -F \"weka.filters.unsupervised.attribute.Reorder -R 5-last,4\"" -W weka.classifiers.functions.LibLINEAR -- -S 12 -C 1.0 -E 0.001 -B 1.0 -L 0.1 -I 1000

Make sure that the LibLINEAR WEKA package is properly installed. Note: the snippet above may look cryptic, but you can configure the classifier in the WEKA GUI and then copy the resulting configuration to the command line.

  3. Convert the predictions into the task format:
python fix_weka_output.py 1 2018-EI-reg-En-anger-dev.txt EI-reg-En-anger-weka-predictions.csv EI-reg_en_anger_pred.txt
  4. Evaluate the predictions:
python evaluate.py 1 EI-reg_en_anger_pred.txt 2018-EI-reg-En-anger-dev.txt 

SVM Classification on EI-oc-En Anger

In this example we will train an SVM classifier (from LibLINEAR) on EI-oc-En-anger-train using unigrams as features and deploy it on the corresponding development set.

  1. Convert the training and dev sets into ARFF format:
python tweets_to_arff.py 2 EI-oc-En-anger-train.txt EI-oc-En-anger-train.arff
python tweets_to_arff.py 2 2018-EI-oc-En-anger-dev.txt 2018-EI-oc-En-anger-dev.arff
  2. Train the classifier using WEKA and save the predictions made on the dev set as a CSV file:
java -Xmx4G -cp $WEKA_FOLDER/weka.jar weka.Run weka.classifiers.meta.FilteredClassifier -t EI-oc-En-anger-train.arff -T 2018-EI-oc-En-anger-dev.arff -classifications "weka.classifiers.evaluation.output.prediction.CSV -use-tab -p first-last -file EI-oc-En-anger-weka-predictions.csv" -F "weka.filters.MultiFilter -F \"weka.filters.unsupervised.attribute.TweetToSparseFeatureVector -E 5 -D 3 -I 0 -F -M 0 -G 0 -taggerFile $HOME/wekafiles/packages/AffectiveTweets/resources/model.20120919 -wordClustFile $HOME/wekafiles/packages/AffectiveTweets/resources/50mpaths2.txt.gz -Q 1 -stemmer weka.core.stemmers.NullStemmer -stopwords-handler \\\"weka.core.stopwords.Null \\\" -I 2 -U -tokenizer \\\"weka.core.tokenizers.TweetNLPTokenizer \\\"\" -F \"weka.filters.unsupervised.attribute.Reorder -R 5-last,4\"" -W weka.classifiers.functions.LibLINEAR -- -S 1 -C 1.0 -E 0.001 -B 1.0 -L 0.1 -I 1000 
  3. Convert the predictions into the task format:
python fix_weka_output.py 2 2018-EI-oc-En-anger-dev.txt EI-oc-En-anger-weka-predictions.csv EI-oc_en_anger_pred.txt 
  4. Evaluate the predictions:
python evaluate.py 2 EI-oc_en_anger_pred.txt 2018-EI-oc-En-anger-dev.txt

Binary Relevance Multi-label Classifier on E-c-En

In this example we will train a binary relevance multi-label model on 2018-E-c-En-train using unigrams as features and an SVM as the base learner, and deploy it on the corresponding development set. We use MEKA, a multi-label extension to WEKA.
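
Binary relevance simply trains one independent binary classifier per emotion label. The toy scikit-learn sketch below only illustrates that idea (unigram counts, one linear SVM per label, made-up data); the actual pipeline in this example uses MEKA through TrainBR.jar:

# Illustrative binary relevance model: one linear SVM per emotion label
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import LinearSVC

train_tweets = ["I am so angry right now", "what a joyful, lovely day"]
train_labels = np.array([[1, 0],    # tweet 1: anger=1, joy=0
                         [0, 1]])   # tweet 2: anger=0, joy=1

vectorizer = CountVectorizer()                 # unigram features
X_train = vectorizer.fit_transform(train_tweets)
model = OneVsRestClassifier(LinearSVC())       # binary relevance: one SVM per label column
model.fit(X_train, train_labels)
print(model.predict(vectorizer.transform(["angry but joyful"])))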

  1. Convert the training and dev sets into ARFF format:
python tweets_to_arff.py 3 2018-E-c-En-train.txt 2018-E-c-En-train.arff
python tweets_to_arff.py 3 2018-E-c-En-dev.txt 2018-E-c-En-dev.arff
  2. Extract unigram features using AffectiveTweets:
java -Xmx4G -cp $WEKA_FOLDER/weka.jar weka.Run weka.filters.MultiFilter -b -i 2018-E-c-En-train.arff -o train_trans.arff -r 2018-E-c-En-dev.arff -s test_trans.arff -F "weka.filters.unsupervised.attribute.TweetToSparseFeatureVector -E 5 -D 3 -I 0 -F -M 0 -G 0 -taggerFile $HOME/wekafiles/packages/AffectiveTweets/resources/model.20120919 -wordClustFile $HOME/wekafiles/packages/AffectiveTweets/resources/50mpaths2.txt.gz -Q 1 -stemmer weka.core.stemmers.NullStemmer -stopwords-handler \"weka.core.stopwords.Null \" -I 13 -tokenizer \"weka.core.tokenizers.TweetNLPTokenizer \"" -F "weka.filters.unsupervised.attribute.Reorder -R 1-11,14-last"
  3. Train a BR multi-label model with an SVM as the base learner using MEKA:
java -Xmx4G -jar TrainBR.jar train_trans.arff test_trans.arff 2018-E-c-En-dev.txt E-C_en_pred.txt
  4. Evaluate the predictions:
python evaluate.py 3 E-C_en_pred.txt 2018-E-c-En-dev.txt
