
Robust-Representation

Introduction

This code is an implementation of the following work:

Learning Robust Representation of Text. Li, Yitong, Trevor Cohn and Timothy Baldwin. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP 2016).

The code implements high-order derivatives as a regularizer for learning models that are robust against noise, built on a Convolutional Neural Network (CNN) model.
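As a rough sketch of the idea (not the repository's exact code), the snippet below adds a first-order gradient penalty, the squared norm of the loss gradient with respect to the input, to a toy TF 1.x linear classifier; all shapes and variable names here are invented for illustration:

import tensorflow as tf

# Toy stand-ins: the real model embeds tokens and applies a CNN.
x = tf.placeholder(tf.float32, [None, 10])   # input features
y = tf.placeholder(tf.float32, [None, 2])    # one-hot labels
W = tf.Variable(tf.truncated_normal([10, 2], stddev=0.1))
logits = tf.matmul(x, W)
ce_loss = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=logits))

# Robust regularization: penalize the norm of d(loss)/d(input) so that
# small perturbations of the input cannot change the loss much.
rb_lambda = 1e-2  # corresponds to the --rb_lambda flag described below
grad_wrt_input = tf.gradients(ce_loss, x)[0]
rb_penalty = tf.reduce_mean(tf.reduce_sum(tf.square(grad_wrt_input), axis=1))
total_loss = ce_loss + rb_lambda * rb_penalty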

CNN implementations

The CNN model follows Yoon Kim's Convolutional Neural Networks for Sentence Classification and Denny Britz's TensorFlow implementation (https://github.com/dennybritz/cnn-text-classification-tf).

Data

We evaluate on Pang and Lee's movie review dataset (MR) using 10-fold cross validation.

The other datasets used in the paper can be found at HarvardNLP.

You can also download pre-trained word2vec word embeddings.
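For instance, the 300-dimensional GoogleNews word2vec binary can be loaded with gensim (gensim is not a stated requirement of this repository; this is only a convenience sketch):

# Illustrative only; assumes the GoogleNews-vectors-negative300.bin file
# from the word2vec project page.
from gensim.models import KeyedVectors

w2v = KeyedVectors.load_word2vec_format(
    'GoogleNews-vectors-negative300.bin', binary=True)
print(w2v['movie'].shape)  # each vector is 300-dimensional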

Please refer to the original paper for details.

Requirements

  • Python 2.7 or 3
  • TensorFlow
  • NumPy

Running the model

Run the model using the following command:

>python train.py [parameters]
parameters:
	--dropout_keep_prob 0.5
		Dropout keep probability (default: 0.5)
	--rb_lambda 1e-2
		Robust regularization lambda (default: 1e-2)
	--alpha 0.1
		Data noise level alpha (default: 0.1)
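
The noise controlled by --alpha could look something like the word-level corruption below; this is an illustrative guess, and the repository's actual noise model may differ (see the paper for the precise definition):

import numpy as np

# Hypothetical noise model: replace each token id with a random vocabulary
# id with probability alpha. Only a guess at what "noise level" means here.
def corrupt(token_ids, vocab_size, alpha, rng=np.random):
    token_ids = np.asarray(token_ids)
    mask = rng.random_sample(token_ids.shape) < alpha
    noise = rng.randint(0, vocab_size, size=token_ids.shape)
    return np.where(mask, noise, token_ids)

# e.g. corrupt([4, 17, 9, 2], vocab_size=10000, alpha=0.3)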

For example, to train using robust regularization only (with noise level 0.3):

>python train.py --alpha 0.3 --rb_lambda 1e-2  --dropout_keep_prob 1

CV score: 0.764398878813

To train with dropout only (with noise level 0.3):

>python train.py --alpha 0.3 --dropout_keep_prob 0.5 --rb_lambda 0

CV score: 0.754831451178

Trained models are saved under the 'runs' folder.
