Hierarchical Attention Network for Sentiment Classification

A PyTorch implementation of the Hierarchical Attention Network for Sentiment Analysis on the Amazon Product Reviews datasets. The system uses the review text and the summary text to classify the reviews as one of positive, negative or neutral. These classes correspond to ratings 4-5, 1-2 and 3 respectively in the dataset.

Requirements

python 3.5
pytorch
torchtext
spacy

Organisation

The code in the repository are organised in following modules:

main.py: driver code
model.py: Hierachical Attention Network implementation
train.py: training/validation/testing code
preprocess.py: data preprocessing code
vocab.py: code for building vocab
dataset.py: custom pytorch dataset for review data
utils.py: logging, config generation, experiment analysis scripts

Following utility scripts have been added for training/testing:

train.sh: will clean and preprocess train data, generate vocabulary pickles, and then train the model on the preprocessed data.
test.sh: clean and preprocess test data, evaluate model on the preprocessed data and write model predictions to file.

Usage

$ ./train.sh <train_data_json>
$ ./test.sh <test_data_json> <result_file>

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.gitignore		.gitignore
README.md		README.md
clean.sh		clean.sh
config_sample.yaml		config_sample.yaml
config_test.yaml		config_test.yaml
dataset.py		dataset.py
main.py		main.py
model.py		model.py
preprocess.py		preprocess.py
run.sh		run.sh
test.sh		test.sh
train.py		train.py
train.sh		train.sh
utils.py		utils.py
vocab.py		vocab.py

Shivanshu-Gupta/hierarchical-attention-network

Folders and files

Latest commit

History

Repository files navigation

Hierarchical Attention Network for Sentiment Classification

Requirements

Organisation

Usage

References

About

Topics

Resources

Stars

Watchers

Forks

Languages