A TensorFlow implementation of "Sequence Modeling with Hierarchical Deep Generative Models with Dual Memory" (published in CIKM2017).


zheng-yanan/hierarchical-deep-generative-models


Hierarchical Deep Generative Models with Dual Memory

A TensorFlow implementation of the Hierarchical Deep Generative Models with Dual Memory described in "Sequence Modeling with Hierarchical Deep Generative Models with Dual Memory", published as a long paper in CIKM 2017. Please refer to the paper for more details.

For comparison, this repository also implements a baseline model: the Variational Autoencoder for Sentences described in "Generating Sentences from a Continuous Space".
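For readers unfamiliar with the baseline, the training objective of a Gaussian variational autoencoder can be sketched as below. This is a minimal pure-Python illustration of the standard terms (KL divergence to a standard normal prior, the reparameterization trick, and a KL-weighted negative ELBO), not code taken from this repository. The function names and the `kl_weight` parameter are hypothetical, and the reconstruction loss is assumed to be computed elsewhere by the decoder.

```python
import math
import random


def kl_to_standard_normal(mu, logvar):
    # KL( N(mu, diag(exp(logvar))) || N(0, I) ), summed over latent dimensions.
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, logvar))


def reparameterize(mu, logvar, rng=random):
    # z = mu + sigma * eps with eps ~ N(0, 1): the noise is sampled separately
    # from the parameters, which is what lets gradients reach mu and logvar
    # in an autodiff framework.
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, logvar)]


def negative_elbo(recon_nll, mu, logvar, kl_weight=1.0):
    # recon_nll: negative log-likelihood of the sequence under the decoder
    # (assumed to be supplied by the caller).
    # kl_weight: hypothetical annealing weight on the KL term.
    return recon_nll + kl_weight * kl_to_standard_normal(mu, logvar)


# Example: a 1-dimensional posterior N(1, 1) against the N(0, 1) prior.
loss = negative_elbo(recon_nll=10.0, mu=[1.0], logvar=[0.0], kl_weight=0.5)
```

The `kl_weight` argument reflects the common practice of annealing the KL term from 0 to 1 during training so that the decoder does not ignore the latent variable; whether and how this repository schedules it is not stated here.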

Introduction

This paper proposes Hierarchical Deep Generative Models with Dual Memory (HDGM for short) to address the difficulties of discrete sequence modeling. It builds on the latent variable model framework and has two distinguishing features. First, it uses an improved, hierarchically organized multi-layer architecture. Second, it adopts a memory mechanism along two directions, denoted broad memory and deep memory. HDGM is validated on multiple tasks, including language modeling, document modeling and sentiment analysis, using various measurements, and is shown to be highly effective.

Usage:

python trainer.py --dataset <dataset_name> --model <model_name>

will run training with the default settings and save the model to ./save/hdgm_ptb. Utilities are provided for two datasets, ptb and imdb. Two models, hdgm and vae, are available for selection.
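The command-line interface described above could be handled along the following lines. This is a sketch under stated assumptions, not the contents of trainer.py: the dataset and model names come from this README, while the defaults (`ptb`, `hdgm`), the helper names, and the `<model>_<dataset>` directory pattern are guesses inferred from the ./save/hdgm_ptb path.

```python
import argparse

# Names listed in the README.
DATASETS = ("ptb", "imdb")
MODELS = ("hdgm", "vae")


def build_parser():
    # Hypothetical parser; the defaults are an assumption, not confirmed by the source.
    parser = argparse.ArgumentParser(description="Illustrative trainer CLI (sketch)")
    parser.add_argument("--dataset", choices=DATASETS, default="ptb")
    parser.add_argument("--model", choices=MODELS, default="hdgm")
    return parser


def save_dir(model, dataset, root="./save"):
    # Assumption: save directories are named <model>_<dataset>,
    # inferred from the single example path ./save/hdgm_ptb.
    return "{}/{}_{}".format(root, model, dataset)


# Equivalent to: python trainer.py --dataset imdb --model vae
args = build_parser().parse_args(["--dataset", "imdb", "--model", "vae"])
print(save_dir(args.model, args.dataset))
```

Restricting both flags with `choices` makes an unsupported dataset or model name fail at parse time rather than partway through training.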

Prerequisites

  • TensorFlow 1.4.0
  • Python 2.7

Reference

If you use the source code included in this toolkit in your work, please cite the following paper. The BibTeX entry is listed below:

@inproceedings{DBLP:conf/cikm/ZhengWWYJ17,
  author    = {Yanan Zheng and
               Lijie Wen and
               Jianmin Wang and
               Jun Yan and
               Lei Ji},
  title     = {Sequence Modeling with Hierarchical Deep Generative Models with Dual
               Memory},
  booktitle = {{CIKM}},
  pages     = {1369--1378},
  publisher = {{ACM}},
  year      = {2017}
}
