Skip to content

chuckcho/Activity-Recognition-with-CNN-and-RNN

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LRCN and Temporal CNN for Activity Recognition

Chih-Yao Ma*, Min-Hung Chen*

(* equal contribution)


Abstract

We examine and implement several leading techniques for Activity Recognition (video classification), while proposing and investigating a novel convolution on temporally-constructed feature vectors.

How we tackle Activity Recognition problem?

CNN as baseline, CNN + RNN (LRCN), Temporal CNN

CNN as baseline CNN + RNN (LRCN) Temporal CNN
CNN as baseline CNN + RNN (LRCN) Temporal CNN

Demo

IMAGE ALT TEXT HERE

The above YouTube video demonstrates the top-3 predictions results of our LRCN and temporal CNN model. The text on the top is the ground truth, three texts are the predictions for each of the method, and the bar right next to the predictions are how confident the model makes predictions.


Dataset

We are currently using UCF101 dataset for our project. This dataset has 13320 videos from 101 action categories.

UCF101 Dataset

We will move onto Sports-1M dataset to see how much our performance will be changed in the near future.

SPORTS-1M Dataset


Installation

Our work is currently implemented in Torch, and depends on the following packages: torch/torch7, torch/nn, torch/nngraph, torch/image, cudnn ...

If you are on Ubuntu, please follow the instruction here to install Torch. For a more comprehensive installation guilde, please check Torch installation.

$ git clone https://github.com/torch/distro.git ~/torch --recursive
$ cd ~/torch; bash install-deps;
$ ./install.sh
$ source ~/.bashrc

You will also need to install some of the packages we used from LuaRocks. LuaRocks should already be installed with your Torch.

$ luarocks install torch
$ luarocks install pl
$ luarocks install trepl
$ luarocks install image
$ luarocks install nn
$ luarocks install dok
$ luarocks install gnuplot
$ luarocks install qtlua
$ luarocks install sys
$ luarocks install xlua
$ luarocks install optim

If you would like to use CUDA on your NVIDIA graphic card, you will need to install CUDA toolkit and some additional packages.

$ luarocks install cutorch
$ luarocks install cunn

You need to install the CUDNN package properly since we use the pre-trained ResNet model. First, you need to download the package from Nvidia (You need to register to download it.)

Then, follow this instruction:

$ tar -xzvf cudnn-7.0-linux-x64-v4.0-prod.tgz
$ cd cuda
$ sudo cp lib* /usr/local/cuda/lib64/
$ sudo cp cudnn.h /usr/local/cuda/include/
$ luarocks install cudnn

(note: There may be problems if you use CUDNN v5 since currently Torch can only detect CUDNN v4.)


Usage

We provide three different methods to train the models for activity recognition: CNN, CNN with RNN, and Temporal CNN.

Inputs

Our models will take the feature vectors generated by the first CNN as input for training. You can generate the features using our codes under "/CNN_Spatial/". You can also download the feature vectors generated by ourselves. (please refer to the Dropbox link below.) We followed the first training/testing split from UCF-101. If you would like to compare with our results, please use the same training and testing list, as it will affect your overall performance a lot.

CNN with RNN

We use the RNN library provided by Element-Research. Simply install it by:

$ luarocks install rnn

After you downloaded the feature vectors, please modify the code in ./RNN/data.lua to the director where you put your feature vector files.

To start the training process, go to ./RNN and simply execute:

$ th RNN_LSTM.lua

The training and testing performance will be plotted, and the results will be saved into log files. The learning rate and best testing accuracy will be reported each epoch if there is any update.

Temporal CNN

To start the training process, go to ./TCNN and simply execute:

$ qlua run.lua -r 15e-5

For more details, please refer to the readme file in the folder ./TCNN/.

You also need to modify the code in ./TCNN/data.lua to the director where you put your feature vector files.

The training and testing performance will be plotted, and the results will be saved into log files. The best testing accuracy will be reported each epoch if there is any update.


Acknowledgment

This work was initialized as a class project for deep learning class in Georgia Tech 2016 Spring. We were teamed up with Hao Yan and Casey Battaglino to work on this class project, who have been a great help and provide valuable discussions as we go long this class project.

This is an ongoing project. Please contact us if you have any questions.

Chih-Yao Ma at cyma@gatech.edu or [LinkedIn]

Min-Hung Chen at cmhungsteve@gatech.edu

Last updated: 05/05/2016

About

LRCN and Temporal CNN for Activity Recognition

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Lua 56.0%
  • Python 24.6%
  • C++ 18.6%
  • C 0.8%