
Lipreading Chainer

This is the Chainer code for the paper Combining Residual Networks with LSTMs for Lipreading. You can find the paper here.
The authors present a word-level lipreading model based on ResNets. The input to the model is a silent video clip, and the model outputs the word it predicts was spoken. The paper frames this visual speech recognition task as video classification.
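
For orientation, here is a minimal Chainer sketch of such an architecture. It is an illustration rather than the repository's actual model: the layer sizes, the single 3D convolution standing in for the paper's 3D-conv + ResNet front-end, and the bidirectional LSTM back-end are all assumptions based on the paper's description.

```python
import chainer
import chainer.functions as F
import chainer.links as L


class LipreadingModel(chainer.Chain):
    """Illustrative front-end + LSTM back-end word classifier (a sketch)."""

    def __init__(self, n_words=500, hidden=256):
        super().__init__()
        with self.init_scope():
            # Spatiotemporal front-end: a 3D convolution over the clip,
            # standing in for the paper's 3D-conv + ResNet stack.
            self.conv3d = L.ConvolutionND(
                3, 1, 64, ksize=(5, 7, 7), stride=(1, 2, 2), pad=(2, 3, 3))
            self.frame_fc = L.Linear(None, hidden)  # per-frame embedding
            # Temporal back-end: 2-layer bidirectional LSTM over frames.
            self.lstm = L.NStepBiLSTM(2, hidden, hidden, dropout=0.5)
            self.fc = L.Linear(2 * hidden, n_words)  # 500-word classifier

    def __call__(self, x):
        # x: (batch, 1, frames, height, width) grayscale video.
        h = F.relu(self.conv3d(x))
        b, c, t, hh, ww = h.shape
        # Flatten each frame's feature map into a vector.
        h = F.transpose(h, (0, 2, 1, 3, 4))         # (b, t, c, hh, ww)
        h = F.reshape(h, (b * t, c * hh * ww))
        h = F.relu(self.frame_fc(h))
        seqs = list(F.separate(F.reshape(h, (b, t, -1)), axis=0))
        _, _, ys = self.lstm(None, None, seqs)      # ys: list of (t, 2*hidden)
        # Mean-pool the LSTM outputs over time, then classify.
        pooled = F.stack([F.mean(y, axis=0) for y in ys])
        return self.fc(pooled)
```

A batch of LRW clips (29 frames per clip at 25 fps) would flow through this end to end, producing one logit vector over the 500-word vocabulary per clip.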

The code is based on the PyTorch implementation of the same work, which can be found here.

Dataset

The model is trained on the Oxford-BBC Lip Reading in the Wild (LRW) dataset. The dataset consists of short video clips of news anchors speaking a single word each. The vocabulary contains 500 words, with roughly 1,000 utterances per word. The full dataset is around 70 GB.
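
As a rough illustration of how such data might be wrapped for Chainer, the sketch below assumes each preprocessed clip is stored as an .npz file with a "video" array under a <root>/<word>/<split>/ layout. This layout and file format are assumptions for this sketch; the actual preprocessing format is whatever the PyTorch counterpart's scripts produce.

```python
# Hypothetical loader for preprocessed LRW clips. The .npz format,
# the "video" key, and the <root>/<word>/<split>/*.npz layout are
# assumptions for illustration, not the repo's actual format.
import glob
import os

import numpy as np
import chainer


class LRWDataset(chainer.dataset.DatasetMixin):
    def __init__(self, root, split="train"):
        self.files = sorted(glob.glob(os.path.join(root, "*", split, "*.npz")))
        # Map each of the 500 word directories to an integer label.
        words = sorted({f.split(os.sep)[-3] for f in self.files})
        self.word_to_id = {w: i for i, w in enumerate(words)}

    def __len__(self):
        return len(self.files)

    def get_example(self, i):
        path = self.files[i]
        video = np.load(path)["video"].astype(np.float32)
        label = np.int32(self.word_to_id[path.split(os.sep)[-3]])
        return video, label
```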

How to Run

  1. Download the LRW dataset from this website.
  2. Preprocess the dataset as described in the PyTorch counterpart of this repo (available here).
  3. Set the dataset path in config.json.
  4. Run the following command:
python main.py --config config.json
  5. After training is over, change the mode variable in config.json to 'backendGRU' and run the above command again.
  6. Finally, fine-tune the model by switching the mode to 'finetuneGRU'.

Make sure you change the path variable in config.json to the saved-model location after step 4. A rough sketch of config.json is given below.
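
For concreteness, config.json might look roughly like the snippet below. Only mode and path are referenced in this README; every other key, and the 'temporalConv' value for the initial mode, are guesses modeled on the PyTorch counterpart, so check the code for the actual field names.

```json
{
    "mode": "temporalConv",
    "path": "",
    "dataset": "/path/to/LRW",
    "batch_size": 36,
    "epochs": 30,
    "lr": 0.0003
}
```

After the first run finishes, set "path" to the saved model file and change "mode" as described in steps 5 and 6.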

TODOs

  • Chainer code, tested
  • Tested on CPU
  • Make it work on GPU
