Gesture-Classification-in-Real-Time

This is a second version of this application. This time it's more computation heavy as I tried to tackle a problem of gesture classification in real time without background subtraction and in background agnostic way using custom CNN. Gestures to be classified:

Base dataset is also a part of this repository as well as augmentation script, that produces a lot more data artifically via various techniques. For pc version switch branch to pc_version. If you'd like to test this app without providing calibration matrices, run:

python3 gesture_classification.py --nocalib

Otherwise upload your own cameraCalibResults.npz and run normally. My other app, called camera calibration can generate this file for you: https://github.com/EmbeddedPaul166/Camera-Calibration

Hardware components:

Antmicro TX2/TX2i Deep Learning Kit

Software components:

Python 3.6
OpenCV 4.1.1
TensorFlow 2.1.0
CUDA 10.0
CuDNN 7.6.3
TensorRT 6.0.1

Data augmentation

Training set was augmented with various techniques. Base pictures were taken on a black background so that it was possible to use thresholding on numpy arrays to easily segment out gestures and their backgrounds. After that many random solid color backgrounds were generated and gestures were downscaled, randomly stretched and planted in various translations in the frames. Some images were darkened randomly, to fit different lighting conditions. Apart from that all images were flipped mirror-like and random noise was added, varying in density. For more information see: https://github.com/EmbeddedPaul166/Gesture-Classification-in-Real-Time/blob/master/images/augment_data.py

Algorithm steps

Download frame from camera.
Remap frame based on calibration output.
Resize frame to 267x150.
Cut 150x150 from it.
Convert it to grayscale.
Normalize for float values between 0 and 1
Reshape to 1x150x150x1 size
Convert to tensor
Predict using a frozen graph from TensorRT optimized model, previously trained in Tensorflow

Optimization

Model is optimized using TensorRT which resulted in huge increase (4 -> 22) of FPS. It's due to Tensorflow using only about 10% GPU on TX2, despite information on the internet that it maps 100% of GPU on default. Conversion was done on saved model format, at the beginning of the main app converted saved model is loaded and converted to frozen graph with which inference is done.

CNN architecture

For custom CNN architecture see https://github.com/EmbeddedPaul166/Gesture-Classification-in-Real-Time/blob/master/cnn/train_cnn_model.py

Benchmarking

Framerate mostly stays at 22 FPS, where upper limit is 30 FPS due to the way camera driver is implemented.

Running

python3 gesture_classification.py

TensorRT technical note

I encountered extremely slow model loading with TensorRT. Later I found out it's due to incorrect protobuf installation. For more information how to fix it see: https://github.com/EmbeddedPaul166/Gesture-Classification-in-Real-Time/blob/master/util/note

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
camera_parameters		camera_parameters
cnn		cnn
examples		examples
images		images
util		util
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gesture_classification.py		gesture_classification.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

camera_parameters

camera_parameters

cnn

cnn

examples

examples

images

images

util

util

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

gesture_classification.py

gesture_classification.py

Repository files navigation

Gesture-Classification-in-Real-Time

Hardware components:

Software components:

Data augmentation

Algorithm steps

Optimization

CNN architecture

Benchmarking

Running

TensorRT technical note

About

Releases

Packages

Languages

License

EmbeddedPaul166/Gesture-Classification-in-Real-Time

Folders and files

Latest commit

History

Repository files navigation

Gesture-Classification-in-Real-Time

Hardware components:

Software components:

Data augmentation

Algorithm steps

Optimization

CNN architecture

Benchmarking

Running

TensorRT technical note

About

Topics

Resources

License

Stars

Watchers

Forks

Languages