Human Activity Recognition from Videos

Problem statement :

Given a video (any file : .mp4, .avi, .MTS) , the task is to recognize, i.e , classify the activity being performed in the video.

Applications of such a system :

Elderly & infant care
Suspicious Activity Recognition
Industrial manufacturing & assistance

& many more

Dataset used:

10 classes from the UCF-101 dataset : https://www.crcv.ucf.edu/data/UCF101.php

Libraries used:

* Numpy
* OpenCV 
* PyTorch

Methodologies :

1) Using CNN:

Videos can be thought as many images stitched together. Thus we can assume subsequent frames in a video are correlated with respect to their semantic contents. Hence, we can extract images from the videos & then train a CNN pretrained on ImageNet dataset to classify the images extracted from the videos.

Accuracy achieved using this methodology 91%.

Dataflow diagram :

2) Using Spatio Temporal Classifer (CNN-LSTM):

Since, videos are temporal sequences thus we may also create a spatio-temporal classifer. I've done this by training an LSTM network on the features given by the CNN from the images of the video.

However, accuracy achieved was only 56%.

Dataflow diagram :

Reasons for low accuracy :

Less amount of data per class

Other ways of doing it:

Other ways of doing it have been beautifully descibed in this blog: http://blog.qure.ai/notes/deep-learning-for-videos-action-recognition-review

I hope to implement some of them in the near future !!!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
activity_recog_CNNtrainer.ipynb		activity_recog_CNNtrainer.ipynb
activity_recog_cnnFeatureExtractor.ipynb		activity_recog_cnnFeatureExtractor.ipynb
activity_recog_dataprep.ipynb		activity_recog_dataprep.ipynb
activity_recog_lstm.ipynb		activity_recog_lstm.ipynb
dfd_1.JPG		dfd_1.JPG
dfd_2.JPG		dfd_2.JPG

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

activity_recog_CNNtrainer.ipynb

activity_recog_CNNtrainer.ipynb

activity_recog_cnnFeatureExtractor.ipynb

activity_recog_cnnFeatureExtractor.ipynb

activity_recog_dataprep.ipynb

activity_recog_dataprep.ipynb

activity_recog_lstm.ipynb

activity_recog_lstm.ipynb

dfd_1.JPG

dfd_1.JPG

dfd_2.JPG

dfd_2.JPG

Repository files navigation

Human Activity Recognition from Videos

Problem statement :

Applications of such a system :

Dataset used:

Libraries used:

Methodologies :

1) Using CNN:

Dataflow diagram :

2) Using Spatio Temporal Classifer (CNN-LSTM):

Dataflow diagram :

Other ways of doing it:

About

Releases

Packages

Languages

subhromitra/Video-analytics

Folders and files

Latest commit

History

Repository files navigation

Human Activity Recognition from Videos

Problem statement :

Applications of such a system :

Dataset used:

Libraries used:

Methodologies :

1) Using CNN:

Dataflow diagram :

2) Using Spatio Temporal Classifer (CNN-LSTM):

Dataflow diagram :

Other ways of doing it:

About

Topics

Resources

Stars

Watchers

Forks

Languages