Skip to content

nkegke/deep-affective-bodily-expression-recognition

Repository files navigation

Deep Affective Bodily Expression Recognition

tsn

TSN-Based Visual Emotion Recognition Model

Abstract

The COVID-19 pandemic has undoubtedly changed the standards and affected all aspects of our lives, especially social communication. It has forced people to extensively wear medical face masks, in order to prevent transmission. This face occlusion can strongly irritate emotional reading from the face and urges us to incorporate the whole body as an emotional cue. In this paper, we conduct insightful studies about the effect of face occlusion on emotion recognition performance, and showcase the superiority of full body input over the plain masked face. We utilize a deep learning model based on the Temporal Segment Network framework, and aspire to fully overcome the face mask consequences. Although facial and bodily features can be learned from a single input, this may lead to irrelevant information confusion. By processing those features separately and fusing their prediction scores, we are more effectively taking advantage of both modalities. This framework also naturally supports temporal modeling, by mingling information among neighboring frames. In combination, these techniques form an effective system capable of tackling emotion recognition difficulties, caused by safety protocols applied in crucial areas.

Prerequisites

  • Linux
  • CUDA 11.2
  • numpy 1.21.5
  • matplotlib 3.5.1
  • torch 1.11.0
  • torchvision 0.12.0
  • PIL 9.0.1
  • opencv-python 4.5.5
  • pandas 1.4.2
  • sklearn 1.0.2
  • pytorch_grad_cam
  • gdown

Dataset PATHs:

Dataset CSVs: EmoReact/{train,val,test}.csv

How to run

python train_EmoReact.py [--input {face,body,fullbody,fusion}]
                         [--mask]
                         [--num_segments {1,3,5,10}]
                         [--arch {resnet50,mobilenet_v2}]
                         [--shift] [--shift_div {4,8}]
                         --config EmoReact/config.json
Face Full Body Fusion (Face + Body)
face face face

Visual Explanation (using Grad-CAM)

Excitement Frustration
face face face face face face
face face face face face face

Acknowledgements

About

Diploma Thesis (MEng) at School of Electrical & Computer Engineering - National Technical University of Athens

Topics

Resources

License

Stars

Watchers

Forks

Languages