Skip to content

EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important temporal segments in educational videos.

License

VideoAnalysis/EDUVSUM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EDUVSUM (Educational Video Summarization)

EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important temporal segments in educational videos.

Dataset

Download the dataset and put it in "data" folder.

Dataset v1

Video Annotation Tool (VAT)

Video Annotation Tool (VAT) is a web based tool to annotate videos datasets to use in machine learning tasks.

Get started (Requirements and Setup)

Python version >= 3.6

# clone the repository
git clone git@github.com:Junaid112/EDUVSUM-Educational-Video-Summarization.git
cd EDUVSUM-Educational-Video-Summarization
pip install -r requirements.txt

Citation

@article{ghauri2020eduvsum, 
   title={Classification of Important Segments in Educational Videos using Multimodal Features},
   author={Ghauri, Junaid Ahmed and Hakimov, Sherzod and Ewerth, Ralph}, 
   Conference={International Workshop on Investigating Learning During Web Search (IWILDS 2020) co-located with CIKM}, 
   year={2020} 
}

About

EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important temporal segments in educational videos.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages