PIVOT: Prompting for Video Continual Learning.

Modern machine learning pipelines are limited due to data availability, storage quotas, privacy regulations, and expensive annotation processes. These constraints make it difficult or impossible to train and update large-scale models on such dynamic annotated sets. Continual learning directly approaches this problem, with the ultimate goal of devising methods where a deep neural network effectively learns relevant patterns for new (unseen) classes, without significantly altering its performance on previously learned ones. In this paper, we address the problem of continual learning for video data. We introduce PIVOT, a novel method that leverages extensive knowledge in pre-trained models from the image domain, thereby reducing the number of trainable parameters and the associated forgetting. Unlike previous methods, ours is the first approach that effectively uses prompting mechanisms for continual learning without any in-domain pre-training. Our experiments show that PIVOT improves state-of-the-art methods by a significant 27% on the 20-task ActivityNet setup.

Prerequisites

It is essential to install all the dependencies and libraries needed to run the project. To this end, you need to run this line:

conda env create -f environment.yml

Dataset

We leverage The vCLIMB Benchmark to evaluate PIVOT. For more information about the benchmark and how to set it, we encourage you to visit the vCLIMB website.

Run

First of all, the configuration file must be created or modified according to the provided examples. Then, you must run the following line to train our main model considering your configuration file.

python main.py -conf './conf/CONF_FILE_NAME.yaml'

Citation

If you find this repository useful for your research, please consider citing our paper:

@inproceedings{PIVOT_villa,
  author    = {Villa, Andr{\'{e}}s and
               Le{\'{o}}n Alc{\'{a}}zar, Juan and
               Alfarra, Motasem and
               Alhamoud, Kumail and
               Hurtado, Julio and
               Caba Heilbron, Fabian and
               Soto, Alvaro and
               Ghanem, Bernard},
  title     = {{PIVOT:} Prompting for Video Continual Learning},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2023},
  month={June}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
conf		conf
images		images
model		model
utils		utils
.DS_Store		.DS_Store
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

conf

conf

images

images

model

model

utils

utils

.DS_Store

.DS_Store

README.md

README.md

main.py

main.py

Repository files navigation

PIVOT: Prompting for Video Continual Learning.

Prerequisites

Dataset

Run

Citation

About

Releases

Packages

Languages

ojedaf/PIVOT

Folders and files

Latest commit

History

Repository files navigation

PIVOT: Prompting for Video Continual Learning.

Prerequisites

Dataset

Run

Citation

About

Resources

Stars

Watchers

Forks

Languages