Learning from Video and Text via Large-Scale Discriminative Clustering

Introduction

This is the code for the paper :

Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, Josef Sivic, Learning from Video and Text via Large-Scale Discriminative Clustering, ICCV17.

The webpage for this project is available here.

It only contains the code for the optimization part of the action recognition model given pre-extracted track features.

Dependencies

To run this code, you need to install :

MOSEK : version 7
CVX : version 2.1

Once installed, setup the paths in the startup file :

main.m

Data

First you will need to download the pre-extracted person track features:

wget https://www.rocq.inria.fr/cluster-willow/amiech/iccv17/X.mat

Optimization

Now you can run our optimization code that will take X as input and output the label matrix Z given the bags formation and weak-supervision:

   main.m

This code is optimized for running everything on a computer with enough memory. If you are looking for a way to solve the Discriminative Clustering model in a totally online manner (ie with very limited memory usage) please contact me. We only provided this version as the fully online version is much slower to run because of the slow disk speed access.

Cite

If you find this code useful in your research, please, consider citing our paper:

@InProceedings{miech17learningvideotext, author = "Miech, Antoine and Alayrac, Jean-Baptiste and Bojanowski, Piotr and Laptev, Ivan and Sivic, Josef", title = "Learning from Video and Text via Large-Scale Discriminative Clustering", booktitle = "ICCV", year = "2017" }

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
Z_total.mat		Z_total.mat
bags.mat		bags.mat
build_PQ.m		build_PQ.m
build_action_bags.m		build_action_bags.m
data_block.mat		data_block.mat
get_blockmosek_A.m		get_blockmosek_A.m
get_blockmosek_lx.m		get_blockmosek_lx.m
get_blocks.m		get_blocks.m
get_mosek_A (copy).m		get_mosek_A (copy).m
get_mosek_A.m		get_mosek_A.m
get_mosek_A_fw.m		get_mosek_A_fw.m
get_mosek_Q.m		get_mosek_Q.m
get_mosek_lx.m		get_mosek_lx.m
get_mosek_lx_fw.m		get_mosek_lx_fw.m
get_prediction.m		get_prediction.m
get_uniqueness_bag.m		get_uniqueness_bag.m
init_action_params.m		init_action_params.m
joint_optimisation.m		joint_optimisation.m
linbcfwopt.m		linbcfwopt.m
main.m		main.m
scene_cast.mat		scene_cast.mat
track_map.mat		track_map.mat
tracks_in_bag.m		tracks_in_bag.m
weak_square_loss.m		weak_square_loss.m

antoine77340/iccv17learning

Folders and files

Latest commit

History

Repository files navigation

Learning from Video and Text via Large-Scale Discriminative Clustering

Introduction

Contents

Dependencies

Data

Optimization

Cite

About

Resources

Stars

Watchers

Forks

Languages