- Must_Read
- Anomaly Detection
- Temporal Action Localization
- Temporal Action Segmentation
- Temporal Procedural Planning
- Procedural Learning Dataset
- Object Localization_Detection
- Open-set Action Recognition
- Out-of-distribution Detection
- Other interesting papers
- (CVPR24) GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
- (CVPR24) PREGO: online mistake detection in PRocedural EGOcentric videos
- (ICCV23) Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
- (ICCV23) Feature Prediction Diffusion Model for Video Anomaly Detection
- (CVPR22) High-Resolution Image Synthesis with Latent Diffusion Models
- (ICCV23) Unsupervised Surface Anomaly Detection with Diffusion Probabilistic Model
- (ICIP23) Exploring Diffusion Models for Unsupervised Video Anomaly Detection
- (Arxiv23) Time Series Anomaly Detection using Diffusion-based Models
- (MICCAI22) Diffusion Models for Medical Anomaly Detection
- (CVPR23) Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
- (CVPR23) Generating Anomalies for Video Anomaly Detection with Prompt-based Feature Mapping
- (CVPR23) Prompt-Guided Zero-Shot Anomaly Action Recognition using Pretrained Deep Skeleton Features
- (ECCV22) Self-Supervised Sparse Representation for Video Anomaly Detection
- (CVPR22) Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection
- (ICCV21) A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction
- (TPAMI21) A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video
- (???) TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
- (ECCV22) ActionFormer: Localizing Moments of Actions with Transformers
- (ICCV23) Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization
- (ICCV23) DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization
- (ICCV23) Action Sensitivity Learning for Temporal Action Localization
- (ICCV23) Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach
- (CVPR22) Unsupervised Action Segmentation by Joint Representation Learning and Online Clustering
- (ICCV19) Weakly Supervised Energy-Based Learning for Action Segmentation
- (ECCV22) Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
- (CVPR22) Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos
- (???) IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting
- (NIPS23) Every Mistake Counts in Assembly
- (ICCV23) HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
- (CVPR22) Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities
- (Arxiv23) What, when, and where? - Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
- (ICCV23) Self-Supervised Object Detection from Egocentric Videos
- (WACV23) TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained Videos
- (ICLR24) Unsupervised Open-Vocabulary Action Recognition with an Autoregressive Model
- (NIPS23) Opening the Vocabulary of Egocentric Actions
- (CVPR23) Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
- (CVPR23) Open Set Action Recognition via Multi-Label Evidential Learning
- (Arxiv22) Human Activity Recognition in an Open World
- (ICCV21) Evidential Deep Learning for Open Set Action Recognition
- (Arxiv23) DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization
- (MICCAI23) Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos
- (CVPR19) Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition