video-understanding

fpv-iplab / Quasi-Online-Detection-Take-Release

Code for the paper "Quasi-Online Detection of Take and Release Actions from Egocentric Videos", International Conference on Image Analysis and Processing 2023.

video-understanding action-detection

Updated Jul 15, 2023

chajchaj / models

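Pre-trained and Reproduced Deep Learning Models (official PaddlePaddle model library, containing a variety of deep learning models from the academic frontier that are validated in industrial scenarios)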

video-understanding video-classification action-classification

Updated Sep 1, 2020
Python

crim-ca / FrVD-visualization-tool

Tool for visualizing FrVD metadata in sync with the corresponding videos.

visualization annotations dataset action-recognition video-understanding video-description

Updated Apr 1, 2024
Python

InvincibleWyq / VBA

Undergraduate Thesis @ Department of Automation, Tsinghua -- Understanding Few-shot Video with Pretrained Image-Text Models

transfer-learning video-understanding image-text-pretraining

Updated Dec 18, 2023
Python

Fsoft-AIC / UGLF

[IJCNN 2024] Unifying Global and Local Scene Entities Modelling for Precise Action Spotting

video-processing video-understanding vision-language-model action-spotting

Updated May 4, 2024
Python

SCUT-BIP-Lab / 3DTDS-Net

The code for 3DTDS-Net with PyTorch

biometrics human-computer-interaction video-understanding hand-gesture-authentication

Updated Mar 21, 2022
Python

carriex / C3D

Video understanding with C3D

video-understanding c3d pytorch-implementation

Updated Jun 10, 2020
Python

SCUT-BIP-Lab / FSTA-Net

The code for FSTA-Net with PyTorch

biometrics human-computer-interaction video-understanding biometric-authentication hand-gesture-authentication behavioral-characteristic-analysis

Updated May 23, 2023
Python

XFeiF / ComputerVision_PaperNotes

📚 Paper Notes (Computer vision)

computer-vision notes paper cv representation-learning cvpr action-recognition iccv video-understanding eccv video-representation-learning self-supervised-learning video-representation video-retrieval tpami video-papernotes

Updated Mar 23, 2021

SCUT-BIP-Lab / L3AM

The code for the L3AM loss with PyTorch

biometrics loss-functions video-understanding softmax palmprint palm-vein hand-gesture-authentication

Updated Mar 25, 2024
Python

engindeniz / DialogSummary-VideoQA

[ICCV 2021] On the hidden treasure of dialog in video question answering

language-models video-understanding vision-language video-question-answering knowledge-base-videoqa

Updated Mar 30, 2022
Python

mx-mark / SPMNet

Source code for "Visually aligned sound generation via sound-producing motion parsing" (published in Neurocomputing)

synchronization video-understanding audioset vas cross-modality visual-audio audio-generation visual-to-sound

Updated Apr 12, 2022

jinxiang-liu / UFE-AVS

Official code for the CVPR 2024 paper "Audio-Visual Segmentation via Unlabeled Frame Exploitation"

semantic-segmentation video-understanding audio-visual-segmentation

Updated Mar 25, 2024

ZJCV / Non-local

[CVPR 2018] Non-local Neural Networks

pytorch action-recognition video-understanding video-recognition non-local i3d c2d resnet3d

Updated Dec 15, 2020
Python

SCUT-BIP-Lab / PB-Net

The code for PB-Net with PyTorch

biometrics human-computer-interaction video-understanding biometrics-authentication hand-gesture-authentication behavioral-characteristic-analysis

Updated Feb 27, 2023
Python

jeremy-collins / vroom

We use visual data alone to learn a control policy for a robotic arm by observing expert video demonstrations. We implement and test several models, achieving an 85% success rate on a pick-and-place task.

machine-learning video computer-vision deep-learning robotics video-understanding visuomotor-control

Updated Dec 4, 2022
Python

Here are 182 public repositories matching this topic...

crim-ca / FrVD

unitaryai / VTC-dataset

dukaenea / unintentional_actions

Fsoft-AIC / Z-GMOT

fpv-iplab / Quasi-Online-Detection-Take-Release

chajchaj / models

crim-ca / FrVD-visualization-tool

InvincibleWyq / VBA

Fsoft-AIC / UGLF

SCUT-BIP-Lab / 3DTDS-Net

carriex / C3D

SCUT-BIP-Lab / FSTA-Net

XFeiF / ComputerVision_PaperNotes

SCUT-BIP-Lab / L3AM

engindeniz / DialogSummary-VideoQA

mx-mark / SPMNet

jinxiang-liu / UFE-AVS

ZJCV / Non-local

SCUT-BIP-Lab / PB-Net

jeremy-collins / vroom
