Create a large, well-managed and clean data-set for the task of music composition for video soundtracks.
A fully deployable React Native mobile app that classifies incoming messages in messaging apps as important or disturbing, using a multi-modal machine learning architecture for text classification, image classification, and YouTube video link classification.
A list of research papers on knowledge-enhanced multimodal learning
A dataset of egocentric vision, eye-tracking and full body kinematics from human locomotion in out-of-the-lab environments. Also, different use cases of the dataset along with example code.
Analyzing hateful memes (resources: the Hateful Memes Challenge)
Experiments combining Multi-Modal Causal Attention with Multi-Grouped Query Attention
A web service for an AI poet that looks at images and writes poems.
Facebook Marketplace is a platform for buying and selling products on Facebook. This project involves training a multimodal deep neural network model that predicts the category of a product based on its image and text description.
COMPSCI 696DS Industry Mentorship Program with Meta Reality Labs: Ambient AI: Multimodal Wearable Sensor Understanding (Experiments in Distilling Knowledge in Cross-Modal Contrastive Learning.)
The purpose of this project is to build an NLP model that makes medical abstracts easier to read.
Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR 2022)
Streamlit app demonstrating multi-modal (vision + language) modelling in PyTorch.
Applied Deep Learning (深度學習之應用) by Vivian Chen (陳縕儂) at NTU CSIE
A Discord chatbot built on the Mistral and LLaVA models
📝🔍🖼️ A deep learning application for retrieving images by searching with text.
Code for the paper "Multiomics dynamic learning enables personalized diagnosis and prognosis for pan-cancer and cancer subtypes"
Official implementation of the work "Text-Driven Image Editing via Learnable Regions" (CVPR 2024)
[IEEE Access'22] A Deep Attentive Multimodal Learning Approach for Disaster Identification
My master thesis: Siamese multi-hop attention for cross-modal retrieval.
PyTorch data loaders and abstractions for multi-modal data.