multimodal-learning

This repository contains an official PyTorch implementation of Position-aware Location Regression Network (PLRN) for temporal video grounding, which is presented in the paper Position-aware Location Regression Network for Temporal Video Grounding.

attention-mechanism multimodal-learning video-grounding

Updated Apr 16, 2022
Python

abhinav-neil / socratic-models

Star

Socratic models for multimodal reasoning & image captioning

image-captioning clip multimodal-learning visual-question-answering gpt-3 chain-of-thought flan-t5 vision-language-learning

Updated Jun 4, 2023
Jupyter Notebook

anaezquerro / imx-evqa

Star

Interactive Multimodal Explanations for Easy Visual Question Answering

natural-language-processing computer-vision multimodal-learning explainable-ai

Updated Mar 13, 2024
Jupyter Notebook

shantistewart / Emo-CLIM

Star

Emo-CLIM: Emotion-Aligned Contrastive Learning Between Images and Music [ICASSP 2024]

music-information-retrieval multimodal-learning contrastive-learning

Updated Jan 15, 2024
Python

DFKI-Earth-And-Space-Applications / MVCC_IGARSS

Star

Public repository of our IGARSS 2023 submission

remote-sensing agriculture-research data-fusion multimodal-learning multiview-learning multi-view-learning crop-classification multi-modal-learning datafusion multisensor-fusion croptypes crop-type-mapping

Updated Jul 27, 2023
Python

talipucar / talipucar.github.io_old

Star

Showcases ongoing, and completed projects within various research themes.

domain-adaptation self-supervised multimodal-learning multimodal-deep-learning self-supervised-learning domain-translation

Updated Dec 28, 2022

stoneMo / OneAVM

Star

Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)

multimodal-learning self-supervised-learning sound-source-localization audio-visual-correspondence audio-visual-learning sound-source-separation

Updated Jun 1, 2023

willxxy / awesome-mmps

Star

Corpus of resources for multimodal machine learning with physiological signals

machine-learning deep-learning signal-processing physiological-signals multimodal-learning multimodal multimodal-deep-learning multimodal-data

Updated May 20, 2024

minjoong507 / MPGN

Star

[EMNLP 2022] Pytorch code for "Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval"

multimodal-learning video-retrieval video-grounding

Updated Jan 15, 2024
Python

abhinav-neil / multimodal-dl-biomarkers

Star

A multimodal deep learning framework for prediction of cancer biomarkers

deep-learning medical-imaging feature-extraction biomarkers multimodal-learning

Updated Nov 5, 2023
Python

koushikvikram / multimodal-image-retrieval

Star

📝🔍🖼️ A deep learning application for retrieving images by searching with text.

Updated Dec 14, 2021
Jupyter Notebook

waybarrios / guidance-based-video-grounding

Star

[ICCV 2023] The official PyTorch implementation of the paper: "Localizing Moments in Long Video Via Multimodal Guidance"

pytorch multimodal-learning accepted-papers moment-retrieval video-language iccv2023

Updated Jul 19, 2023

jtonglet / Numerical-Hybrid-QA-Literature

Star

A list of Numerical Multimodal reasoning papers and their implementation

question-answering hybrid multimodal-learning math-word-problem numerical-reasoning

Updated May 13, 2024

Jiamian-Wang / T-MASS-text-video-retrieval

Star

Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"

retrieval stochastic multimodal-learning multimodal text-video-retrieval

Updated Apr 5, 2024
Python

Improve this page

Add a description, image, and links to the multimodal-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodal-learning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multimodal-learning

Here are 234 public repositories matching this topic...

prasenjit52282 / BuStop

david-alvarez-rosa / referring-expression-comprehension

stoneMo / MGN

YingWANGG / M2IB

Lukeasargen / Show-Attend-and-Tell-Pytorch-Lightning

yookyungkho / Multimodal-Entailment-pytorch

sunoh-kim / PLRN

abhinav-neil / socratic-models

anaezquerro / imx-evqa

shantistewart / Emo-CLIM

DFKI-Earth-And-Space-Applications / MVCC_IGARSS

talipucar / talipucar.github.io_old

stoneMo / OneAVM

willxxy / awesome-mmps

minjoong507 / MPGN

abhinav-neil / multimodal-dl-biomarkers

koushikvikram / multimodal-image-retrieval

waybarrios / guidance-based-video-grounding

jtonglet / Numerical-Hybrid-QA-Literature

Jiamian-Wang / T-MASS-text-video-retrieval

Improve this page

Add this topic to your repo