BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations (EMNLP 2023)
The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)
Represent, send, store and search multimodal data
Remote sensing SAR-optical land-use classification in PyTorch; high-resolution remote sensing semantic segmentation / land-cover segmentation / land-cover classification
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Analyze unstructured data with Towhee: reverse image search, reverse video search, audio classification, question answering systems, molecular search, and more.
Create Disco Diffusion artworks in one line
A curated list of different papers and datasets in various areas of audio-visual processing
Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]
A cDCGAN model for audio-to-image generation: a cross-modal analysis using deep learning techniques
[IEEE T-IP 2020] Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition
[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition
A hub hosting essential remote sensing datasets.
Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
This repository provides a comprehensive collection of research papers on multimodal representation learning, all cited and discussed in the accepted survey at https://dl.acm.org/doi/abs/10.1145/3617833.
Code for COBRA: Contrastive Bi-Modal Representation Algorithm (https://arxiv.org/abs/2005.03687)