PyTorch implementation for Self-supervised Modal and View Invariant Feature Learning
Updated Jul 5, 2020
Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval" (ACM TOMM 2024).
This repository contains the code for the paper "Extending CLIP for Category-to-image Retrieval in E-commerce" published at ECIR 2022.
VNEL (Visual Named Entity Linking) is a brand-new task that takes a pure image as input and performs entity linking on it, focusing on CBIR, cross-modal retrieval, and multimodal fusion.
Implementation of "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives" in TensorFlow.
This repository contains the code for the paper "Object-centric vs. Scene-centric Image-Text Cross-modal Retrieval: A Reproducibility Study" published at ECIR 2023.
Joint Versus Independent Multiview Hashing for Cross-View Retrieval (IEEE TCYB 2021, PyTorch Code)
[TIP 2024] Code for “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”
An intentionally simple image-to-food cross-modal search. Created by Prithiviraj Damodaran.
PyTorch code for the paper "Complementarity is the king: A multi-modal and multi-grained hierarchical semantic enhancement network for cross-modal retrieval"
The Unified Code of Image-Text Retrieval for Further Exploration.
Deep Semisupervised Cross-modal Retrieval/Cross-view Recognition (IEEE TCYB 2022, PyTorch Code)
Code for the paper "Sentiment-Oriented Metric Learning for Text-to-Image Retrieval", ECIR'21
My master's thesis: Siamese multi-hop attention for cross-modal retrieval.
The code for the paper "GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval" (AAAI'24)
ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
[CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval
An attempt at sentence-to-image style transfer.
[ICASSP 2022] EEG - Music Cross Modal Learning
PyTorch code for cross-modal retrieval on Flickr8k/30k using BERT and EfficientNet