Official PyTorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)
Updated May 26, 2024 - Python
Code implementation of the paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval" (ACM TOMM 2024).
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
[TIP2024] The code of “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”
[CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval
The Unified Code of Image-Text Retrieval for Further Exploration.
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
[IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
An AI-powered interactive video retrieval system
The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.
Official PyTorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
An unofficial implementation of the paper "Objects that Sound" (ECCV 2018).
This repository contains the code for the paper "Extending CLIP for Category-to-image Retrieval in E-commerce" published at ECIR 2022.