Code for ALBEF: a new vision-language pre-training method
Updated Sep 20, 2022 - Python
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Data release for the ImageInWords (IIW) paper.
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
The largest multilingual image-text classification dataset. It contains fashion products.
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment
Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.
A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
WWDC22: Enabling Live Text interactions with images in SwiftUI
Caption generator using LAVIS and argostranslate
Contrastive Learning Representations for Images and Text Pairs. Colab implementation of ConVIRT for transfer learning with insufficient data volume.
Download flickr8k, flickr30k image caption datasets
Write text on images with PHP
The first public Vietnamese visual linguistic foundation model(s)
FCLL: A Fine-grained Contrastive Language-Image Learning Model
Some Python scripts to load Vietnamese visual linguistic data