A list of awesome remote sensing image captioning resources
Updated May 16, 2024 - Python
Pheye - a family of efficient small vision-language models
Automatically generate alt text for images and other objects in PowerPoint presentations using MLLM/VLM
Data release for the ImageInWords (IIW) paper.
Image captioning ML model deployed with Flask and accessed via a Flutter app
A collection of computer vision projects and tools.
A tool to streamline AI image captioning
Python scripts to use for captioning images with VLMs
Testing the Moondream tiny vision model
Tag manager and captioner for image datasets
[ICPR 2024] The official repo for FIDAVL: Fake Image Detection and Attribution using Vision-Language Model
A PyTorch-based deep learning framework for quick development. One-stop for train/predict/server/demo
Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"
Light-ExpansionNet: Enhancing Cost-Efficient Image Captioning through ExpansionNet v2 Optimization.
Image Captioning With MobileNet-LLaMA 3
Deep learning for natural language processing
An integrated web app that captions images, built with ReactJS, Python, and PyTorch
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
PyTorch image captioning model using a CNN-RNN architecture