image-captioning

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

image-captioning nodes vlm custom-nodes img2text llm mllm llava comfyui siglip phi15 joytag img2sfx

Updated May 10, 2024
Python

Mamadou-Keita / FIDAVL

Star

[ICPR 2024] The official repo for FIDAVL: Fake Image Detection and Attribution using Vision-Language Model

image-captioning gans image-forensics deepfake diffusion-models soft-prompt-tuning large-language-model vision-language-model vision-question-answering synthetic-image-attribution

Updated May 10, 2024

cstsunfu / dlk

Star

A PyTorch Based Deep Learning Quick Develop Framework. One-Stop for train/predict/server/demo

demo lightning deep-learning pytorch image-classification image-captioning summary grid-search ner relation-extraction nli adversarial-training streamlit

Updated May 9, 2024
Python

jchenghu / ExpansionNet_v2

Star

Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"

computer-vision deep-learning image-captioning

Updated May 7, 2024
Python

va-kiet / Light-ExpansionNet

Star

Light-ExpansionNet: Enhancing Cost-Efficient Image Captioning through ExpansionNet v2 Optimization.

python natural-language-processing computer-vision neural-network artificial-intelligence image-captioning generative-ai

Updated May 7, 2024
Python

reshalfahsi / image-captioning-mobilenet-llama3

Star

Image Captioning With MobileNet-LLaMA 3

nlp cnn pytorch transformer image-captioning image-text flickr8k-dataset mobilenetv3 pytorch-lightning kv-cache rotary-position-embedding grouped-query-attention rms-norm llama3

Updated May 5, 2024
Jupyter Notebook

ejaj / nlp-deep

Star

Deep learning for natural language processing

python deep-learning text-classification machine-translation word-embeddings bag-of-words image-captioning data-preprocessing language-model keras-tensorflow

Updated Apr 29, 2024
Python

darkmatter18 / Caption-AI

Sponsor

Star

An integrated web app that captions image and created with ReactJs and Python, with Pytorch

python reactjs cnn pytorch lstm image-captioning microsoft-coco

Updated May 10, 2024
Jupyter Notebook

OFA-Sys / OFA

Star

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

prompt chinese image-captioning pretrained-models visual-question-answering multimodal text-to-image-synthesis vision-language pretraining referring-expression-comprehension prompt-tuning

Updated Apr 24, 2024
Python

emmareysanchez / ImageCaptioning

Star

Pytorch Image Captioning model using a CNN-RNN architecture

deep-learning pytorch image-captioning beam-search encoder-decoder-model

Updated Apr 22, 2024
Python

Improve this page

Add a description, image, and links to the image-captioning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the image-captioning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

image-captioning

Here are 775 public repositories matching this topic...

iOPENCap / awesome-remote-image-captioning

miguelscarv / pheye

waltervanheuven / auto-alt-text

google / imageinwords

german-zarate / image-captioning-app

HanXinzi-AI / awesome-computer-vision-resources

ArchAngelAries / TagScribeR

ProGamerGov / VLM-Captioning-Tools

antonio-f / Moondream

jhc13 / taggui

gokayfem / ComfyUI_VLM_nodes

Mamadou-Keita / FIDAVL

cstsunfu / dlk

jchenghu / ExpansionNet_v2

va-kiet / Light-ExpansionNet

reshalfahsi / image-captioning-mobilenet-llama3

ejaj / nlp-deep

darkmatter18 / Caption-AI

OFA-Sys / OFA

emmareysanchez / ImageCaptioning

Improve this page

Add this topic to your repo