CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
-
Updated
Dec 17, 2023 - Python
CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
Deep learning-based image captioning with Flickr8k dataset. Code includes data prep, model training, and a Streamlit app.
📷 Deployed image captioning ML model using Flask and access via Flutter app
Image Caption Generator implemented using Tensorflow and Keras in a Python Jupyter Notebook. The goal is to describe the content of an image by using a CNN and RNN.
Image captioning using beam search heuristic on top of the encoder-decoder based architecture
pre-trained model and source code for generate description of images.
Automatically generates captions for an image using Image processing and NLP. Model was trained on Flickr30K dataset.
Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the features (attributes, style, functionality etc.) of the items and increase online sales by enticing more customers.
This Streamlit app is designed for image captioning and tagging using the Google Gemini AI
Inspired from the paper "Show Attend and Tell". This project's aim was to train a neural network which can provide descriptive text for a given image.
This is a Deep Learning model which uses Computer Vision and NLP to generate captions for images.
COL774 Machine Learning Spring-2019-2020 IIT Delhi. Instructor - Prof. Parag Singla
An Image captioning web application combines the power of React.js for front-end, Flask and Node.js for back-end, utilizing the MERN stack. Users can upload images and instantly receive automatic captions. Authenticated users have access to extra features like translating captions and text-to-speech functionality.
Image captioning model with Resnet50 encoder and LSTM decoder
🚀 Image Caption Generator Project 🚀 🧠 Building Customized LSTM Neural Network Encoder model with Dropout, Dense, RepeatVector, and Bidirectional LSTM layers. Sequence feature layers with Embedding, Dropout, and Bidirectional LSTM layers. Attention mechanism using Dot product, Softmax attention scores,...
Simple image caption generator built upon Xception net using CNN and LSTM.
Fabricating a Python application that generates a caption for a selected image. Involves the use of Deep Learning and NLP Frameworks in Tensorflow, Keras and NLTK modules for data processing and creation of deep learning models and their evaluation.
Add a description, image, and links to the image-caption-generator topic page so that developers can more easily learn about it.
To associate your repository with the image-caption-generator topic, visit your repo's landing page and select "manage topics."