Image Caption Generator

Image caption generator is a model which generates a caption that describes the contents of the image. It requires a model from computer vision to understand the content of the image and a language model from NLP to translate the understanding of the image into words.

Deep Learning Models have provided an excellent way to get results for examples of this problem.

Dataset:

FLICKR_8K. This dataset includes around images along with 5 different captions written by different people for each image.

The Block diagram of model used in the project

Flow of the Project

1. Cleaning the Captions

2. Extracting Features of image

3. Preprocessing of Image and Text Data

4. Training on LSTM Model

5. Predicting Captions and Evaluating performance

VGG Model Summary

Pre trained VGG16 Model has been used to extract the features of the image.

LSTM Model Summary

Predictions by the model

Evaluation using BLEU score

Good Captions

Bad Captions

Conclusion

The Model has successfully generated captions for images. The performance of the model can be further improved by hyperparameter training.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Images		Images
Saved_models		Saved_models
Caption_Images.ipynb		Caption_Images.ipynb
Flickr8k.token.txt		Flickr8k.token.txt
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Images

Images

Saved_models

Saved_models

Caption_Images.ipynb

Caption_Images.ipynb

Flickr8k.token.txt

Flickr8k.token.txt

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Image Caption Generator

Dataset:

The Block diagram of model used in the project

Flow of the Project

1. Cleaning the Captions

2. Extracting Features of image

3. Preprocessing of Image and Text Data

4. Training on LSTM Model

5. Predicting Captions and Evaluating performance

VGG Model Summary

LSTM Model Summary

Predictions by the model

Evaluation using BLEU score

Good Captions

Bad Captions

Conclusion

About

Releases

Packages

Languages

License

AniketDogra/Caption-Generator

Folders and files

Latest commit

History

Repository files navigation

Image Caption Generator

Dataset:

The Block diagram of model used in the project

Flow of the Project

1. Cleaning the Captions

2. Extracting Features of image

3. Preprocessing of Image and Text Data

4. Training on LSTM Model

5. Predicting Captions and Evaluating performance

VGG Model Summary

LSTM Model Summary

Predictions by the model

Evaluation using BLEU score

Good Captions

Bad Captions

Conclusion

About

Topics

Resources

License

Stars

Watchers

Forks

Languages