Implementing image captioning, with and without a soft attention model, on the Flickr8k dataset.
This is my implementation of the Show, Attend and Tell paper.
I took assistance from this blog post: https://machinelearningmastery.com/develop-a-deep-learning-caption-generation-model-in-python/
You can see my implementation in this Kaggle kernel. I was unable to get the attention model working, so I trained the model without it.
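For reference, the no-attention model in the blog post linked above is a "merge" architecture: pre-extracted CNN image features and an LSTM-encoded caption prefix are combined to predict the next word. The sketch below follows that design; the specific sizes (4096-d VGG16 features, `vocab_size`, `max_length`) are assumptions taken from the blog's Flickr8k walkthrough, not values confirmed by this repository.

```python
from tensorflow.keras.layers import Input, Dense, Dropout, Embedding, LSTM, add
from tensorflow.keras.models import Model

vocab_size = 7579  # assumed Flickr8k vocabulary size (from the blog post)
max_length = 34    # assumed longest caption length in tokens

# Image branch: 4096-d VGG16 fc2 features projected down to 256-d.
inputs1 = Input(shape=(4096,))
fe1 = Dropout(0.5)(inputs1)
fe2 = Dense(256, activation='relu')(fe1)

# Text branch: caption prefix (token ids) -> embedding -> LSTM state.
inputs2 = Input(shape=(max_length,))
se1 = Embedding(vocab_size, 256, mask_zero=True)(inputs2)
se2 = Dropout(0.5)(se1)
se3 = LSTM(256)(se2)

# Merge both branches and predict a distribution over the next word.
decoder1 = add([fe2, se3])
decoder2 = Dense(256, activation='relu')(decoder1)
outputs = Dense(vocab_size, activation='softmax')(decoder2)

model = Model(inputs=[inputs1, inputs2], outputs=outputs)
model.compile(loss='categorical_crossentropy', optimizer='adam')
```

At inference time the model is run in a loop: start from the start-of-sequence token, predict the next word, append it to the prefix, and repeat until the end token or `max_length` is reached.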
The highest BLEU scores after 20 epochs were:
BLEU-1: 53.0076%
BLEU-2: 28.6551%
BLEU-3: 19.7607%
BLEU-4: 9.4241%
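Scores like these are typically computed over the whole test split with NLTK's `corpus_bleu`, scoring each generated caption against all reference captions for its image (the blog post above evaluates this way; I assume this repo does the same). A minimal sketch on a single toy caption:

```python
from nltk.translate.bleu_score import corpus_bleu

# Toy example: one generated caption scored against its reference captions.
# Real evaluation iterates over every image in the Flickr8k test split.
references = [[
    ['a', 'dog', 'runs', 'across', 'the', 'grass'],
    ['a', 'brown', 'dog', 'is', 'running', 'on', 'grass'],
]]
candidates = [['a', 'dog', 'is', 'running', 'on', 'the', 'grass']]

# The weights select which n-gram orders contribute to each BLEU-n score.
bleu1 = corpus_bleu(references, candidates, weights=(1.0, 0, 0, 0))
bleu2 = corpus_bleu(references, candidates, weights=(0.5, 0.5, 0, 0))
bleu3 = corpus_bleu(references, candidates, weights=(1/3, 1/3, 1/3, 0))
bleu4 = corpus_bleu(references, candidates, weights=(0.25, 0.25, 0.25, 0.25))

print(f'BLEU-1: {bleu1:.4f}')
print(f'BLEU-4: {bleu4:.4f}')
```

Higher-order scores are almost always lower than BLEU-1, since exact 3- and 4-gram matches are rarer than unigram matches, which is why the numbers above fall off sharply from BLEU-1 to BLEU-4.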
This is a first implementation; I plan to optimize it further.