Image Caption Generator

Project Information

The objective of the project is to predict the captions for the input image. The dataset consists of 8k images and 5 captions for each image. The features are extracted from both the image and the text captions for input. The features will be concatenated to predict the next word of the caption. CNN is used for image and LSTM is used for text. BLEU Score is used as a metric to evaluate the performance of the trained model.

Environment: Google Colab

Libraries

numpy
matplotlib
keras
tensorflow
nltk

Neural Network

VGG16 Network
CNN-LSTM Network

Dataset

https://www.kaggle.com/datasets/adityajn105/flickr8k?resource=download

Accuracy

BLEU-1 Score: 0.516

BLEU-2 Score: 0.293

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Image Caption		Image Caption
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Caption Generator

Project Information

Libraries

Neural Network

Dataset

Accuracy

About

Releases

Packages

Languages

kausik-t/Image-Caption-Generator

Folders and files

Latest commit

History

Repository files navigation

Image Caption Generator

Project Information

Libraries

Neural Network

Dataset

Accuracy

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages