In this project, I design and train a CNN-RNN (Convolutional Neural Network - Recurrent Neural Network) model for automatically generating image captions. The network is trained on the Microsoft Common Objects in COntext (MS COCO) dataset. The image captioning model is displayed below.
Here are some predictions from my model.
There are 500 predictions in the samples folder.
The Udacity repository for this project: CVND---Image-Captioning-Project