Overview

In the project, we implemented a Neural Network for Image Captioning. In this model, we give an image to the network and generate sentences that describe the image. This model implements a CNN for identifying high-level image features and an LSTM for building sentences. We use the Flickr8k dataset to train our image captioning network. It includes 8091 RGB images and 40455 sentences, five sentences for each image gathered from different persons.

Dataset

Flickr8k dataset: https://www.kaggle.com/adityajn105/flickr8k

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
HW3_Q1_DNN (1).ipynb		HW3_Q1_DNN (1).ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Dataset

About

Releases

Packages

Languages

erfunmirzaei/RCNN-ImageCaptioning

Folders and files

Latest commit

History

Repository files navigation

Overview

Dataset

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages