TanyaChutani/Image-Captioning-Generator

ImageCaptionGenerator

Image Caption Generator in Keras

Data

Dataset - Flickr 8k Dataset

Flicker8k_Dataset
Flickr8k_text

Flicker8k_Dataset - Contains 8092 images in JPEG format.
Flickr8k_text - Contains 5 descriptions (captions) for each image.

Model

Built and trained a deep learning model for captioning real-world images.

  • Used a pre-trained InceptionV3 to extract features from each image.
  • Used pre-trained fastText embeddings, which were fed into a stacked bidirectional GRU layer.
  • The image and text representations were combined to predict the next word, one word at a time, until the end of the caption. Greedy search was used at test time.
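The greedy decoding step described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the `startseq`/`endseq` markers, the `greedy_caption` function, and the `toy_model` stand-in (which replaces the trained Keras model with a fixed script) are all hypothetical names chosen for the example.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical start/end markers; the repository may use different tokens.
START, END = "startseq", "endseq"


def greedy_caption(
    next_word_probs: Callable[[Sequence[float], List[str]], Dict[str, float]],
    image_features: Sequence[float],
    max_len: int = 20,
) -> str:
    """Build a caption by always taking the single most probable next word."""
    words = [START]
    for _ in range(max_len):
        probs = next_word_probs(image_features, words)
        best = max(probs, key=probs.get)  # greedy choice: highest probability
        if best == END:
            break
        words.append(best)
    return " ".join(words[1:])  # drop the start marker


def toy_model(image_features: Sequence[float], words: List[str]) -> Dict[str, float]:
    """Stand-in for the trained model (assumption): follows a fixed script."""
    script = ["a", "dog", "runs", END]
    target = script[min(len(words) - 1, len(script) - 1)]
    return {w: (0.9 if w == target else 0.1 / 3) for w in script}


caption = greedy_caption(toy_model, image_features=[0.0])
print(caption)
```

In a real pipeline, `next_word_probs` would wrap the trained model's prediction call and map the output distribution back to vocabulary words; `max_len` guards against captions that never emit the end marker.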

Result

Weights

To Do

  • Add attention
  • Use beam search instead of greedy search
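To illustrate what the beam search item would change, here is a small sketch of the technique under stated assumptions. It is not a planned implementation from this repository: `beam_search_caption`, the `startseq`/`endseq` markers, and the lookup-table `toy_model` are hypothetical, and the score is a plain sum of log-probabilities with no length normalization.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical start/end markers; the repository may use different tokens.
START, END = "startseq", "endseq"


def beam_search_caption(
    next_word_probs: Callable[[Sequence[float], List[str]], Dict[str, float]],
    image_features: Sequence[float],
    beam_width: int = 3,
    max_len: int = 20,
) -> str:
    """Keep the `beam_width` best partial captions at each step."""
    beams: List[Tuple[float, List[str]]] = [(0.0, [START])]
    finished: List[Tuple[float, List[str]]] = []
    for _ in range(max_len):
        candidates = []
        for score, words in beams:
            for word, p in next_word_probs(image_features, words).items():
                if p > 0:  # skip impossible words to avoid log(0)
                    candidates.append((score + math.log(p), words + [word]))
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for score, words in candidates[:beam_width]:
            # Captions that emitted the end marker leave the beam.
            (finished if words[-1] == END else beams).append((score, words))
        if not beams:
            break
    finished.extend(beams)  # include captions cut off by max_len
    best_words = max(finished, key=lambda c: c[0])[1]
    return " ".join(w for w in best_words[1:] if w != END)


# Toy next-word table (assumption) in which the greedy first choice "a"
# leads to a less probable full caption than starting with "the".
TABLE = {
    (START,): {"a": 0.6, "the": 0.4},
    (START, "a"): {"dog": 0.5, "cat": 0.5},
    (START, "the"): {"dog": 0.9, "cat": 0.1},
}


def toy_model(_features: Sequence[float], words: List[str]) -> Dict[str, float]:
    return TABLE.get(tuple(words), {END: 1.0})


wide = beam_search_caption(toy_model, [0.0], beam_width=2)
narrow = beam_search_caption(toy_model, [0.0], beam_width=1)
print(wide, "|", narrow)
```

With `beam_width=1` the procedure reduces to greedy search and commits to "a" early. A wider beam keeps "the" alive long enough to find the more probable caption overall.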