Meme Caption Generation

This repository contains the implementation of our course project for CSCI-376 Natural Language Processing, taught by Prof. Yik-Cheung Tam at NYU Shanghai. Authors: Yuchen Wang (yw3642@nyu.edu) and Yichen Huang (yh2689@nyu.edu), May 2021.

Paper Report

report (pdf)
presentation slides

Pipeline Architecture

pipeline architecture

Example Output

sample model outputs

Data

Code

Data and Preprocessing

Main Pipeline

The two notebooks below each cover everything from data loading to evaluation, and either can be run independently of the other. Note that the BERT splitter is not included in the main pipeline.

  • Baseline: The baseline pipeline using a standard encoder-decoder.
  • Proposed: The proposed pipeline with naive / MMI / CLIP score decoding.
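
To illustrate how naive and MMI decoding can differ, here is a minimal sketch of reranking candidate captions. It assumes the common MMI formulation, score = log p(caption | input) - λ · log p(caption), which may not match the exact objective used in the notebooks. The `Candidate` class, the `lam` weight, and the sample log-probabilities are hypothetical and only serve the example; CLIP score decoding is not shown.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A hypothetical candidate caption with two precomputed scores."""
    text: str
    cond_logprob: float  # log p(caption | input), from the caption model
    lm_logprob: float    # log p(caption), from an unconditional language model


def mmi_score(c: Candidate, lam: float = 0.5) -> float:
    # Assumed MMI objective: penalize captions that are likely
    # regardless of the input (generic captions).
    return c.cond_logprob - lam * c.lm_logprob


def rerank(candidates: list[Candidate], lam: float = 0.5) -> list[Candidate]:
    # Highest MMI score first.
    return sorted(candidates, key=lambda c: mmi_score(c, lam), reverse=True)


# Made-up scores: a generic caption versus a more specific one.
candidates = [
    Candidate("me too", cond_logprob=-2.0, lm_logprob=-1.0),
    Candidate("when the code finally compiles", cond_logprob=-3.0, lm_logprob=-6.0),
]

# Naive decoding keeps the caption with the highest conditional likelihood.
best_naive = max(candidates, key=lambda c: c.cond_logprob).text
# MMI reranking prefers the caption that is less generic.
best_mmi = rerank(candidates)[0].text
```

With these made-up scores, naive decoding picks the generic caption, while MMI reranking picks the more specific one because its unconditional likelihood is much lower.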

Pipeline Components

Model Weights