Multimodal Machine Translation

Implementation of the MSc thesis "Image Informed Neural Machine Translation with Transformers". The Transformer takes text and image features extracted from a ResNet-50 as input. The provided code covers the first of the input image feature cases described in Image_Informed_Neural_Machine_Translation.pdf.

The Multi30K dataset is available here: http://www.statmt.org/wmt16/multimodal-task.html#task1

Training

python train_mm.py -data /path/to/text/data -train_image_feat /path/to/train/image/features -val_image_feat /path/to/validation/image/features
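
For scripted or repeated runs, the training command can be wrapped in a small Python launcher. This is only a sketch: the helper names `build_train_command` and `run_training` are hypothetical and not part of this repository, and it assumes `train_mm.py` sits in the current working directory and should run under the same Python interpreter. Only the three flags shown above are used.

```python
import subprocess
import sys


def build_train_command(data, train_image_feat, val_image_feat, script="train_mm.py"):
    """Return the argument list for the training command shown in this README.

    Hypothetical helper: it only assembles the documented flags.
    """
    return [
        sys.executable, str(script),
        "-data", str(data),
        "-train_image_feat", str(train_image_feat),
        "-val_image_feat", str(val_image_feat),
    ]


def run_training(data, train_image_feat, val_image_feat):
    """Launch training and raise CalledProcessError if the script exits non-zero."""
    cmd = build_train_command(data, train_image_feat, val_image_feat)
    subprocess.run(cmd, check=True)
```

Calling `run_training("/path/to/text/data", "/path/to/train/image/features", "/path/to/validation/image/features")` is then equivalent to the command line above. Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.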

Translate

python translate_mm.py -model /path/to/model/chkpt -src /path/to/source/sentences -vocab /path/to/source/vocabulary -test_image_feat /path/to/test/image/features
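
Because translation depends on four separate inputs, it can help to check that they exist before starting the script. The sketch below assumes each argument is a single file on disk, which this README does not state explicitly. The helper names `missing_paths`, `build_translate_command` and `run_translation` are hypothetical and not part of this repository.

```python
import subprocess
import sys
from pathlib import Path


def missing_paths(*paths):
    """Return, in order, the given paths that do not exist on disk."""
    return [str(p) for p in paths if not Path(p).exists()]


def build_translate_command(model, src, vocab, test_image_feat, script="translate_mm.py"):
    """Assemble the translation command using only the flags documented above."""
    return [
        sys.executable, str(script),
        "-model", str(model),
        "-src", str(src),
        "-vocab", str(vocab),
        "-test_image_feat", str(test_image_feat),
    ]


def run_translation(model, src, vocab, test_image_feat):
    """Check the inputs first, then launch translate_mm.py.

    Assumption: every argument points to a single existing file.
    """
    missing = missing_paths(model, src, vocab, test_image_feat)
    if missing:
        raise FileNotFoundError("Missing input(s): " + ", ".join(missing))
    subprocess.run(build_translate_command(model, src, vocab, test_image_feat), check=True)
```

Failing early on a missing checkpoint or feature file gives a clearer error than one raised from inside the script after the model has started loading.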
