Word alignment methods to extract bi/multi -lingual lexica
-
Updated
Jul 5, 2019 - Python
Word alignment methods to extract bi/multi -lingual lexica
Leveraging Almost Black-Box NMT for Word Alignment
A pipeline for POS tagging, sentence alignment, word alignment, and transliteration of texts in 30+ languages.
This project provide an API to perform word alignment
Word-alignment models for Bible translations in 100+ historical and contemporary languages
This is simple replica of IBM Model-1. It is trained to find word-alignments between two Indo-European languages - English and Hindi
A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XLM and CONLL-u files, linguistic processing with the Stanza pipeline, machine translation and word alignment with the Eflomal tool.
Using alignments and posteriorgrams extracted from lyrics as novel input into source separation models
Demonstration of AI/neural word alignment of English & Japanese text using mBERT-based machine learning models.
Assignment 1: Word Alignment in 'Statistical Machine Translation' course by Dr. Roee Aharoni at Bar-Ilan University.
Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization (ACL 2019)
Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries (ACL 2020)
WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction, to appear at ACL 2023 main conference.
Inference library and evaluation script for WSPAlign (https://github.com/qiyuw/WSPAlign)
Word Alignment Visualization is a Python package for visualizing word alignments between two sentences in a Jupyter notebook. The package provides an interactive widget that displays original and translated sentences with word alignment lines.
Create "pretty" graphs for aligned sentences
Enhanced awesome-align for low-resource languages and noise simulation: https://arxiv.org/abs/2301.09685
Java application for creating bilingual word alignments
A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.
Add a description, image, and links to the word-alignment topic page so that developers can more easily learn about it.
To associate your repository with the word-alignment topic, visit your repo's landing page and select "manage topics."