Data for the DiMSUM shared task at SEMEVAL 2016
Foma-based multi-word tagger and morphological analyzer
Comparison between various noun compound embeddings
Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used with languages other than English.
A set of tools for extracting multiword expressions from parallel corpora for the Moses statistical machine translation system
Python implementation of Substitution-driven Measures of Association
Learning English expressions has never been so easy
Code for NAACL 2019 paper: "Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions"
Java implementation of Substitution-driven Measures of Association, which can be used to identify multiword expressions (MWEs)
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
Data and code for the paper "ID10M: Idiom Identification in 10 Languages" (NAACL 2022).
Data and code for the paper "NER4ID at SemEval-2022 Task 2: Named Entity Recognition for Idiomaticity Detection".
Repo for the paper "MWE as WSD: Solving Multi-Word Expression Identification with Word Sense Disambiguation"
A Python package for Exploratory Data Analysis (EDA) for text-based data.
Rigor-Mortis is an online GWAP (game with a purpose) in which players find multiword expressions in French sentences
Adjacent code for the paper prepared for the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), 25th May, 2024.