Skip to content

Latest commit

 

History

History
11 lines (6 loc) · 807 Bytes

README.md

File metadata and controls

11 lines (6 loc) · 807 Bytes

Prediction of multiword expressions using eye tracking data

This repository contains the source code, data, and analyses behind the paper Using Gaze Data to Predict Multiword Expressions.

Data

We have annotated the text used in GECO eye tracking corpus with regards to occurrences of Verb+Noun and Verb+Particle multiword expressions (MWEs). For details about the data, please refer to the information file in the data folder.

Methodology

We used the structured prediction model CRF to label the sequences according to the classes defined in the tagged dataset.