Skip to content

ravikanagpal/Gender_Bias_Coref_CMPUT_622

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Measuring Gender Bias in Coreference Resolution Systems

Team

Student name CCID
Ravika Nagpal ravika
Natalie Hervieux nhervieu

Execution Instructions

Run

Use the google colab notebooks provided in src folder to run the two models:

  1. E2E model - provided by Allen NLP and based on the work of Lee, Kenton & He, Luheng & Lewis, Mike & Zettlemoyer, Luke. 2017. End-to-end Neural Coreference Resolution.

  2. Neural Coref model - provided by Huggingface and based on the work of Kevin Clark and Christopher D. Manning. 2016. Deep reinforcement learning for mention-ranking coreference models. In Empirical Methods on Natural Language Processing.

Data

The data used can be found in data folder. The csv files are taken from the github repo of the paper Levy, Shahar & Lazar, Koren & Stanovsky, abriel. (2021). Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation.

Apart from the csv files, the data folder contains folder gold_stats which contain files showing our analysis on the BUG dataset which is also reported in the report.

Output

The output of the run done on gold_BUG dataset is stored in csv files corresponding to each model and can be found in the output directory.


Releases

No releases published

Packages

No packages published