Skip to content

toby-p/nlp-bert-predicting-helpfulness

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

w266-final-project

Final project for UCB MIDS W266 NLP class

Report

The full write up for our experiments and learning is published here RoBERTa and Transfer Learning to Predict Review Helpfulness

Data

The data sets used in this project can be found in the /data folder. We use review data sourced from Amazon and from Yelp. /data /amazon /yelp

EDA

Initial EDA and creation of the Yelp data sets can be found in Yelp_Data_EDA.ipynb

Processing of the Amazon data can be found in Amazon-data-processing-LARGE.ipynb and Amazon-data-processing-SMALL.ipynb

Baseline models

Recreation of the baseline model form Bilal et. al. is in Bilal_et_al_Baseline.ipynb

In Bilal_et_al_baseline_on_Yelp_data.ipynb we fine-tune the baseline model using the Yelp data set, to create a new baseline.

Model Training

Fine-tuning of the RoBERTa model can be found in RoBERTa.ipynb

Model training using transfer learning techniques can be found in transfer_learning.py, train_bilal_baseline.py and train_amazon.py, train_amazon_large.py

The evalution of transfer learning can be found in Evaluating Transfer Learning Models.ipynb

Model checkpoints

Saved models can be found in /results, not that not all models are stored here due to size contraints.

About

Final project for UCB MIDS W266 NLP class

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published