Additional Experiments Classifying the Sentiment of 50k IMDB Movie Reviews

 

Step 1: Install Dependencies

Install the extra dependencies via

pip install -r requirements-extra.txt
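
After installing, you can quickly confirm that the extra packages import correctly. This is a minimal check, assuming that requirements-extra.txt includes the Hugging Face transformers library used by the train_bert_hf.py script (check the file for the exact list):

# Minimal sanity check; assumes transformers and pandas are among the extra dependencies
import transformers
import pandas

print("transformers:", transformers.__version__)
print("pandas:", pandas.__version__)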

 

Step 2: Download Dataset

The scripts in this folder use the 50k movie reviews from IMDb (dataset source) to predict whether a movie review is positive or negative.

Run the following code to create the train.csv, validation.csv, and test.csv datasets:

python download_prepare_dataset.py
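
Once the script finishes, you can inspect the resulting CSV files, for example with pandas. The snippet below is only a sketch for checking the splits; it makes no assumption about the column names beyond printing them:

# Inspect the generated splits (run from the same folder as the CSV files)
import pandas as pd

for split in ("train.csv", "validation.csv", "test.csv"):
    df = pd.read_csv(split)
    print(split, df.shape, list(df.columns))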

 

Step 3: Run Models

A 124M parameter GPT-2 model, as used in the main chapter, starting with the pretrained weights and finetuning all weights:

python train_gpt.py --trainable_layers "all" --num_epochs 1
Ep 1 (Step 000000): Train loss 3.706, Val loss 3.853
Ep 1 (Step 000050): Train loss 0.682, Val loss 0.706
...
Ep 1 (Step 004300): Train loss 0.199, Val loss 0.285
Ep 1 (Step 004350): Train loss 0.188, Val loss 0.208
Training accuracy: 95.62% | Validation accuracy: 95.00%
Training completed in 9.48 minutes.

Evaluating on the full datasets ...

Training accuracy: 95.64%
Validation accuracy: 92.32%
Test accuracy: 91.88%
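
Conceptually, this setup attaches a small 2-class output layer to the pretrained GPT-2 backbone and finetunes the whole model on the review labels. The sketch below illustrates that idea using the Hugging Face transformers API for brevity; the actual train_gpt.py script uses the from-scratch GPT implementation from the main chapters, so the names here (GPT2Model, the last-token classification head) are illustrative assumptions only:

# Sketch: pretrained GPT-2 backbone + 2-class classification head (untrained head,
# for illustration of the architecture only)
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")       # 124M parameter variant
backbone = GPT2Model.from_pretrained("gpt2")
classifier = torch.nn.Linear(backbone.config.n_embd, 2)  # positive / negative

inputs = tokenizer("This movie was surprisingly good!", return_tensors="pt")
hidden = backbone(**inputs).last_hidden_state            # (batch, seq_len, emb_dim)
logits = classifier(hidden[:, -1, :])                    # classify from the last token
print(logits)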



A 340M parameter encoder-style BERT model:

python train_bert_hf.py --trainable_layers "all" --num_epochs 1 --model "bert"
Ep 1 (Step 000000): Train loss 0.848, Val loss 0.775
Ep 1 (Step 000050): Train loss 0.655, Val loss 0.682
...
Ep 1 (Step 004300): Train loss 0.146, Val loss 0.318
Ep 1 (Step 004350): Train loss 0.204, Val loss 0.217
Training accuracy: 92.50% | Validation accuracy: 88.75%
Training completed in 7.65 minutes.

Evaluating on the full datasets ...

Training accuracy: 94.35%
Validation accuracy: 90.74%
Test accuracy: 90.89%
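
For comparison, encoder-style models like BERT come with a ready-made sequence-classification head in the Hugging Face transformers library. The snippet below is a rough, standalone equivalent of the "bert" setting; the checkpoint name "bert-large-uncased" (roughly 340M parameters) is an assumption for illustration and may differ from what train_bert_hf.py loads:

# Sketch: BERT with a randomly initialized 2-class head (before any finetuning)
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-large-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-large-uncased", num_labels=2
)

inputs = tokenizer("A disappointing sequel.", return_tensors="pt")
print(model(**inputs).logits)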



A 66M parameter encoder-style DistilBERT model (distilled down from a 340M parameter BERT model), starting with the pretrained weights and finetuning all weights:

python train_bert_hf.py --trainable_layers "all" --num_epochs 1 --model "distilbert"
Ep 1 (Step 000000): Train loss 0.693, Val loss 0.688
Ep 1 (Step 000050): Train loss 0.452, Val loss 0.460
...
Ep 1 (Step 004300): Train loss 0.179, Val loss 0.272
Ep 1 (Step 004350): Train loss 0.199, Val loss 0.182
Training accuracy: 95.62% | Validation accuracy: 91.25%
Training completed in 4.26 minutes.

Evaluating on the full datasets ...

Training accuracy: 95.30%
Validation accuracy: 91.12%
Test accuracy: 91.40%
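
The --trainable_layers flag controls which parameters are updated during finetuning (for example, only the last transformer block plus the output layers instead of all weights). The sketch below shows how such selective freezing could look for DistilBERT via the transformers library; the exact layer selection implemented in train_bert_hf.py may differ:

# Sketch: freeze everything except the last transformer block and the classification head
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2
)

for param in model.parameters():
    param.requires_grad = False
for param in model.distilbert.transformer.layer[-1].parameters():
    param.requires_grad = True
for param in model.pre_classifier.parameters():
    param.requires_grad = True
for param in model.classifier.parameters():
    param.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable parameters: {trainable:,}")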