Change perplexity to be calculated with base e #242

mathemakitten · 2022-08-10T17:35:29Z

Merging with the open docs PR for perplexity, #238.

Closes #241.

HuggingFaceDocBuilderDev · 2022-08-10T17:38:46Z

The documentation is not available anymore as the PR was closed or merged.

mathemakitten · 2022-08-10T22:05:13Z

A comparison, for reference, on the sentence ['Hugging Face is a startup based in New York City and Paris']

Previously, base 2:

import evaluate
perplexity = evaluate.load("perplexity", module_type="metric")
input_texts = ['Hugging Face is a startup based in New York City and Paris']
results = perplexity.compute(model_id='gpt2',
                             add_start_token=False,
                             predictions=input_texts)
print(list(results.keys()))

ppl = 19.1218

Now, base e: ppl = 70.6083

Compare with the canonical example in transformers from here:

encodings = tokenizer(["Hugging Face is a startup based in New York City and Paris"], return_tensors="pt")

max_length = model.config.n_positions
stride = 512

nlls = []
for i in tqdm(range(0, encodings.input_ids.size(1), stride)):
    begin_loc = max(i + stride - max_length, 0)
    end_loc = min(i + stride, encodings.input_ids.size(1))
    trg_len = end_loc - i  # may be different from stride on last loop
    input_ids = encodings.input_ids[:, begin_loc:end_loc].to(device)
    target_ids = input_ids.clone()
    target_ids[:, :-trg_len] = -100

    with torch.no_grad():
        outputs = model(input_ids, labels=target_ids)
        neg_log_likelihood = outputs[0] * trg_len

    nlls.append(neg_log_likelihood)

ppl = torch.exp(torch.stack(nlls).sum() / end_loc)

ppl = 70.6075

And the usual:

model = GPT2LMHeadModel.from_pretrained('gpt2')
tokenizer = GPT2TokenizerFast.from_pretrained('gpt2')

loss = model(input_ids, labels=input_ids)[0]
print(np.exp(loss.cpu().detach().numpy()))

ppl = 70.60746

Fix examples in perplexity measurement docs

lvwerra

Thanks for updating this and fixing the docs. Looks good to me, the only thing I would add is an explicit comment in the docstring (_DESCRIPTION) as well as at the very beginning of the readme that we compute ppl with base e.

mathemakitten added 4 commits August 9, 2022 10:58

Fix perplexity 'measurement' card

d08bc04

Fix other example as well

28eafdc

Fix kwarg

52ca062

Change perplexity to be calculated with base e

738b641

mathemakitten marked this pull request as draft August 10, 2022 19:03

mathemakitten and others added 4 commits August 10, 2022 15:24

Merge pull request #238 from huggingface/hn-fix-perplexity-docs

39d1bd1

Fix examples in perplexity measurement docs

Change ppl result #s in metric cards

a5d9aee

Change ppl numbers for wikitext in metric cards

6249a87

More precise language on the metric cards

ed939db

mathemakitten marked this pull request as ready for review August 11, 2022 00:05

Cleanup docs and add note about recent convention on base e

0ce556f

lvwerra approved these changes Aug 15, 2022

View reviewed changes

Add notes for exponent base more prominently

f42c655

mathemakitten merged commit 940d6de into main Aug 15, 2022

mathemakitten deleted the hn-base-e branch August 15, 2022 17:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change perplexity to be calculated with base e #242

Change perplexity to be calculated with base e #242

mathemakitten commented Aug 10, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 10, 2022 •

edited

Loading

mathemakitten commented Aug 10, 2022 •

edited

Loading

lvwerra left a comment

Change perplexity to be calculated with base e #242

Change perplexity to be calculated with base e #242

Conversation

mathemakitten commented Aug 10, 2022 • edited Loading

HuggingFaceDocBuilderDev commented Aug 10, 2022 • edited Loading

mathemakitten commented Aug 10, 2022 • edited Loading

lvwerra left a comment

Choose a reason for hiding this comment

mathemakitten commented Aug 10, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 10, 2022 •

edited

Loading

mathemakitten commented Aug 10, 2022 •

edited

Loading