legacy to init the slow tokenizer when converting from slow was wrong #30972

ArthurZucker · 2024-05-22T15:55:32Z

What does this PR do?

A small nit but the arg was not passed. cc @itazap one fix already 😉

amyeroberts

Thanks for fixing!

amyeroberts · 2024-05-22T16:03:05Z

src/transformers/models/llama/tokenization_llama_fast.py

@@ -151,9 +151,6 @@ def __init__(
        self.legacy = legacy

        if add_prefix_space is not None:
-            logger.warning_once(


Why remove this?

It's not helpful and conversion time is fast, and if we save (serialize it) then you have the warning while you shuld not!

HuggingFaceDocBuilderDev · 2024-05-22T16:20:10Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…#30972)

…huggingface#30972)

legacy to init the slow tokenizer when converting from slow was wrong

22214c0

ArthurZucker requested a review from amyeroberts May 22, 2024 15:55

amyeroberts approved these changes May 22, 2024

View reviewed changes

ArthurZucker merged commit 1d568df into main May 22, 2024
19 checks passed

ArthurZucker deleted the patch-llama-tokenizer branch May 22, 2024 16:06

ArthurZucker added a commit that referenced this pull request May 22, 2024

legacy to init the slow tokenizer when converting from slow was wrong (…

0414185

…#30972)

itazap pushed a commit that referenced this pull request May 24, 2024

legacy to init the slow tokenizer when converting from slow was wrong (…

e790c80

…#30972)

itazap pushed a commit that referenced this pull request May 30, 2024

legacy to init the slow tokenizer when converting from slow was wrong (…

d799d67

…#30972)

zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Jun 11, 2024

legacy to init the slow tokenizer when converting from slow was wrong (…

e1c8ca9

…huggingface#30972)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

legacy to init the slow tokenizer when converting from slow was wrong #30972

legacy to init the slow tokenizer when converting from slow was wrong #30972

ArthurZucker commented May 22, 2024

amyeroberts left a comment

amyeroberts May 22, 2024

ArthurZucker May 22, 2024

HuggingFaceDocBuilderDev commented May 22, 2024

legacy to init the slow tokenizer when converting from slow was wrong #30972

legacy to init the slow tokenizer when converting from slow was wrong #30972

Conversation

ArthurZucker commented May 22, 2024

What does this PR do?

amyeroberts left a comment

Choose a reason for hiding this comment

amyeroberts May 22, 2024

Choose a reason for hiding this comment

ArthurZucker May 22, 2024

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented May 22, 2024