
don't resize embeddings to multiples of 32x by default #313

Merged (1 commit, Jul 22, 2023)

Conversation

@winglian (Collaborator)

Resizing the embeddings to a size larger than the actual number of tokens causes problems with llama.cpp downstream. Let's not enable this by default, for safety.
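For context, the behavior being disabled pads the embedding matrix up to the next multiple of 32 rather than matching the tokenizer's vocab size exactly. A minimal sketch of that rounding and the mismatch it creates (the function name here is illustrative, not axolotl's actual code):

```python
def padded_vocab_size(num_tokens: int, multiple: int = 32) -> int:
    """Round a vocab size up to the next multiple of `multiple`."""
    return ((num_tokens + multiple - 1) // multiple) * multiple

# Llama-2's tokenizer has 32,000 tokens; adding a pad token gives 32,001.
actual = 32_000 + 1
padded = padded_vocab_size(actual)
print(actual, padded, padded - actual)  # 32001 32032 31
```

Those 31 extra embedding rows have no corresponding tokens, so downstream tools like llama.cpp that expect the embedding row count to equal the tokenizer vocab size choke on the converted checkpoint.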

@NanoCode012 (Collaborator)

YES! I had to modify downstream code for this 😂

@NanoCode012 (Collaborator)

While on this topic, I was wondering whether we should consider using the "smart resize" code floating around in FastChat and other repos.
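The "smart resize" trick referenced here (seen in FastChat and elsewhere) initializes newly added embedding rows to the mean of the existing rows instead of leaving them randomly initialized. A pure-Python sketch under that assumption, with the embedding table modeled as a list of row vectors:

```python
def smart_resize(embeddings: list[list[float]], new_size: int) -> list[list[float]]:
    """Grow an embedding table, filling new rows with the mean of existing rows.

    New tokens start at the average embedding rather than random noise,
    which tends to behave more stably before any fine-tuning.
    """
    old_size = len(embeddings)
    if new_size <= old_size:
        return embeddings[:new_size]
    dim = len(embeddings[0])
    mean_row = [sum(row[d] for row in embeddings) / old_size for d in range(dim)]
    return embeddings + [mean_row[:] for _ in range(new_size - old_size)]

table = [[1.0, 2.0], [3.0, 4.0]]
print(smart_resize(table, 3))  # [[1.0, 2.0], [3.0, 4.0], [2.0, 3.0]]
```

In a real model the same idea applies to both the input embeddings and the LM head after `resize_token_embeddings`; this sketch only shows the initialization step.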

@winglian merged commit 3ffb018 into main on Jul 22, 2023
6 checks passed
@winglian deleted the tokenizer-llama2-embeddings branch on July 22, 2023 at 08:10
mkeoliya pushed a commit to mkeoliya/axolotl that referenced this pull request Dec 15, 2023
…/tokenizer-llama2-embeddings

don't resize embeddings to multiples of 32x by default