
Adding the following models: Vicuna, Koala, Pythia-Chat-Base-7B, GPT-NeoXT-Chat-Base-20B #1268

Merged

Conversation

@ViktorThink (Contributor) commented Apr 17, 2023

Adding support for using Vicuna, Koala, Pythia-Chat-Base-7B, GPT-NeoXT-Chat-Base-20B for mining on netuid 1.

Closing my previous pull requests #1266 and #1261, since this new pull request includes those models and it was cleaner to have them all in the same branch.

Vicuna, Pythia, and NeoXT have been tested and work properly on netuid 1. I plan to test Koala on netuid 1 today if there are active validators.

@ViktorThink changed the title to "Adding the following models: Vicuna, Koala, Pythia-Chat-Base-7B, GPT-NeoXT-Chat-Base-20B" on Apr 17, 2023
@camfairchild (Collaborator) left a comment


Seems good. Not sure about the license implications.

Review comments (all marked outdated and resolved):

  • neurons/text/prompting/miners/neoxt/neuron.py
  • neurons/text/prompting/miners/koala/neuron.py
  • neurons/text/prompting/miners/vicuna/README.md (two threads)
  • neurons/text/prompting/miners/vicuna/neuron.py
ifrit98 and others added 5 commits on April 19, 2023 at 15:13, co-authored by Cameron Fairchild <cameron@fairchild.dev>.
@ifrit98 (Contributor) left a comment


Looks good. Thanks for adding tweaks/suggestions @camfairchild

@ifrit98 (Contributor) commented Apr 19, 2023

@ViktorThink Nice work!

A few Qs:

  • Can you provide a complete example for running your Koala miner? Given the instructions and code provided, you cannot instantiate a Koala miner, even after converting the weights.
  • Do you use a LLaMA tokenizer? If not, what tokenizer should be used with this model? (unclear)

Command run:

python neurons/text/prompting/miners/koala/neuron.py --koala.model_name /home/jason/sandbox/PR/llama/llama_weights/koala/7B/model --netuid 1 --logging.debug --wallet.hotkey miner4

Error message (indicating no tokenizer was found, which makes sense: the README doesn't mention a tokenizer at all, and it doesn't explain how to go from the final Koala weights to a ready-to-run miner):

HFValidationError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/home/jason/sandbox/PR/llama/llama_weights/koala/7B/model'. Use `repo_type` argument if needed.

The error is on line 45:
self.tokenizer = AutoTokenizer.from_pretrained( self.config.koala.model_name, use_fast=False )

https://github.com/opentensor/bittensor/blob/b179b334afc16ed7a8f654fcc1b350abcf559ef8/neurons/text/prompting/miners/koala/neuron.py#L45
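
For reference, the resolution logic behind that failure, as a minimal sketch (the path below is hypothetical):

from transformers import AutoTokenizer

# from_pretrained loads from disk only when its argument is an existing
# directory containing tokenizer files (e.g. tokenizer.model,
# tokenizer_config.json). Otherwise the string falls through to Hub
# repo-id validation, where ids must look like 'name' or
# 'namespace/name', so an absolute filesystem path fails that check and
# produces the HFValidationError above.
tokenizer = AutoTokenizer.from_pretrained(
    "/path/to/converted/koala",  # hypothetical; must hold the tokenizer files
    use_fast=False,
)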

@ViktorThink (Contributor, Author):

@ifrit98 Ah, nice catch. Yes, it's the same tokenizer as LLaMA. I'll update the documentation later today to show exactly how to do it.
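
In the meantime, a minimal sketch of one way to wire it up (paths are hypothetical): save the LLaMA tokenizer into the converted Koala directory so the miner's local tokenizer load resolves.

from transformers import LlamaTokenizer

# Load the original LLaMA tokenizer (the directory holding tokenizer.model)
# and save it next to the converted Koala weights, so that
# AutoTokenizer.from_pretrained(<model dir>, use_fast=False) finds it locally.
tokenizer = LlamaTokenizer.from_pretrained("/path/to/llama/tokenizer")
tokenizer.save_pretrained("/path/to/converted/koala")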

@ViktorThink (Contributor, Author) commented Apr 19, 2023

@ifrit98 I had missed adding the last step of the conversion process before starting the miner.

Converting the Koala Weights to HuggingFace Transformers

To run the model on the miner, it has to be converted to the Hugging Face Transformers format. To do so, use the following command:

python -m EasyLM.models.llama.convert_easylm_to_hf \
    --load_checkpoint='params::path/to/koala/checkpoint' \
    --tokenizer_path='path/to/llama/tokenizer' \
    --model_size='13b' \
    --output_dir='path/to/output/huggingface/koala/checkpoint'

(--model_size is one of '7b', '13b', '30b' or '65b'. The size note has to stay off the command itself, since trailing text after a backslash breaks the line continuation.)
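
To sanity-check the converted checkpoint before starting the miner, a quick load test (a sketch; the path just mirrors --output_dir above):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

ckpt = "path/to/output/huggingface/koala/checkpoint"  # the --output_dir from above

# If both calls succeed, the directory is a complete HF checkpoint
# (config, weights and tokenizer) and can be passed to --koala.model_name.
tokenizer = AutoTokenizer.from_pretrained(ckpt, use_fast=False)
model = AutoModelForCausalLM.from_pretrained(ckpt, torch_dtype=torch.float16)
print(model.config.model_type, model.num_parameters())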

Starting the Miner

python3 neurons/text/prompting/miners/koala/neuron.py --koala.model_name model_path_or_huggingface_identifier

EasyLM has not released any converted versions of the models, only deltas, but I saw on Hugging Face that there are already converted checkpoints that could be loaded directly.

I don't actually have the LLaMA weights in torch format, so I will look into adding a second conversion option for those who have LLaMA in HF format but not in torch.

@ifrit98 (Contributor) commented Apr 19, 2023

Converted and miner is up and running. Nice work! Closing.

@ifrit98 closed this Apr 19, 2023
@ifrit98 reopened this Apr 19, 2023
@ifrit98 merged commit 9133e30 into opentensor:text_prompting Apr 19, 2023
@ViktorThink deleted the text_prompting_new_models branch May 7, 2023 14:10