Add StableLM support #616

Merged: xenova merged 14 commits into huggingface:main on Mar 7, 2024

Conversation

@D4ve-R (Contributor) commented Mar 1, 2024

Adds support for

  • StableLmForCausalLM
  • StableLmForSequenceClassification

Closes #549

@D4ve-R (Contributor, Author) commented Mar 1, 2024

StableLM support in Optimum is currently underway: huggingface/optimum#1719

@xenova (Collaborator) commented Mar 1, 2024

I know it's still a draft, but I added a commit to get it into a working state :)

Example usage:

import { pipeline } from '@xenova/transformers';

const generator = await pipeline('text-generation', 'Xenova/tiny-random-StableLmForCausalLM')
const output = await generator('hi')
console.log(output);

It's just a randomly initialized tiny model, so the output is gibberish. I will export some larger models too.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@D4ve-R (Contributor, Author) commented Mar 2, 2024

Awesome! 😎
@xenova, how did you create the test model? Did you just export a randomly initialized PyTorch model with the same architecture as StableLM?

@xenova (Collaborator) commented Mar 2, 2024

Yes, that's exactly how you do it - but luckily, in this case we already have a tiny StableLM model: https://huggingface.co/hf-internal-testing/tiny-random-StableLmForCausalLM (check out the org for the full list of them)
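
For reference, a minimal sketch of how such a tiny random test model could be created (assumptions: a transformers release with native StableLM support, roughly 4.38+; the tiny hyperparameters below are arbitrary and not the ones used for the hf-internal-testing repo):

from transformers import StableLmConfig, StableLmForCausalLM

# Tiny, arbitrary dimensions so the checkpoint stays small.
# The real test repos typically reuse the original tokenizer,
# in which case vocab_size should match it instead.
config = StableLmConfig(
    vocab_size=1024,
    hidden_size=32,
    intermediate_size=64,
    num_hidden_layers=2,
    num_attention_heads=4,
    num_key_value_heads=4,
)

model = StableLmForCausalLM(config)  # randomly initialized weights
model.save_pretrained("tiny-random-StableLmForCausalLM")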

@xenova (Collaborator) commented Mar 2, 2024

Here are some of the larger converted models, which we can test with (e.g. Xenova/stablelm-2-zephyr-1_6b; see the Xenova org on the Hub for the full list).

@xenova (Collaborator) commented Mar 2, 2024

Seems to work alright! 👍

import { pipeline } from '@xenova/transformers';

const generator = await pipeline('text-generation', 'Xenova/stablelm-2-zephyr-1_6b')

const prompt = [{'role': 'user', 'content': 'Tell me a joke'}];
const inputs = generator.tokenizer.apply_chat_template(prompt, { add_generation_prompt: true, tokenize: false });

const output = await generator(inputs, { max_new_tokens: 20 })
console.log(output[0].generated_text);
// "<|user|>\nTell me a joke\n<|assistant|>\nWhy did the tomato turn red?\n\nBecause it saw the salad dressing!"

@D4ve-R (Contributor, Author) commented Mar 3, 2024

Really cool! Will turn this into a PR.

@D4ve-R D4ve-R marked this pull request as ready for review March 3, 2024 14:43
@xenova (Collaborator) commented Mar 3, 2024

Thanks! I think the last thing to do is just export and test with some sequence classifier models. Is that something you'd like to work on?

@D4ve-R (Contributor, Author) commented Mar 4, 2024

Yes I will do that. Just to check, what do you mean by export?

@xenova (Collaborator) commented Mar 4, 2024

what do you mean by export?

Export / convert it to ONNX 👍
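
For context, a minimal sketch of what that export could look like with Optimum's ONNX exporter (assuming an optimum version that includes the StableLM support from huggingface/optimum#1719; the model id and output directory here are just examples):

from optimum.exporters.onnx import main_export

# Export the PyTorch checkpoint to ONNX with KV-cache support,
# so it can be consumed by onnxruntime / transformers.js.
main_export(
    model_name_or_path="stabilityai/stablelm-2-1_6b",
    output="stablelm-2-1_6b-onnx",
    task="text-generation-with-past",
)

In practice, the scripts/convert.py script in this repo (touched in this PR) wraps this export step and also handles quantization.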

@xenova (Collaborator) commented Mar 6, 2024

Looks like there aren't any stablelm text-classification models on the HF Hub (other than your test of course). So I think it will be a good idea to move that to a separate PR and get this merged?

@xenova (Collaborator) left a review comment

TODO: move stablelm text classification to separate PR

@D4ve-R (Contributor, Author) commented Mar 6, 2024

Wow, seems like you can read my mind 😆
You're right, I will move this to another PR!
Here's the response I had typed up but not sent yet:

Little update: since there is currently no StableLM model for text classification on the Hub, I tried training my own.
Unfortunately, I was unable to train it on a 16GB T4. I'll add the code below to train stablelm-2-1_6b on twitter-financial-news-sentiment, in case somebody wants to give it a shot.
I think we should split this into two separate PRs and merge the sequence classification one when a model is available.

import numpy as np
from datasets import load_dataset
from transformers import AutoTokenizer, DataCollatorWithPadding, AutoModelForSequenceClassification, TrainingArguments, Trainer
import evaluate
from huggingface_hub import login
login()

dataset = "zeroshot/twitter-financial-news-sentiment"
model = "stabilityai/stablelm-2-1_6b"

dataset = load_dataset(dataset)
tokenizer = AutoTokenizer.from_pretrained(model)
tokenizer.pad_token = tokenizer.eos_token  # the StableLM tokenizer has no pad token by default
data_collator = DataCollatorWithPadding(tokenizer=tokenizer)
accuracy = evaluate.load("accuracy")

def compute_metrics(eval_pred):
    predictions, labels = eval_pred
    predictions = np.argmax(predictions, axis=1)
    return accuracy.compute(predictions=predictions, references=labels)

def preprocess_function(examples):
    return tokenizer(examples["text"], truncation=True)

tokenized_dataset = dataset.map(preprocess_function, batched=True)
id2label = {0: "Bearish", 1: "Bullish", 2: "Neutral"}
label2id = {"Bearish": 0, "Bullish": 1, "Neutral": 2}

model = AutoModelForSequenceClassification.from_pretrained(
    model_id,
    num_labels=len(id2label),
    id2label=id2label,
    label2id=label2id,
)
model.config.pad_token_id = model.config.eos_token_id

training_args = TrainingArguments(
    output_dir="stablelm-2-1_6b-sentiment",
    learning_rate=2e-5,
    # TODO: will it fit 16GB?
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    num_train_epochs=2,
    weight_decay=0.01,
    evaluation_strategy="epoch",
    save_strategy="epoch",
    # TODO: will it fit 16GB?
    gradient_accumulation_steps=4,
    gradient_checkpointing=True,
    optim="adamw_bnb_8bit",
    load_best_model_at_end=True,
    push_to_hub=True,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_dataset["train"],
    eval_dataset=tokenized_dataset["validation"],
    tokenizer=tokenizer,
    data_collator=data_collator,
    compute_metrics=compute_metrics,
)

trainer.train()
trainer.push_to_hub()
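
Once a fine-tuned checkpoint like that exists, using it would look roughly like this (just a sketch; "stablelm-2-1_6b-sentiment" is the hypothetical output of the training run above, not an existing Hub model):

from transformers import pipeline

# Load the (hypothetical) fine-tuned sequence classification checkpoint
classifier = pipeline("text-classification", model="stablelm-2-1_6b-sentiment")

print(classifier("Shares jumped 10% after the earnings report."))
# e.g. [{'label': 'Bullish', 'score': ...}]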

D4ve-R and others added 4 commits March 6, 2024 14:00
will be added in separate PR

Co-authored-by: Joshua Lochner <admin@xenova.com>
will be added in separate PR

Co-authored-by: Joshua Lochner <admin@xenova.com>
will be added in separate PR

Co-authored-by: Joshua Lochner <admin@xenova.com>
@xenova (Collaborator) left a review comment

naming nits

xenova merged commit 5bb8d25 into huggingface:main on Mar 7, 2024 (1 of 4 checks passed)
@xenova (Collaborator) commented Mar 7, 2024

Merged! Thanks for this @D4ve-R! 🤗

Example: Text generation with Xenova/stablelm-2-zephyr-1_6b.

import { pipeline } from '@xenova/transformers';

// Create text generation pipeline
const generator = await pipeline('text-generation', 'Xenova/stablelm-2-zephyr-1_6b');

// Define the prompt and list of messages
const prompt = "Tell me a funny joke."
const messages = [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user", "content": prompt },
]

// Apply chat template
const inputs = generator.tokenizer.apply_chat_template(messages, {
    tokenize: false,
    add_generation_prompt: true,
});

// Generate text
const output = await generator(inputs, { max_new_tokens: 20 });
console.log(output[0].generated_text);
// "<|system|>\nYou are a helpful assistant.\n<|user|>\nTell me a funny joke.\n<|assistant|>\nHere's a joke for you:\n\nWhy don't scientists trust atoms?\n\nBecause they make up everything!"
