use max sequence length for tokenization #1166

horheynm · 2023-08-04T01:59:05Z

Fix bug for https://neuralmagic.slack.com/archives/C0592FX3215/p1687210067995879

Example:

from deepsparse import Pipeline


path = "path"

pipeline = Pipeline.create(
    task = "text-classification",
    model_path = path,
    batch_size=8,
    num_cores=None,
    sequence_length = [2, 128],
)

text = "We are flying from Texas to California"
pipeline(text)

Before:

(.venv) ubuntu@quad-mle-2:~/george/nm/deepsparse$ python3 scratch/_p.py 
...
2023-08-04 01:19:39 __main__     INFO     Overwriting in-place the input shapes of the transformer model at /home/ubuntu/.cache/sparsezoo/bert-large-squad_wikipedia_bookcorpus-pruned80.4block_quantized/bert-large-squad_wikipedia_bookcorpus-pruned80.4block_quantized/model.onnx
Token indices sequence length is longer than the specified maximum sequence length for this model (9 > 2). Running this sequence through the model will result in indexing errors

After"

(.venv) ubuntu@quad-mle-2:~/george/nm/deepsparse$ python3 scratch/_p.py 
...
2023-08-04 01:25:44 __main__     INFO     Overwriting in-place the input shapes of the transformer model at /home/ubuntu/.cache/sparsezoo/bert-large-squad_wikipedia_bookcorpus-pruned80.4block_quantized/bert-large-squad_wikipedia_bookcorpus-pruned80.4block_quantized/model.onnx

src/deepsparse/transformers/pipelines/pipeline.py

stale

mgoin

nice!

horheynm marked this pull request as ready for review August 4, 2023 02:21

mgoin previously requested changes Aug 4, 2023

View reviewed changes

src/deepsparse/transformers/pipelines/pipeline.py Outdated Show resolved Hide resolved

select largest tokenizer

7a7df94

horheynm force-pushed the max-bucket-tokenization branch from 7823c07 to 7a7df94 Compare August 8, 2023 15:57

bfineran approved these changes Aug 8, 2023

View reviewed changes

mgoin approved these changes Aug 9, 2023

View reviewed changes

mgoin merged commit cf9864b into main Aug 9, 2023
7 checks passed

mgoin deleted the max-bucket-tokenization branch August 9, 2023 13:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use max sequence length for tokenization #1166

use max sequence length for tokenization #1166

horheynm commented Aug 4, 2023 •

edited

Loading

mgoin left a comment

use max sequence length for tokenization #1166

use max sequence length for tokenization #1166

Conversation

horheynm commented Aug 4, 2023 • edited Loading

mgoin left a comment

Choose a reason for hiding this comment

horheynm commented Aug 4, 2023 •

edited

Loading