[Pipeline Refactor] Migration #1460

dsikka · 2023-12-06T20:51:29Z

Summary

Branch to update pathways such that the new text generation pipeline can be used
All new pipeline components and updated pipelines (text_generation, image_classification) were moved from the v2 pathway and are now the default pipelines that will be used
The old files have been moved to a legacy folder under src. Old text_generation and image_classification folders were also moved to legacy subfolders in their respective modules
To make it easier, text_generation schemas were moved to a separate folder under transformers/schemas making it easy for both the new and old pipelines to pull them in

Testing

You can load the new pipelines using the normal Pipeline.create(...) method.
If the pipeline has not been registered using the new registry/migrated to use the new framework, you can use Pipeline.create(...) as well. This will use the legacy pipeline class under the hood.
To use the legacy pipeline (old text generation and old image classification) which have already been migrated, use have to use the legacy Pipeline under legacy/pipeline.py

All 3 examples are shown below.

Example:

Run the new text generation pipeline (with continuous batching, if that's what your heart desires):

from deepsparse import Pipeline
from deepsparse.transformers.schemas.text_generation_schemas import TextGenerationInput

pipeline = Pipeline.create(
    task="text_generation",
    model_path=model_path,
    engine_type="deepsparse",
    internal_kv_cache=False,
    continuous_batch_sizes=[2, 4]
)

prompts = [["Hello there!", "The sun shined bright", "The dog barked"]]
for i in range(len(prompts)):
    input_value = TextGenerationInput(
        prompt=prompts[i],
        generation_kwargs={
            "num_return_sequences": 4,
            "max_new_tokens": 20,
            "do_sample": True,
        },
    )
    output = pipeline(input_value)
    for i in output.generations:
        print(i)
        print("\n")

Run the old text_generation pipeline:

from deepsparse.legacy.pipeline import Pipeline
from deepsparse.transformers.schemas.text_generation_schemas import TextGenerationInput

model_path = "hf:neuralmagic/mpt-7b-chat-pruned50-quant"
pipeline = Pipeline.create(
    task="text_generation",
    model_path=model_path,
    engine_type="deepsparse",
    internal_kv_cache=True,
)

prompts = [["Hello there!", "The sun shined bright", "The dog barked"]]
input_value = TextGenerationInput(
    prompt=prompts[0],
    generation_kwargs={
        "num_return_sequences": 4,
        "max_new_tokens": 20,
        "do_sample": True,
    },
)

output = pipeline(input_value)
for i in output.generations:
    print(i)
    print("\n")

Run any pipeline that has not yet been migrated to use the new `Pipeline` class/framework

from deepsparse import Pipeline

sa_pipeline = Pipeline.create(
    task="sentiment-analysis",
    model_path="zoo:bert-large-sst2_wikipedia_bookcorpus-pruned90_quantized"
)

inference = sa_pipeline("I love it!")

Next Steps

Some of the tests needs to be updated to reflect the new pipeline changes (example: test_pipeline.py and test_dynamic_import.py). Right now they are testing the legacy pipeline.
To reflect then new text generation pipeline, test_text_generation.py needs to be updated. It is currently testing the legacy pipeline.
Update PIpeline.to_config/Pipeline.from_config such that new pipelines can be loaded in the server. Right now, only old pipelines can run on the server

dbogunowicz

Few general inquires to @dsikka / @bfineran

How in the future will the full "retirement" of V1 look like?
I understand that once this PR lands, we stop any development of legacy code
There are still two functionalities for V2 pipelines that need to land from my side: non-KV cache pipeline (ready for review) and streaming (WiP). Also there are small differences between V1 and V2 text generation pipeline (e.g. [Text Generation] Terminate the inference when kv cache is full #1446). When do we want to get those in ASAP, to assert that V1 and V2 are identical?

src/deepsparse/image_classification/validation_script.py

src/deepsparse/operators/operator.py

tests/deepsparse/pipelines/test_clip.py

tests/deepsparse/transformers/pipelines/test_text_generation.py

tests/deepsparse/evaluation/test_utils.py

bfineran

LGTM pending tests passing and confirmation that user facing scripts run as expected - examples look great

src/deepsparse/transformers/pipelines/code_generation.py

…line

update pathways to use new v2 pipeline

5abfc26

dsikka force-pushed the update_pathways branch from e07d49a to 5abfc26 Compare December 6, 2023 20:58

dsikka added 2 commits December 6, 2023 22:15

fix image classification

e049272

quality

c3191be

dsikka requested review from bfineran and dbogunowicz December 6, 2023 23:00

leftover fixes

e697bd3

dbogunowicz requested changes Dec 7, 2023

View reviewed changes

dsikka added 2 commits December 7, 2023 16:41

fix batch sizes:

0a570e7

Merge branch 'main' into update_pathways

7aaa6a0

dsikka requested a review from dbogunowicz December 7, 2023 16:42

dsikka added 10 commits December 7, 2023 17:08

quality; use old tasks for server

ef6a49c

fix server tests

e7b38b2

fix captioning

8df9c78

fix init for haystack

469d101

fix import

e4079d6

fix another import

4489c50

fix test

15f8c0e

fix init file

99cd69f

fix test

d0cccd5

fix remaining base tests

edf7d0d

dbogunowicz previously approved these changes Dec 8, 2023

View reviewed changes

tests/deepsparse/evaluation/test_utils.py Show resolved Hide resolved

fix docstring

ffd8083

dsikka dismissed dbogunowicz’s stale review via ffd8083 December 8, 2023 15:38

quality

6c42956

dsikka requested a review from dbogunowicz December 8, 2023 15:47

bfineran previously approved these changes Dec 8, 2023

View reviewed changes

src/deepsparse/transformers/pipelines/code_generation.py Outdated Show resolved Hide resolved

update codegen alias to use the new registry and text generation pipe…

8c71397

…line

dsikka dismissed bfineran’s stale review via 8c71397 December 8, 2023 16:42

dsikka requested a review from bfineran December 8, 2023 16:42

bfineran approved these changes Dec 8, 2023

View reviewed changes

Merge branch 'main' into update_pathways

cbd20cb

dbogunowicz approved these changes Dec 11, 2023

View reviewed changes

dsikka merged commit 23096ef into main Dec 11, 2023
13 checks passed

dsikka deleted the update_pathways branch December 11, 2023 15:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Pipeline Refactor] Migration #1460

[Pipeline Refactor] Migration #1460

dsikka commented Dec 6, 2023 •

edited

Loading

dbogunowicz left a comment •

edited

Loading

bfineran left a comment

[Pipeline Refactor] Migration #1460

[Pipeline Refactor] Migration #1460

Conversation

dsikka commented Dec 6, 2023 • edited Loading

Summary

Testing

Example:

Run the new text generation pipeline (with continuous batching, if that's what your heart desires):

Run the old text_generation pipeline:

Run any pipeline that has not yet been migrated to use the new Pipeline class/framework

Next Steps

dbogunowicz left a comment • edited Loading

Choose a reason for hiding this comment

bfineran left a comment

Choose a reason for hiding this comment

dsikka commented Dec 6, 2023 •

edited

Loading

Run any pipeline that has not yet been migrated to use the new `Pipeline` class/framework

dbogunowicz left a comment •

edited

Loading