[Text Generation][Enhancement] If `prompt_processing_sequence_length` == 1, do not inititalize multitoken_engine #1214

dbogunowicz · 2023-08-29T06:03:38Z

A simple improvement that streamlines the pipeline.

If prompt_processing_sequence_length == 1, we are essentially running single-token prompt prefill , so we should not be initializing and running the additional engine.

src/deepsparse/transformers/pipelines/text_generation.py

mgoin

awesome! a special case i'll take advantage of :)

initial commit

649f600

dbogunowicz changed the title ~~Feature/damian/prompt processing one~~ [Text Generation][Enhancement] If prompt_processing_sequence_length == 1, do not inititalize multitoken_engine Aug 29, 2023

dbogunowicz requested review from bfineran, tlrmchlsmth, Satrat and dsikka August 29, 2023 06:05

tlrmchlsmth reviewed Aug 29, 2023

View reviewed changes

src/deepsparse/transformers/pipelines/text_generation.py Show resolved Hide resolved

mgoin approved these changes Aug 29, 2023

View reviewed changes

bfineran approved these changes Aug 30, 2023

View reviewed changes

dbogunowicz merged commit d8b63da into main Aug 30, 2023
7 checks passed

dbogunowicz deleted the feature/damian/prompt_processing_one branch August 30, 2023 16:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Text Generation][Enhancement] If `prompt_processing_sequence_length` == 1, do not inititalize multitoken_engine #1214

[Text Generation][Enhancement] If `prompt_processing_sequence_length` == 1, do not inititalize multitoken_engine #1214

dbogunowicz commented Aug 29, 2023 •

edited

Loading

mgoin left a comment •

edited

Loading

[Text Generation][Enhancement] If prompt_processing_sequence_length == 1, do not inititalize multitoken_engine #1214

[Text Generation][Enhancement] If prompt_processing_sequence_length == 1, do not inititalize multitoken_engine #1214

Conversation

dbogunowicz commented Aug 29, 2023 • edited Loading

mgoin left a comment • edited Loading

Choose a reason for hiding this comment

[Text Generation][Enhancement] If `prompt_processing_sequence_length` == 1, do not inititalize multitoken_engine #1214

[Text Generation][Enhancement] If `prompt_processing_sequence_length` == 1, do not inititalize multitoken_engine #1214

dbogunowicz commented Aug 29, 2023 •

edited

Loading

mgoin left a comment •

edited

Loading