[TextGeneration][Timer] text gen specific timings + improved timing tooling #1121

bfineran · 2023-07-13T21:19:28Z

This PR adds:

- a bugfix for the TimerManager workflow for engine_forward subtimings
  - because engine_forward is always called in a child thread for batch splitting, it does not have access to the current timer contextvar
  - see the changes to new_timer_context in timer.py for the fix
- a timer.time helper context manager; see its docstring for usage
- subtimings for prefill, generation, and the individual autoregressive passes of each in the text generation pipeline
  - see the example in the test plan for what these timings may look like
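For readers unfamiliar with the contextvar issue described above, here is a minimal, self-contained sketch of the general problem and one common remedy. It is not the code from timer.py: `StageTimer`, `_current_timer`, and `run_in_child_thread` are hypothetical names, and copying the context with `contextvars.copy_context()` is just one standard way to make a value visible in a worker thread. The actual change in new_timer_context may work differently.

```python
import contextvars
from concurrent.futures import ThreadPoolExecutor
from contextlib import contextmanager
from time import perf_counter


class StageTimer:
    """Hypothetical stand-in for a per-inference timer (not the deepsparse class)."""

    def __init__(self):
        self.stages = {}

    @contextmanager
    def time(self, stage):
        # Accumulate the wall-clock duration of the wrapped block under `stage`.
        start = perf_counter()
        yield
        self.stages[stage] = self.stages.get(stage, 0.0) + perf_counter() - start


# Hypothetical context variable holding the timer of the current inference call.
_current_timer = contextvars.ContextVar("current_timer", default=None)


def engine_forward():
    # Runs in a worker thread; reports whether a timer was visible there.
    timer = _current_timer.get()
    if timer is None:
        return False
    with timer.time("engine_forward"):
        sum(range(10_000))  # placeholder for the real engine call
    return True


def run_in_child_thread(propagate):
    timer = StageTimer()
    token = _current_timer.set(timer)
    try:
        with ThreadPoolExecutor(max_workers=1) as pool:
            if propagate:
                # Snapshot the caller's context and run the work inside it.
                ctx = contextvars.copy_context()
                found = pool.submit(ctx.run, engine_forward).result()
            else:
                # On standard CPython builds the worker starts with an empty
                # context, so the timer set above is typically not visible.
                found = pool.submit(engine_forward).result()
    finally:
        _current_timer.reset(token)
    return found, timer.stages


if __name__ == "__main__":
    print(run_in_child_thread(propagate=False))
    print(run_in_child_thread(propagate=True))
```

With `propagate=True` the worker records an `engine_forward` entry on the caller's timer, which is the behavior the subtimings depend on.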

test_plan:
Tested manually for now. Once we have more text gen pipeline tests, we can add assertions that the expected stages are recorded in the timer manager:

# running text gen / opt pipeline
from deepsparse import Pipeline
opt = Pipeline.create(task="opt", model_path=MODEL_PATH, engine_type="onnxruntime", max_generated_tokens=128)
output = opt(sequences="Who is the president of the United States?", return_logits=True)

print(opt.timer_manager)
print(opt.timer_manager.stages)

Manager and stages:

TimerManager({'engine_token_generation': 4.507111581042409, 'engine_prompt_prefill_single': 0.03916543936356902, 'engine_prompt_prefill': 0.3927434356883168, 'pre_process': 0.001995994709432125, 'engine_token_generation_single': 0.035451228459050334, 'engine_forward': 4.909945313818753, 'total_inference': 4.912277169525623, 'post_process': 0.00030374620109796524})

['engine_token_generation', 'engine_prompt_prefill_single', 'engine_prompt_prefill', 'pre_process', 'engine_token_generation_single', 'engine_forward', 'total_inference', 'post_process']
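The stage assertions mentioned in the test plan could look roughly like the sketch below. The stage names are taken from the output above; the helper `missing_stages` and the constant `EXPECTED_TEXT_GEN_STAGES` are hypothetical and not part of deepsparse.

```python
# Stage names copied from the manual run above.
EXPECTED_TEXT_GEN_STAGES = {
    "pre_process",
    "engine_forward",
    "engine_prompt_prefill",
    "engine_prompt_prefill_single",
    "engine_token_generation",
    "engine_token_generation_single",
    "post_process",
    "total_inference",
}


def missing_stages(stages, expected=EXPECTED_TEXT_GEN_STAGES):
    """Return the expected stage names absent from `stages`, sorted (hypothetical helper)."""
    return sorted(set(expected) - set(stages))


if __name__ == "__main__":
    recorded = [
        "engine_token_generation",
        "engine_prompt_prefill_single",
        "engine_prompt_prefill",
        "pre_process",
        "engine_token_generation_single",
        "engine_forward",
        "total_inference",
        "post_process",
    ]
    print(missing_stages(recorded))
```

In a future pipeline test this could be used as `assert not missing_stages(opt.timer_manager.stages)`, assuming the pipeline has been run once as in the test plan.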

src/deepsparse/transformers/pipelines/text_generation.py

rahul-tuli

LGTM pending response to comments!

src/deepsparse/transformers/pipelines/text_generation.py

src/deepsparse/utils/timer.py

src/deepsparse/transformers/pipelines/text_generation.py

bfineran · 2023-07-19T14:03:50Z

review suggestions LGTM

bfineran requested review from mgoin, markurtz and dbogunowicz July 13, 2023 21:19

bfineran self-assigned this Jul 13, 2023

dbogunowicz reviewed Jul 14, 2023

src/deepsparse/transformers/pipelines/text_generation.py (outdated)

dbogunowicz previously approved these changes Jul 14, 2023

bfineran dismissed dbogunowicz’s stale review via 27fec25 July 18, 2023 13:21

dbogunowicz previously approved these changes Jul 18, 2023

rahul-tuli reviewed Jul 18, 2023

rahul-tuli dismissed dbogunowicz’s stale review via f7572f9 July 21, 2023 13:45

bfineran and others added 3 commits July 21, 2023 09:47

[TextGeneration][Timer] text gen specific timings + improved timing t…

f6e7a87

…ooling

review suggestion - names to dataclass

3ad06e3

Add types to _TextGenerationTimings attributes

92b2fac

Revert to using timer.time for `TOKEN_GENERATION`
Remove finally clause from `contextmanagers`
Address review comments from @rahul-tuli

rahul-tuli force-pushed the text-gen-timings branch from f7572f9 to 92b2fac Compare July 21, 2023 13:50

rahul-tuli approved these changes Jul 21, 2023

Merge branch 'main' into text-gen-timings

ed82c33

dbogunowicz approved these changes Jul 23, 2023

Merge branch 'main' into text-gen-timings

54e3f03

dbogunowicz merged commit 29a8f68 into main Jul 24, 2023
7 checks passed

dbogunowicz deleted the text-gen-timings branch July 24, 2023 10:52
