[BT] add decoder benchmark script #857
Conversation
    max_new_tokens=max_token,
    use_cache=False,
Why use_cache=False? I'd rather benchmark with the cache enabled.
Also, can we rather set min_length=length, max_length=length? So that we can generate a specific length.
I think it's because max_length is deprecated; yes, we can add min_length. Let me try.
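A minimal sketch of the change being discussed, assuming the script's `max_token` variable: pin the generated length with min_new_tokens/max_new_tokens (available in recent transformers releases, unlike the deprecated max_length) and keep the KV cache on, as the reviewer suggests:

```python
from transformers import GenerationConfig

# Sketch only; `max_token` is assumed to come from the benchmark script's arguments.
gen_config = GenerationConfig(
    min_new_tokens=max_token,  # lower bound on generated tokens
    max_new_tokens=max_token,  # upper bound; together they fix the output length
    use_cache=True,            # benchmark with the KV cache enabled
)
```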
    max_new_tokens=max_token,
    use_cache=False,
)
_ = hf_model.generate(input_ids, generation_config=gen_config)
the input_ids are not defined, no?
Does it make sense to pass attention_mask as well? Not sure.
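In case it helps, a self-contained sketch of how input_ids and an attention_mask could be prepared before the generate call; the checkpoint, prompts, and generation length below are illustrative assumptions, not taken from this PR:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, GenerationConfig

# Hypothetical checkpoint and prompts, just to make the sketch runnable.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default
hf_model = AutoModelForCausalLM.from_pretrained("gpt2").to("cuda")

inputs = tokenizer(["Hello world", "A longer prompt"], return_tensors="pt", padding=True)
input_ids = inputs.input_ids.to("cuda")
attention_mask = inputs.attention_mask.to("cuda")  # masks out the padding tokens

gen_config = GenerationConfig(max_new_tokens=64, use_cache=True, pad_token_id=tokenizer.pad_token_id)
_ = hf_model.generate(input_ids, attention_mask=attention_mask, generation_config=gen_config)
```

Passing the attention_mask matters as soon as the batch contains padded sequences; without it, generate falls back to assuming all tokens are real.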
start_event = torch.cuda.Event(enable_timing=True)
end_event = torch.cuda.Event(enable_timing=True)
start_event.record()
for _ in range(num_batches):
    _ = model(input_ids, masks)
    if is_decoder:
        _ = model.generate(input_ids, generation_config=generation_config)
the input_ids are not defined, no?
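For reference, a hedged sketch of how this timing loop could be wired up: CUDA events only measure correctly if the device is synchronized before reading elapsed_time, and `input_ids`, `masks`, `num_batches`, `is_decoder`, and `generation_config` are assumed to come from the script's setup. Splitting the forward pass and generate into an if/else is an assumption about the intent, not the PR's code:

```python
import torch

torch.cuda.synchronize()  # don't let pending GPU work leak into the measurement
start_event = torch.cuda.Event(enable_timing=True)
end_event = torch.cuda.Event(enable_timing=True)
start_event.record()
for _ in range(num_batches):
    if is_decoder:
        _ = model.generate(input_ids, attention_mask=masks, generation_config=generation_config)
    else:
        _ = model(input_ids, masks)
end_event.record()
torch.cuda.synchronize()  # wait for end_event before reading the elapsed time
latency_ms = start_event.elapsed_time(end_event) / num_batches  # elapsed_time is in ms
```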
The documentation is not available anymore as the PR was closed or merged.
Feel free to merge once you think it's good. I think we should use use_cache=True to benchmark.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
What does this PR do?
This PR adds the benchmark script adapted for decoder-based models as well, to benchmark the speedup we obtain with torch.sdpa and transformers models.
cc @fxmarty
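For background (not code from this PR), the kernel whose speedup is being benchmarked is PyTorch 2.0's fused torch.nn.functional.scaled_dot_product_attention; a minimal standalone illustration:

```python
import torch
import torch.nn.functional as F

# Shapes are (batch, num_heads, seq_len, head_dim); fp16 on GPU hits the fused kernels.
q = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float16)

out = F.scaled_dot_product_attention(q, k, v, is_causal=True)  # fused causal attention
```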