Fix evaluation code to improve performance #2421
Comments
Hi @vigneshwaran, thanks for discovering this! Please make a pull request with your changes!
Great find! If you can open a PR, we'd love to accept it :)
I have opened a PR with my changes: #2428
I have been finding that llm-foundry's eval takes much longer than lm-evaluation-harness. After digging into the code, I found that padding tokens are appended up to the tokenizer's maximum length:
https://github.com/bmosaicml/composer/blob/1011f90f2653dae103c3837c968071e399b1decc/composer/datasets/in_context_learning_evaluation.py#L418C1-L428C59
My proposal: instead of padding to max_seq_len, pad only to the maximum length within each batch.
This improved latency by roughly 4x (400%) when I used a sequence length of 2048; the improvement would be even larger for models trained with longer sequence lengths.
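For illustration, here is a minimal sketch of the idea; the function name and signature are hypothetical and not the actual `in_context_learning_evaluation.py` code:

```python
# Sketch of the proposed change: pad each evaluation batch only to the
# longest example in that batch, instead of to the tokenizer's max_seq_len.
# Names and the collate signature are illustrative, not the composer API.
import torch


def collate_to_batch_max(token_id_lists, pad_token_id, max_seq_len=2048):
    """Pad a list of token-id lists to the batch maximum length."""
    # Respect the model's context window.
    token_id_lists = [ids[:max_seq_len] for ids in token_id_lists]
    # Key change: pad to the longest sequence in this batch, not max_seq_len.
    batch_max = max(len(ids) for ids in token_id_lists)
    input_ids, attention_mask = [], []
    for ids in token_id_lists:
        pad_len = batch_max - len(ids)
        input_ids.append(ids + [pad_token_id] * pad_len)
        attention_mask.append([1] * len(ids) + [0] * pad_len)
    return {
        "input_ids": torch.tensor(input_ids, dtype=torch.long),
        "attention_mask": torch.tensor(attention_mask, dtype=torch.long),
    }
```

Since short examples no longer carry hundreds of padding tokens, the attention computation per batch shrinks to the batch's actual maximum length, which is where the speedup comes from.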