I'm evaluating the model on a relatively large dataset (single question, single answer). I was able to fine-tune the Bunny-1.1-Llama-3-8B-V model using one of the scripts provided. What is the best strategy to implement batch inference?
Sorry, we don't currently support batch inference. You could split the dataset into multiple parts and launch a model instance on each GPU, as we do when evaluating on VQA, GQA, and SEED-Bench.
Note, however, that we do not set the attention_mask of left-padding tokens to 0. The attention_mask of the inputs is all 1s, so the outputs may differ slightly from single-sample inference.
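The splitting step can be sketched as a simple chunking helper (a minimal, framework-free illustration; the function name and the idea of one contiguous chunk per GPU are my assumptions, not part of the Bunny evaluation scripts):

```python
def split_chunks(items, n_parts):
    """Split `items` into `n_parts` near-equal contiguous chunks,
    e.g. one chunk per GPU/process."""
    k, r = divmod(len(items), n_parts)
    chunks, start = [], 0
    for i in range(n_parts):
        # The first `r` chunks get one extra item so sizes differ by at most 1.
        end = start + k + (1 if i < r else 0)
        chunks.append(items[start:end])
        start = end
    return chunks

# Example: 10 samples over 4 GPUs -> chunk sizes [3, 3, 2, 2]
sizes = [len(c) for c in split_chunks(list(range(10)), 4)]
```

Each chunk can then be written to its own question file and passed to a separate inference process, with the per-GPU answer files merged afterwards.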
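For reference, correct left padding would zero the mask over pad positions. Below is a minimal, library-free sketch of left-padding a batch of token-id sequences and building the corresponding attention_mask (the function name and `pad_id` value are illustrative assumptions, not Bunny code):

```python
def left_pad_batch(sequences, pad_id=0):
    """Left-pad variable-length token-id lists to a common length and
    build an attention mask: 0 over padding, 1 over real tokens."""
    max_len = max(len(s) for s in sequences)
    input_ids, attention_mask = [], []
    for seq in sequences:
        n_pad = max_len - len(seq)
        # Padding goes on the LEFT so generation continues from real tokens.
        input_ids.append([pad_id] * n_pad + list(seq))
        attention_mask.append([0] * n_pad + [1] * len(seq))
    return input_ids, attention_mask

# Example: two prompts of different lengths.
ids, mask = left_pad_batch([[5, 6, 7], [8, 9]], pad_id=0)
```

With a Hugging Face tokenizer, setting `tokenizer.padding_side = "left"` and passing the returned `attention_mask` to `model.generate` achieves the same effect, which should remove the small discrepancy from single-sample inference.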