
add basic support for internvl2-4b #11718

Merged
merged 1 commit into intel-analytics:main on Aug 6, 2024

Conversation

MeouSker77 (Contributor) commented Aug 6, 2024

Description

add basic support for internvl2-4b

1. Why the change?

2. User API changes

Tested with transformers 4.38.0.

A simple example:

# requires: pip install lmdeploy timm
import time
import torch

model_path = 'InternVL2-4B'

from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

from ipex_llm.transformers import AutoModelForCausalLM
# use "sym_int8" for better output, use "sym_int4" for better speed
model = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True,
                                             load_in_low_bit="sym_int8", modules_to_not_convert=["vision_model"])
model = model.half().eval()
model = model.to('xpu')

from lmdeploy.vl import load_image
from transformers import CLIPImageProcessor
image_processor = CLIPImageProcessor.from_pretrained(model_path)
# image = load_image('https://github.com/raw/open-mmlab/mmdeploy/main/tests/data/tiger.jpeg')
image = load_image("tiger.jpeg")
pixel_values = image_processor(images=[image], return_tensors='pt').pixel_values
pixel_values = pixel_values.to('xpu')

generation_config = {
    "max_new_tokens": 32,
    "do_sample": False,
}

question = "<image>What's in the picture?"

with torch.inference_mode():
    for i in range(5):
        st = time.time()
        response = model.chat(
            tokenizer=tokenizer,
            pixel_values=pixel_values,
            question=question,
            generation_config=generation_config,
            history=[]
        )
        et = time.time()
        print(response)
        print(et - st)
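The loop above prints five wall-clock timings; the first iteration typically includes one-off warmup cost (kernel compilation, weight transfer to the XPU). A small hypothetical helper, not part of this PR, that discards warmup runs and averages the rest:

```python
import time

def timed(fn, warmup=1, runs=4):
    """Average wall-clock time of fn() after discarding warmup runs."""
    for _ in range(warmup):
        fn()  # warmup iterations are executed but not measured
    samples = []
    for _ in range(runs):
        st = time.time()
        fn()
        samples.append(time.time() - st)
    return sum(samples) / len(samples)
```

For example, `timed(lambda: model.chat(...), warmup=1, runs=4)` would report the steady-state latency rather than the cold-start one.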

see this for more examples and usage

For now, running the model through lmdeploy itself is not supported; lmdeploy is only used above for its load_image helper.
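As an aside on the preprocessing step: the CLIPImageProcessor call above essentially rescales pixels to [0, 1] and normalizes with the standard CLIP channel statistics before producing pixel_values. A rough, hypothetical sketch of that normalization (the real processor also resizes and crops, and InternVL2's checkpoint config may override these values):

```python
import numpy as np

# Standard CLIP normalization constants (assumption: the checkpoint's
# preprocessor_config.json uses these defaults)
CLIP_MEAN = np.array([0.48145466, 0.4578275, 0.40821073])
CLIP_STD = np.array([0.26862954, 0.26130258, 0.27577711])

def normalize(img_hwc_uint8):
    """Rescale a HxWx3 uint8 image to [0, 1], normalize per channel,
    and return it as CHW, roughly as return_tensors='pt' would."""
    x = img_hwc_uint8.astype(np.float32) / 255.0
    x = (x - CLIP_MEAN) / CLIP_STD
    return x.transpose(2, 0, 1)  # HWC -> CHW
```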

3. Summary of the change

4. How to test?

  • N/A
  • Unit test: Please manually trigger the PR Validation here by inputting the PR number (e.g., 1234). And paste your action link here once it has been successfully finished.
  • Application test
  • Document test
  • ...

@MeouSker77 MeouSker77 merged commit f44b732 into intel-analytics:main Aug 6, 2024
1 check passed
@MeouSker77 MeouSker77 deleted the support-internvl2-4b branch August 6, 2024 05:36