Request for ONNX Conversion Script #6

Closed
godiclee opened this issue Jun 25, 2024 · 4 comments
Labels: enhancement (New feature or request)

@godiclee

Thank you for your amazing work on the project. Could you kindly provide a script to convert the SigClip-large model (specifically the image encoder) to ONNX format? I would greatly appreciate your assistance with this.

rhysdg self-assigned this on Jun 25, 2024
rhysdg added the enhancement (New feature or request) label on Jun 25, 2024
@rhysdg (Owner) commented Jun 25, 2024

Hey there @godiclee! I'm glad it's all proving of use to you. For sure, you just need to leverage Hugging Face's AutoProcessor and use opset_version=13, like so:

import requests
import torch
from PIL import Image
from transformers import AutoProcessor, AutoModel

variant = "google/siglip-large-patch16-384"
model = AutoModel.from_pretrained(variant).eval()
processor = AutoProcessor.from_pretrained(variant)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
texts = ["a photo of 2 cats", "a photo of 2 dogs"]
inputs = processor(text=texts, images=image, padding="max_length", return_tensors="pt")


inputs_list = ['input_ids', 'pixel_values']
outputs = [
    'logits_per_image',
    'logits_per_text',
    'image_embeds',
    'text_embeds',
    'text_model_hidden',
    'text_model_pooler',
    'vision_model_hidden',
    'vision_model_pooler',
]

dynamic_axes = {
    'input_ids': {0: 'text_batch_size', 1: 'sequence_length'},
    'pixel_values': {0: 'image_batch_size', 1: 'num_channels', 2: 'height', 3: 'width'},
    'logits_per_image': {0: 'image_batch_size', 1: 'text_batch_size'},
    'logits_per_text': {0: 'text_batch_size', 1: 'image_batch_size'},
    'image_embeds': {0: 'image_batch_size'},
    'text_embeds': {0: 'text_batch_size'},
}


# export
torch.onnx.export(
    model,
    (inputs['input_ids'], inputs['pixel_values']),
    "siglip_large/siglip-large.onnx",
    export_params=True,
    input_names=inputs_list,
    output_names=outputs,
    dynamic_axes=dynamic_axes,
    do_constant_folding=True,
    opset_version=13,
)
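
Since the original request was specifically for the image encoder, here is a minimal sketch of my own for exporting just the vision tower, building on the snippet above. It assumes model.vision_model accepts pixel_values and that tracing flattens its output into (last_hidden_state, pooler_output); verify the output order on your transformers version before relying on it:

# Sketch: image-encoder-only export (assumes model.vision_model takes
# pixel_values and returns last_hidden_state plus pooler_output).
torch.onnx.export(
    model.vision_model,
    (inputs['pixel_values'],),
    "siglip_large/siglip-large-vision.onnx",
    export_params=True,
    input_names=['pixel_values'],
    output_names=['last_hidden_state', 'pooler_output'],
    dynamic_axes={'pixel_values': {0: 'image_batch_size'}},
    do_constant_folding=True,
    opset_version=13,
)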

I've yet to formalise it in here as I'm trying to strip back the need for the transformers library for lightweight deployment on Jetson boards, Raspberry Pi, etc., but I'll likely have an export repo up and running soon. It also looks like full support is on its way at optimum too.
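
On the transformers-free angle, the preprocessing is simple enough to replicate by hand. A minimal sketch, assuming this variant's processor defaults (384x384 bicubic resize, rescale to [0, 1], then normalise with mean = std = 0.5 per channel); worth double-checking against processor.image_processor before trusting it:

import numpy as np
from PIL import Image

def preprocess(image, size=384):
    # Assumed SigLIP defaults: bicubic resize, rescale to [0, 1],
    # then (x - 0.5) / 0.5 per channel.
    image = image.convert("RGB").resize((size, size), Image.BICUBIC)
    pixels = np.asarray(image, dtype=np.float32) / 255.0
    pixels = (pixels - 0.5) / 0.5
    return pixels.transpose(2, 0, 1)[None, ...]  # HWC -> NCHW, add batch dim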

@rhysdg (Owner) commented Jun 25, 2024

^ Notice that I'm writing the export into a separate folder too: with a model that's 2 GB+ the weights have to stay external to the .onnx file, otherwise you'll run into a protobuf error (protobuf can't serialise a single file over 2 GB).
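
If you want a tidier layout, the onnx package can consolidate the scattered external tensors into a single sidecar file after export. A sketch (weights.bin is just a name I've picked):

import onnx

# Reload the exported graph and rewrite all external tensors into one file.
m = onnx.load("siglip_large/siglip-large.onnx")
onnx.save_model(
    m,
    "siglip_large/siglip-large.onnx",
    save_as_external_data=True,
    all_tensors_to_one_file=True,
    location="weights.bin",  # written next to the .onnx file
    size_threshold=1024,
)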

I've verified with Netron and the following snippet, and all is well ;)

import scipy.special
import onnxruntime

session = onnxruntime.InferenceSession(
    'siglip_large/siglip-large.onnx',
    providers=onnxruntime.get_available_providers())

# Reuses processor, texts, and image from the export snippet above
inputs = processor(text=texts, images=image, padding="max_length", return_tensors="np")

res = session.run(None, {'input_ids': inputs['input_ids'],
                         'pixel_values': inputs['pixel_values']})[0]

# SigLIP scores with a sigmoid rather than a softmax
res = scipy.special.expit(res)
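
As an extra sanity check (a sketch of my own, beyond what's in the thread), the ONNX probabilities can be compared against the eager PyTorch model:

import numpy as np
import torch

# SigLIP applies a sigmoid to logits_per_image, so compare post-sigmoid.
with torch.no_grad():
    pt = model(**processor(text=texts, images=image,
                           padding="max_length", return_tensors="pt"))
print(np.allclose(res, torch.sigmoid(pt.logits_per_image).numpy(), atol=1e-3))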

@rhysdg rhysdg pinned this issue Jun 25, 2024
@godiclee (Author)

Thanks a lot! It works well.

@rhysdg (Owner) commented Jul 3, 2024

> Thanks a lot! It works well.

Glad to hear it!
