
Add MPT onnx and ORT support #1161

Merged
merged 6 commits into from
Aug 31, 2023

Conversation

jiqing-feng
Contributor

Hi @fxmarty @echarlaix

Related to #1101. This PR adds an ONNX config for MPT models to support generating dummy inputs.

The shape layout of past_key_values can differ across models, so we use the sequence_length member variable of DummyPastKeyValuesGenerator to identify the sequence length of past_key_values.
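To make the shape handling concrete, here is a minimal sketch (plain Python, not the actual Optimum classes; all names and shapes are illustrative) of how a dummy past-key-values generator can expose a `sequence_length` attribute that per-model ONNX configs read back when building KV-cache inputs:

```python
# Illustrative sketch (NOT the real Optimum implementation): a dummy
# past_key_values generator that exposes `sequence_length` so the ONNX
# config can derive per-model KV-cache shapes from it.

class DummyPastKeyValuesGenerator:
    """Minimal stand-in for a generator of dummy KV-cache inputs."""

    def __init__(self, batch_size, sequence_length, num_heads, head_dim, num_layers):
        self.batch_size = batch_size
        self.sequence_length = sequence_length  # the member the PR relies on
        self.num_heads = num_heads
        self.head_dim = head_dim
        self.num_layers = num_layers

    def kv_shape(self):
        # Decoder-only models such as MPT or llama commonly use the layout
        # (batch_size, num_heads, past_sequence_length, head_dim).
        return (self.batch_size, self.num_heads, self.sequence_length, self.head_dim)

    def generate(self):
        # One (key_shape, value_shape) pair per decoder layer.
        return [(self.kv_shape(), self.kv_shape()) for _ in range(self.num_layers)]


gen = DummyPastKeyValuesGenerator(
    batch_size=2, sequence_length=16, num_heads=8, head_dim=64, num_layers=4
)
past = gen.generate()
print(len(past), past[0][0])  # 4 (2, 8, 16, 64)
```

A model whose cache uses a different axis order would only override `kv_shape`, which is the kind of per-model variation the comment above refers to.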

Could you please help review it? Thanks!

cc @changwangss

@fxmarty
Contributor

fxmarty commented Jul 6, 2023

Hi @jiqing-feng, thank you for the PR! I'm personally not in favor of supporting, in Optimum, custom models that otherwise require trust_remote_code=True. I think it would make more sense to host custom ONNX configs in the model repos directly, and allow the main_export function to use a custom ONNX config.

What is your opinion @echarlaix @JingyaHuang @michaelbenayoun ?

I propose #1166 #1143 in this regard, see the API here: https://moon-ci-docs.huggingface.co/docs/optimum/pr_1166/en/exporters/onnx/usage_guides/export_a_model#custom-export-of-transformers-models

@jiqing-feng
Contributor Author

Hi @fxmarty . Thanks for your comment.

The MPT model is being upstreamed into transformers, see #24629. We won't need trust_remote_code=True once that PR is merged.

We can wait for #24629 to be merged and then discuss whether we can move forward with my PR.

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Jul 6, 2023

The documentation is not available anymore as the PR was closed or merged.

@JingyaHuang
Collaborator

@fxmarty @jiqing-feng For ease of maintenance, Optimum should only support configs of transformers-native models. For those with trust_remote_code (chatglm and mpt for now), hosting directly on the Hub sounds very nice to me.

Btw, mpt sounds like a good example to test what you suggest @fxmarty

@fxmarty
Contributor

fxmarty commented Jul 6, 2023

@jiqing-feng Yes, once mpt is merged & released in transformers for sure we can merge this PR in Optimum! My suggestion is more for models with custom modeling code (as mpt is currently).

@jiqing-feng
Contributor Author

Hi @fxmarty. MPT has now been merged into transformers, see #24629.

Could we merge this PR to support the MPT ONNX config?

cc @JingyaHuang

@fxmarty
Contributor

fxmarty commented Aug 1, 2023

Hi @jiqing-feng, for sure we can merge once there is a transformers release! Is the KV cache layout fine?

@jiqing-feng
Contributor Author

> Hi @jiqing-feng, for sure we can merge once there is a transformers release! Is the KV cache layout fine?

Yes! The KV cache layout is the same as in most causal LM models such as llama. We can wait for the release.
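As a concrete illustration of that llama-style layout (shapes written as plain tuples rather than real tensors; the helper name is hypothetical), the past sequence length sits on axis 2 of each cached key:

```python
# Illustrative sketch: with the llama-style KV-cache layout
# (batch, num_heads, past_seq_len, head_dim), the past sequence length
# can be read off axis 2 of any cached key. Shapes are plain tuples.

def past_sequence_length(past_key_values):
    """past_key_values: one (key_shape, value_shape) pair per layer."""
    key_shape, _ = past_key_values[0]
    return key_shape[2]

# A 4-layer cache for batch 1, 8 heads, 10 past tokens, head dim 64.
cache = [((1, 8, 10, 64), (1, 8, 10, 64)) for _ in range(4)]
print(past_sequence_length(cache))  # 10
```

Models with a different axis order (e.g. sequence length on axis 1) would need a different index here, which is why the layout question matters for the export.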

@jiqing-feng
Contributor Author

jiqing-feng commented Aug 29, 2023

Hi @fxmarty @JingyaHuang. I think we can move forward on this PR since the recently released version of transformers contains the MPT model. I see that some checks have failed. Could you tell me where and what kind of tests I should add? Thanks!

@echarlaix
Collaborator

echarlaix commented Aug 29, 2023

> Would you please tell me where and what kind of tests should I add? Thx!

Could you add ONNX Runtime tests as well by adding the model name here and here using a tiny random model like this one?

@jiqing-feng
Contributor Author

> Would you please tell me where and what kind of tests should I add? Thx!

> Could you add ONNX Runtime tests as well by adding the model name here and here using a tiny random model like this one?

Done. Thx!

@jiqing-feng
Contributor Author

Hi @echarlaix @fxmarty @JingyaHuang

Could we merge this PR? The failed checks are not related to my changes, and the mpt model has been released in the latest version of transformers. Thx!

@fxmarty fxmarty changed the title support MPT onnx generate dummy inputs Add MPT onnx and ORT support Aug 31, 2023
Contributor

@fxmarty fxmarty left a comment


@jiqing-feng Thank you for the addition, and apologies for the delay!

@fxmarty fxmarty merged commit 7450ca3 into huggingface:main Aug 31, 2023
63 of 68 checks passed