AutoModel abstraction fails for pre-training initialization #11953

g-karthik · 2021-05-31T00:20:15Z

Environment info

transformers version: 4.5.1
Python version: 3.6
PyTorch version: 1.4+
Using GPU in script?: No
Using distributed or parallel set-up in script?: No

Who can help

@patrickvonplaten, @LysandreJik

Information

Model I am using: GPT-2

The problem arises when using:

[Y] my own modified scripts: (give details below)

To reproduce

Steps to reproduce the behavior:

from transformers import AutoConfig, AutoModelForCausalLM, GPT2LMHeadModel
config = AutoConfig.from_pretrained("gpt2", return_dict=True, gradient_checkpointing=False)

model_class = GPT2LMHeadModel
model = model_class(config)  # WORKS FINE

model_class = AutoModelForCausalLM
model = model_class(config)  # FAILS, stack trace below

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: __init__() takes 1 positional argument but 2 were given

Expected behavior

Both cases should work fine. The latter case should pull the former class internally.

The text was updated successfully, but these errors were encountered:

LysandreJik · 2021-05-31T07:32:41Z

Hello! We recommend you read the docs regarding the AutoModel. I have linked you the from_config method which should be used in this use case.

LysandreJik · 2021-05-31T07:41:59Z

However, it is indeed unexpected for you to receive this error message. The message should be more explicit, investigating now.

LysandreJik · 2021-05-31T07:50:00Z

Opened #11956 for a more explicit error, and opening your use case for discussion.

LysandreJik mentioned this issue May 31, 2021

Authorize args when instantiating an AutoModel #11956

Merged

g-karthik closed this as completed Jun 3, 2021

yongzhuo mentioned this issue Mar 6, 2024

求回复：用bert-tiny做多标签分类，训练没有问题，保存了tc.config和tc.model，推理的时候在加载模型的地方报错 yongzhuo/Pytorch-NLU#12

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AutoModel abstraction fails for pre-training initialization #11953

AutoModel abstraction fails for pre-training initialization #11953

g-karthik commented May 31, 2021

LysandreJik commented May 31, 2021

LysandreJik commented May 31, 2021

LysandreJik commented May 31, 2021

AutoModel abstraction fails for pre-training initialization #11953

AutoModel abstraction fails for pre-training initialization #11953

Comments

g-karthik commented May 31, 2021

Environment info

Who can help

Information

To reproduce

Expected behavior

LysandreJik commented May 31, 2021

LysandreJik commented May 31, 2021

LysandreJik commented May 31, 2021