Add a condition for nested_detach #31852

haikuoxin · 2024-07-09T09:08:54Z

System Info

When using Trainer to train a VLM such as 'internlm/internlm-xcomposer2-vl-7b' with evaluation_strategy set, an AttributeError is raised during evaluation, as shown below:

trainer.train()
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer.py", line 1553, in train
    return inner_training_loop(
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer.py", line 1927, in _inner_training_loop
    self._maybe_log_save_evaluate(tr_loss, model, trial, epoch, ignore_keys_for_eval)
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer.py", line 2254, in _maybe_log_save_evaluate
    metrics = self.evaluate(ignore_keys=ignore_keys_for_eval)
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer.py", line 2968, in evaluate
    output = eval_loop(
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer.py", line 3157, in evaluation_loop
    loss, logits, labels = self.prediction_step(model, inputs, prediction_loss_only, ignore_keys=ignore_keys)
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer.py", line 3347, in prediction_step
    labels = nested_detach(tuple(inputs.get(name) for name in self.label_names))
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer_pt_utils.py", line 166, in nested_detach
    return type(tensors)(nested_detach(t) for t in tensors)
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer_pt_utils.py", line 166, in <genexpr>
    return type(tensors)(nested_detach(t) for t in tensors)
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer_pt_utils.py", line 168, in nested_detach
    return type(tensors)({k: nested_detach(t) for k, t in tensors.items()})
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer_pt_utils.py", line 168, in <dictcomp>
    return type(tensors)({k: nested_detach(t) for k, t in tensors.items()})
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer_pt_utils.py", line 166, in nested_detach
    return type(tensors)(nested_detach(t) for t in tensors)
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer_pt_utils.py", line 166, in <genexpr>
    return type(tensors)(nested_detach(t) for t in tensors)
  File "/root/anaconda3/envs/sharegpt4v/lib/python3.10/site-packages/transformers/trainer_pt_utils.py", line 169, in nested_detach
    return tensors.detach()
AttributeError: 'str' object has no attribute 'detach'
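
The recursion in the traceback can be reproduced without Trainer or PyTorch. The sketch below is an approximation, not the library source: the three return lines are copied from the traceback, while the isinstance conditions, the function name nested_detach_unguarded, and the label structure are assumptions chosen to match the recursion order shown above (tuple, then dict, then list, then str).

```python
from collections.abc import Mapping


def nested_detach_unguarded(tensors):
    # Return lines copied from the traceback; the isinstance conditions are assumed.
    if isinstance(tensors, (list, tuple)):
        return type(tensors)(nested_detach_unguarded(t) for t in tensors)
    elif isinstance(tensors, Mapping):
        return type(tensors)({k: nested_detach_unguarded(t) for k, t in tensors.items()})
    # Every leaf is assumed to be a tensor, so a str leaf fails here.
    return tensors.detach()


# Hypothetical label structure: tuple -> dict -> list -> str.
labels = ({"text": ["a caption"]},)

error_message = None
try:
    nested_detach_unguarded(labels)
except AttributeError as err:
    error_message = str(err)

print(error_message)
```

Any non-tensor leaf (str, int, None) reaches the final detach() call, which is why a dataset that returns strings breaks the evaluation step.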

Who can help?

@muellerzr

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

Load the model and tokenizer with the from_pretrained method.
Set training_args with an evaluation strategy enabled.
Build train_dataset and eval_dataset, where each dataset item is a Dict[str, str].
Create a trainer with the Trainer API, then call trainer.train().

Expected behavior

The model should train and evaluate normally, without raising an error.
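
One way the condition named in the title could look is sketched below. This is a hedged illustration, not the merged patch: FakeTensor is a hypothetical stand-in for torch.Tensor so the example runs without PyTorch, and nested_detach_guarded and its tensor_type parameter are names introduced here. In the real function the check would presumably be against torch.Tensor.

```python
from collections.abc import Mapping


class FakeTensor:
    """Hypothetical stand-in for torch.Tensor so the sketch runs without PyTorch."""

    def __init__(self, value):
        self.value = value
        self.detached = False

    def detach(self):
        # Return a new object flagged as detached, mimicking Tensor.detach().
        out = FakeTensor(self.value)
        out.detached = True
        return out


def nested_detach_guarded(tensors, tensor_type=FakeTensor):
    if isinstance(tensors, (list, tuple)):
        return type(tensors)(nested_detach_guarded(t, tensor_type) for t in tensors)
    elif isinstance(tensors, Mapping):
        return type(tensors)(
            {k: nested_detach_guarded(t, tensor_type) for k, t in tensors.items()}
        )
    # The added condition: detach tensor leaves, return anything else unchanged.
    return tensors.detach() if isinstance(tensors, tensor_type) else tensors


# Hypothetical labels mixing a string leaf with a tensor-like leaf.
labels = ({"text": ["a caption"], "ids": FakeTensor([1, 2, 3])},)
result = nested_detach_guarded(labels)
```

With this guard, string leaves pass through untouched while tensor leaves are still detached, so the container types (tuple, dict, list) are preserved.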

haikuoxin added a commit to haikuoxin/transformers that referenced this issue Jul 9, 2024

Add a condition for nested_detach

2b66034

fix bug: huggingface#31852

haikuoxin mentioned this issue Jul 9, 2024

Add a condition for nested_detach #31855

Merged

amyeroberts pushed a commit that referenced this issue Jul 10, 2024

Add a condition for nested_detach (#31855)

c54af4c

fix bug: #31852

amyeroberts closed this as completed in #31855 Jul 10, 2024

amyeroberts pushed a commit to amyeroberts/transformers that referenced this issue Jul 19, 2024

Add a condition for nested_detach (huggingface#31855)

479ccf3

fix bug: huggingface#31852

MHRDYN7 pushed a commit to MHRDYN7/transformers that referenced this issue Jul 23, 2024

Add a condition for nested_detach (huggingface#31855)

ab0bb7c

fix bug: huggingface#31852

zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this issue Jul 24, 2024

Add a condition for nested_detach (huggingface#31855)

4743417

fix bug: huggingface#31852
