
Metrics error due to inplace operation, "computation has been modified by an inplace operation". #2862

Closed
sykrn opened this issue Aug 7, 2020 · 5 comments · Fixed by #2878
Labels: bug (Something isn't working), help wanted (Open to be worked on)

sykrn commented Aug 7, 2020

Hey @williamFalcon, I got a new error after upgrading the library today. I'm using the accuracy metric, and it now raises an error.

Code sample:

# in the LightningModule
import torch.nn.functional as F
from pytorch_lightning.metrics.functional import accuracy

def training_step(self, batch, batch_idx):
    x, y = batch
    y_hat = self(x)
    loss = F.cross_entropy(y_hat, y)
    acc = accuracy(y_hat, y)  # from the functional classification metrics
    tensorboard_logs = {'train_loss': loss}
    return {'loss': loss, 'log': tensorboard_logs}

Error msg:

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.LongTensor [32]] is at version 2; expected version 1 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

Workaround: it can be solved with the .clone() method. When I clone y before feeding it to the accuracy function, no error is raised:

acc = accuracy(y_hat, y.clone())

Presumably this works because the metric then mutates a copy, leaving the original y (which cross_entropy saved for its backward pass) untouched. But it's inconvenient if users have to do this manually, isn't it? The same code ran without clone() before I upgraded to the latest version, so the error is likely caused by a recent update/rebase.

The same error is raised for the f1_score metric.
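
As the hint in the traceback suggests, anomaly detection can pinpoint which forward operation produced the failing gradient. A minimal sketch of how one might enable it while debugging (it slows training, so remove it afterwards):

import torch

# Enable once, e.g. at the top of the training script. The backward error
# will then carry a traceback pointing at the offending forward op.
torch.autograd.set_detect_anomaly(True)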

sykrn added the bug (Something isn't working) and help wanted (Open to be worked on) labels Aug 7, 2020
github-actions bot (Contributor) commented Aug 7, 2020

Hi! Thanks for your contribution, great first issue!

williamFalcon (Contributor) commented

cc @justusschock

justusschock (Member) commented Aug 7, 2020

cc @Diuven, I think you introduced in-place ops for a speed-up, right?
Does the speedup come only from the in-place methods, or can we simply replace them with out-of-place ones?

Diuven added a commit to Diuven/pytorch-lightning that referenced this issue Aug 8, 2020
Diuven (Contributor) commented Aug 8, 2020

> cc @Diuven, I think you introduced in-place ops for a speed-up, right?
> Does the speedup come only from the in-place methods, or can we simply replace them with out-of-place ones?

Yeah, you're right. I think this is caused by the clamp_max_ I used in stat_scores_multiple_classes. This is the part.

Here is a quick PR fixing this issue. If you can, please check whether the code still raises the error.

Sorry for the inconvenience!
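
For context, a minimal sketch (not the library code) of the failure mode: an in-place op such as clamp_max_ bumps the tensor's version counter, and autograd refuses to run backward through a graph whose saved tensors were mutated afterwards. The out-of-place clamp_max returns a new tensor and avoids this:

import torch
import torch.nn.functional as F

logits = torch.randn(4, 3, requires_grad=True)
targets = torch.randint(0, 3, (4,))

loss = F.cross_entropy(logits, targets)  # saves `targets` for backward
targets.clamp_max_(2)                    # in-place: bumps targets' version counter
# loss.backward()                        # would raise the RuntimeError above

# The out-of-place variant leaves `targets` (and its version counter) intact:
loss = F.cross_entropy(logits, targets)
clamped = targets.clamp_max(2)           # returns a new tensor instead
loss.backward()                          # fine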

williamFalcon pushed a commit that referenced this issue Aug 8, 2020
* Faster classification stats

* Faster accuracy metric

* minor change on cls metric

* Add out-of-bound class clamping

* Add more tests and minor fixes

* Resolve code style warning

* Update for #2781

* hotfix

* Update pytorch_lightning/metrics/functional/classification.py

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>

* Update about conversation

* Add docstring on stat_scores_multiple_classes

* Fixing #2862

Co-authored-by: Younghun Roh <yhunroh@mindslab.ai>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
sykrn (Author) commented Aug 8, 2020

Great, it runs without any error now. Thanks for the hotfix. 👍
