AssertionError: Not equal to tolerance rtol=0.001, atol=1e-05 #1272

Closed
GeneralJing opened this issue Dec 9, 2020 · 7 comments

Labels
docathon-h2-2023, medium, onnx (Issues relating to ONNX)

Comments

GeneralJing commented Dec 9, 2020

Recently I have been converting a PyTorch segmentation model to an ONNX model. I can export the ONNX model, pass onnx.checker.check_model(), and run inference with ONNX Runtime. But when I use np.testing.assert_allclose(to_numpy(torch_out), ort_outs[0], rtol=1e-03, atol=1e-05) to compare the ONNX Runtime and PyTorch results, I get an AssertionError like the following:

AssertionError:
Not equal to tolerance rtol=0.001, atol=1e-05

Mismatched elements: 20827169 / 20971520 (99.3%)
Max absolute difference: 1.8859415
Max relative difference: 1008390.8
x: array([[[[ 1.165803e+01, 1.163278e+01, 1.160753e+01, ...,
1.179392e+01, 1.176985e+01, 1.174578e+01],
[ 1.167064e+01, 1.164517e+01, 1.161970e+01, ...,...
y: array([[[[11.636896, 11.6166 , 11.596304, ..., 12.943967, 12.909642,
12.875318],
[11.656967, 11.636346, 11.615723, ..., 12.954525, 12.920053,...
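
For reference, np.testing.assert_allclose reports an element as mismatched when it fails the following check (a sketch of the documented criterion, not the exact NumPy implementation):

    abs(actual - desired) <= atol + rtol * abs(desired)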

The code snippet to export the model is as follows:

# `model` is the segmentation model to be exported; switch it to inference mode
model.eval()
batch_size = 1
input_shape = (3, 512, 512)

# x = torch.autograd.Variable(torch.randn(batch_size, *input_shape))

# dummy input used only to trace the model during export
x = torch.rand(batch_size, 3, 512, 512, requires_grad=True)
torch.onnx.export(model, x, model_file_name + '.onnx', export_params=True, opset_version=11, verbose=False)

In this tutorial, https://pytorch.org/tutorials/advanced/super_resolution_with_onnxruntime.html, it says that if the results do not match then there is an issue in the ONNX exporter, but I don't know where the mistake is.
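
For context, a minimal sketch of that verification step, assuming model, x, and model_file_name are the same objects as in the export snippet above (this mirrors the tutorial's pattern, not necessarily the exact code that produced the error):

import onnx
import onnxruntime
import numpy as np

def to_numpy(tensor):
    return tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy()

# structural check of the exported graph
onnx_model = onnx.load(model_file_name + '.onnx')
onnx.checker.check_model(onnx_model)

# run the same dummy input through PyTorch and ONNX Runtime
torch_out = model(x)
ort_session = onnxruntime.InferenceSession(model_file_name + '.onnx')
ort_inputs = {ort_session.get_inputs()[0].name: to_numpy(x)}
ort_outs = ort_session.run(None, ort_inputs)

# element-wise comparison with the tolerances from the error message
np.testing.assert_allclose(to_numpy(torch_out), ort_outs[0], rtol=1e-03, atol=1e-05)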

cc @BowenBao @sekyondaMeta @svekars @carljparker @NicolasHug @kit1980 @subramen

@Samsonboadi
Contributor

/assigntome

Samsonboadi removed their assignment Jun 1, 2023
@bhavyashahh

/assigntome


bhavyashahh commented Jun 10, 2023

Hello, could you share the specific PyTorch segmentation model you are trying to use and the rest of your code settings, so that I can try to reproduce your issue on my end?
Apart from that, I tried a sample snippet with the pretrained fcn_resnet50 model from torchvision (0.15.2+cu118).

import torch
import torchvision
import torch.onnx
import onnx
import onnxruntime
import numpy as np

model = torchvision.models.segmentation.fcn_resnet50(pretrained=True)
model.eval()
batch_size = 1

x = torch.rand(batch_size, 3, 512, 512, requires_grad=True)
torch.onnx.export(model, x, 'exported_model.onnx', export_params=True, opset_version=11, verbose=True)

# PyTorch results
torch_out = model(x)

onnx_model = onnx.load("exported_model.onnx")
onnx.checker.check_model(onnx_model)

ort_session = onnxruntime.InferenceSession("exported_model.onnx")

def to_numpy(tensor):
    return tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy()

# compute ONNX Runtime output prediction
ort_inputs = {ort_session.get_inputs()[0].name: to_numpy(x)}
ort_outs = ort_session.run(None, ort_inputs)

# compare ONNX Runtime and PyTorch results
np.testing.assert_allclose(to_numpy(torch_out['out']), ort_outs[0], rtol=1e-03, atol=1e-05)

There is no assertion error related to tolerance, and the two outputs agree within the given tolerances.
The versions that work for me are:

  • torch: 2.0.1+cu118
  • onnx: 1.14.0
  • onnxruntime: 1.15.0

I would suggest checking your versions and retrying first.
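
A quick way to check those versions against the list above (a minimal sketch, assuming all three packages are installed):

import torch
import onnx
import onnxruntime

# print the installed versions for comparison with the versions listed above
print("torch:", torch.__version__)
print("onnx:", onnx.__version__)
print("onnxruntime:", onnxruntime.__version__)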


svekars commented Oct 24, 2023

This issue has been unassigned due to inactivity. If you are still planning to work on this, you can send a PR referencing this issue.

@bjhargrave
Contributor

/assigntome

@bjhargrave
Contributor

I don't think this is an issue with the tutorial. The tutorial works as expected and #1272 (comment) shows that it also works for a segmentation model. Without any additional details from the submitter, we should close this issue.

cc: @svekars


svekars commented Nov 7, 2023

Thanks @bjhargrave - will close and grant half credit for investigation.

svekars closed this as completed Nov 7, 2023