
The sdr metric in TM sometimes gives NaN for some input #895

Closed
quancs opened this issue Mar 19, 2022 · 3 comments · Fixed by #899
Labels: bug / fix, help wanted, topic: Audio
Comments

quancs (Member) commented Mar 19, 2022

🐛 Bug

This issue is related to the torch version of fast-bss-eval; see fakufaku/fast_bss_eval#5.

To Reproduce

import numpy as np
import torch
from mir_eval.separation import bss_eval_sources
from torchmetrics.functional.audio import signal_distortion_ratio

# Load the attached data (unzip data.zip to get debug.npz).
x = np.load('debug.npz')
preds = torch.tensor(x['preds'])
target = torch.tensor(x['target'])
print(preds.shape, target.shape)

# TorchMetrics SDR: the second source comes out as NaN.
sdr = signal_distortion_ratio(preds, target)
print(sdr)

# mir_eval reference: both sources get a finite SDR.
sdr, _, _, _ = bss_eval_sources(target.numpy(), preds.numpy(), False)
print(sdr)

outputs:

torch.Size([2, 64000]) torch.Size([2, 64000])
tensor([-2.6815,     nan])
[-2.68156071 44.58523729]

Unzip data.zip to get debug.npz.


Expected behavior

The result given by signal_distortion_ratio should be close to the one given by mir_eval.

Environment

  • OS (e.g., Linux):
  • Python & PyTorch Version (e.g., 1.0):
  • How you installed PyTorch (conda, pip, build command if you used source):
  • Any other relevant information:

Additional context

quancs added the bug / fix and help wanted labels on Mar 19, 2022
SkafteNicki (Member) commented

Hi @quancs,
From the issue that you linked, it seems that the author's solution is basically to do the evaluation in double instead of float. I can confirm that doing this fixes the example you sent. Do you think we should cast the user's input here:
https://github.com/PyTorchLightning/metrics/blob/865a08fcf102c2eb1b776b13643bc87aadf7f4f7/torchmetrics/functional/audio/sdr.py#L140-L141
to double instead of float? Alternatively, we can add a note to the docstring that in some cases it is better to evaluate using double precision.
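
A hedged illustration of this suggestion, applied from the caller side to the reproduction script above rather than as the actual change inside sdr.py:

# Same reproduction data as above, but evaluated in double precision,
# which reportedly avoids the NaN for this example.
sdr = signal_distortion_ratio(preds.double(), target.double())
print(sdr)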

quancs (Member, Author) commented Mar 21, 2022


This issue happens with the torch version of the metric on CPU; on GPU it's OK.
And I haven't seen any such violations (NaN values) in my past experiment results tested on GPU.
I have some ideas to fix this:

  1. convert to double anyway, no matter whether on CPU or GPU, but it may make the metric slow
  2. convert to double on CPU only, but it may make the metric slow, and we don't know whether GPU is really always OK
  3. convert to double when we detect that the result is not a valid number (NaN or Inf), and run again (a rough sketch is below)

I prefer 3). What do you think? Or do you have other ideas?
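
A minimal sketch of option 3, written as a wrapper around the public functional API; sdr_with_double_fallback is a hypothetical helper name and not part of torchmetrics or any merged fix:

import torch
from torchmetrics.functional.audio import signal_distortion_ratio

def sdr_with_double_fallback(preds: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    # Try the inputs' own precision (typically float32) first.
    sdr = signal_distortion_ratio(preds, target)
    # If any value is NaN/Inf, redo the computation in float64 and cast back.
    if not torch.isfinite(sdr).all():
        sdr = signal_distortion_ratio(preds.double(), target.double()).to(sdr.dtype)
    return sdr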

SkafteNicki (Member) commented

  1. is fine by me :]
