
Detection metrics score doesn't accurately point out quality or privacy. Should the formula be changed? #375

npatki opened this issue Jun 26, 2023 · 0 comments
Labels
question General question about the software


Problem Description

The detection metrics for single-table data and sequential data both compute the ROC AUC and return 1 - AUC as the final score. The resulting score is hard to interpret.

  • An extreme score (close to 0 or close to 1) indicates that the synthetic and real data are noticeably different -- enough for a model to tell them apart. This indicates lower quality or, alternatively, higher privacy.
  • A middle score (close to 0.5) indicates that the synthetic and real data are similar -- enough to fool the model, since the model does no better than random guessing. This indicates higher quality or, alternatively, lower privacy.

This is an odd way to interpret the score. Usually, we want 1 to represent success and 0 to represent failure.
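To make the current convention concrete, here is a minimal sketch (not the actual SDMetrics implementation) of how such a detection score comes together, assuming we already have a discriminator's predicted probabilities that each row is real. AUC is computed via the Mann-Whitney rank statistic, which is equivalent to ROC AUC.

```python
def auc(labels, probs):
    """ROC AUC: the probability that a randomly chosen real row (label 1)
    is ranked above a randomly chosen synthetic row (label 0)."""
    pos = [p for label, p in zip(labels, probs) if label == 1]
    neg = [p for label, p in zip(labels, probs) if label == 0]
    # Count pairwise wins; ties count as half a win.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def current_score(labels, probs):
    # Current convention: return 1 - AUC as the final detection score.
    return 1 - auc(labels, probs)

# A perfect discriminator (AUC = 1) yields a score of 0,
# while an uninformative one (AUC = 0.5) yields a score of 0.5.
print(current_score([1, 1, 0, 0], [0.9, 0.8, 0.2, 0.1]))  # perfect separation
print(current_score([1, 0], [0.5, 0.5]))                   # random guessing
```

Note that under this convention neither endpoint of the [0, 1] range cleanly means "success" or "failure", which is the interpretability problem described above.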

Proposed Changes

Instead of returning 1 - AUC, perhaps we could use a different formula, such as:

$score = | AUC - 0.5 | \times 2$

This would yield a score that is geared towards privacy:

  • 0 if the AUC is close to 0.5, which means lower privacy
  • 1 if the AUC is close to an extreme (0 or 1), which means higher privacy
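The proposed rescaling is a one-liner; the sketch below shows how it folds both extremes of the AUC range onto 1 and maps an uninformative discriminator onto 0:

```python
def proposed_score(auc):
    """Proposed privacy-oriented score: |AUC - 0.5| * 2.

    0 means the discriminator is no better than random (data are
    indistinguishable -> lower privacy); 1 means the discriminator
    separates real from synthetic perfectly (higher privacy)."""
    return abs(auc - 0.5) * 2

print(proposed_score(0.5))  # indistinguishable data
print(proposed_score(1.0))  # perfectly separable data
print(proposed_score(0.0))  # also perfectly separable (inverted labels)
```

One design consequence worth noting: because the absolute value folds AUC = 0 and AUC = 1 onto the same score, the metric deliberately treats "the classifier always inverts the labels" the same as "the classifier is always right".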
@npatki npatki added the question General question about the software label Jun 26, 2023