Overall accuracy is reported as 0.0 while it should be greater than 0 #53485

przemekwitek · 2020-03-12T14:38:40Z

Issue noticed and described by @wwang500:

I have a question about overall_accuracy result. when I ran _eval on car-parts classification/inference result index.

POST _ml/data_frame/_evaluate
{
  "index": "dest_car_parts_70_1583979097545",
  "evaluation": {
      "classification": {
        "actual_field": "ml.inference.predicted_value.keyword",
        "predicted_field": "ml.N_Lunker_prediction",
         "metrics": {
           "accuracy": {}
         }
      }
   }
}

I got this results:

{
  "classification" : {
    "accuracy" : {
      "classes" : [
        {
          "class_name" : "0",
          "accuracy" : 1.0
        },
        {
          "class_name" : "1",
          "accuracy" : 1.0
        }
      ],
      "overall_accuracy" : 0.0
    }
  }
}

shouldn't overall_accuracy be 1.0 too?
it might be caused the field mapping,
"ml.N_Lunker_prediction" : {"type" : "long"},
"ml.inference.predicted_value.keyword"

The text was updated successfully, but these errors were encountered:

elasticmachine · 2020-03-12T14:38:42Z

Pinging @elastic/ml-core (:ml)

przemekwitek · 2020-03-12T14:41:06Z

I reproduced the issue and already know why it happens. You were right with mappings mismatch.
Inference does not impose any mappings on the prediction field so it is mapped as text&keyword.
In the case you described the other field is of type long.

I think the sensible approach is to make comparison in evaluation painless script more lenient so that it compares string representations rather than raw values:
String.valueOf(doc[''{0}''].value).equals(String.valueOf(doc[''{1}''].value))
instead of:
doc[''{0}''].value == doc[''{1}''].value

#53458 implements this idea.

przemekwitek · 2020-03-16T14:40:15Z

#53458 and its backport to 7.x are now merged in.

przemekwitek added >bug :ml Machine learning labels Mar 12, 2020

przemekwitek self-assigned this Mar 12, 2020

przemekwitek mentioned this issue Mar 12, 2020

Make classification evaluation metrics work when there is field mapping type mismatch #53458

Merged

przemekwitek closed this as completed Mar 16, 2020

codebrain mentioned this issue Apr 1, 2020

7.7.0 meta ticket (Part 3) elastic/elasticsearch-net#4534

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overall accuracy is reported as 0.0 while it should be greater than 0 #53485

Overall accuracy is reported as 0.0 while it should be greater than 0 #53485

przemekwitek commented Mar 12, 2020

elasticmachine commented Mar 12, 2020

przemekwitek commented Mar 12, 2020 •

edited

Loading

przemekwitek commented Mar 16, 2020

Overall accuracy is reported as 0.0 while it should be greater than 0 #53485

Overall accuracy is reported as 0.0 while it should be greater than 0 #53485

Comments

przemekwitek commented Mar 12, 2020

elasticmachine commented Mar 12, 2020

przemekwitek commented Mar 12, 2020 • edited Loading

przemekwitek commented Mar 16, 2020

przemekwitek commented Mar 12, 2020 •

edited

Loading