Evaluation on coco dataset #33

omkaar718 · 2024-07-02T15:18:37Z

The results of using this implementation on coco val dataset seem to be quite lower than those reported in the paper.

Model: ViT-B
YOLOv8 detector model: yolov8x
YOLOv8 threshold: 0.35
YOLOv8 image size: 640
mAP@0.5:0.95 obtained on coco val data: 0.446, mAP@0.5: 0.589.

JunkyByte · 2024-07-02T15:41:20Z

Hello! Thank you for your test, this started as a fork of https://github.com/jaehyunnn/ViTPose_pytorch just to improve the inference pipeline, can you try checking with that implementation if you obtain similar results?

Also if you don't mind to share the code you use for eval, I won't have the time in the next couple weeks but I could do some tests.

Also can you check the map you get with the detector or try to run with groundtruth bbox? They report "Using detection results from a detector that obtains 56 mAP on person"

Thanks

JunkyByte · 2024-07-02T19:15:01Z

Hi I did some checks but I cannot give you an answer. I found that yolov8 had problems on MPS, if by any chance you are running on mac the evaluation. Updating the Ultralytics package solves the problem (I updated the requirements)

omkaar718 · 2024-07-03T00:53:53Z

@JunkyByte Thank you for your response!
I have opened a PR (#34) for COCO evaluation code. Readme has been updated with instructions to use the evaluation code.

omkaar718 · 2024-07-03T20:40:44Z

@JunkyByte
I found person detection results here provided in the official implementation: https://github.com/ViTAE-Transformer/ViTPose/blob/main/docs/en/tasks/2d_body_keypoint.md#:~:text=Please%20download%20from%20OneDrive%20or%20GoogleDrive from the official implementation's readme.
Not sure if these were the exact ones used by them, but the results have drastically improved and are close to those obtained using the official implementation.

mAP@0.5:0.95, detector threshold = 0.5 to filter out low confidence detection bboxes:

This imiplementation (easy_ViTPose): 0.693
Official implementation: 0.726

Therefore, the bbox detections resulting from yolov8 could be the main reason behind low scores in this pipeline.

JunkyByte · 2024-07-03T23:02:30Z

@omkaar718 thank you very much for inspecting this. I'm busy these days but I checked your PR and I will eventually merge it in the next few days, so thanks again.

Applying the models to videos I see qualitatively good results, it might be that indeed yolo does not work well for the coco val images.

I will get back to you :) have a nice day!

JunkyByte closed this as completed Jul 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluation on coco dataset #33

Evaluation on coco dataset #33

omkaar718 commented Jul 2, 2024 •

edited

Loading

JunkyByte commented Jul 2, 2024 •

edited

Loading

JunkyByte commented Jul 2, 2024

omkaar718 commented Jul 3, 2024

omkaar718 commented Jul 3, 2024 •

edited

Loading

JunkyByte commented Jul 3, 2024

Evaluation on coco dataset #33

Evaluation on coco dataset #33

Comments

omkaar718 commented Jul 2, 2024 • edited Loading

JunkyByte commented Jul 2, 2024 • edited Loading

JunkyByte commented Jul 2, 2024

omkaar718 commented Jul 3, 2024

omkaar718 commented Jul 3, 2024 • edited Loading

JunkyByte commented Jul 3, 2024

omkaar718 commented Jul 2, 2024 •

edited

Loading

JunkyByte commented Jul 2, 2024 •

edited

Loading

omkaar718 commented Jul 3, 2024 •

edited

Loading