
How to convert the outputs of yolov5.onnx to boxes, labels and scores #708

Closed
JiaoPaner opened this issue Aug 11, 2020 · 22 comments
Labels
question Further information is requested

Comments

@JiaoPaner

❔Question

Hi buddy, can you help me explain the outputs of the ONNX model? I don't know how to convert the outputs to boxes, labels and scores.
I used Netron to inspect this ONNX model.
outputs:
name: classes
type: float32[1,3,80,80,85]

name: boxes
type: float32[1,3,40,40,85]

name: 444
type: float32[1,3,20,20,85]

Why do the outputs have five dimensions? How do I convert them into detection results?
Thanks.

JiaoPaner added the question label Aug 11, 2020
@NosremeC

I think you should look at the output of non_max_suppression, which is called 'pred' in detect.py. Each row has the form (x1, y1, x2, y2, conf, cls). You can arrange the elements however you like and write them to txt or JSON.
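For a concrete starting point, here is a minimal Python sketch of that idea (assuming pred is the list returned by non_max_suppression in detect.py, one tensor per image; the output file name is just a placeholder):

    import json

    results = []
    for det in pred:  # one tensor per image; each row is (x1, y1, x2, y2, conf, cls)
        for *xyxy, conf, cls in det.tolist():
            results.append({
                "box": [round(v, 2) for v in xyxy],  # x1, y1, x2, y2 in pixels
                "score": round(conf, 4),
                "label": int(cls),
            })

    with open("detections.json", "w") as f:  # placeholder file name
        json.dump(results, f)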

@JiaoPaner
Author

JiaoPaner commented Aug 12, 2020

Thanks. I found that method too. I'll probably re-implement it in C++, since I'm using the ONNX Runtime C++ version.

@yongjingli

yongjingli commented Aug 12, 2020

Hello @JiaoPaner, @NosremeC, I also want to do the same thing with the ONNX Runtime C++ version, but I ran into a problem with Detect in yolo.py. After I set self.training = False, I don't understand why I still get only x as output instead of (torch.cat(z, 1), x).

This is the relevant code in yolo.py:

            if not self.training:  # inference
                if self.grid[i].shape[2:4] != x[i].shape[2:4]:
                    self.grid[i] = self._make_grid(nx, ny).to(x[i].device)

                y = x[i].sigmoid()
                y[..., 0:2] = (y[..., 0:2] * 2. - 0.5 + self.grid[i].to(x[i].device)) * self.stride[i]  # xy
                y[..., 2:4] = (y[..., 2:4] * 2) ** 2 * self.anchor_grid[i]  # wh
                z.append(y.view(bs, -1, self.no))

        return x if self.training else (torch.cat(z, 1), x)
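One thing worth checking (a guess, based on the stock Detect.forward that is quoted later in this thread): at that time forward() began with a line that forces training mode whenever export is enabled, so the inference branch above is skipped and only x is returned:

    def forward(self, x):
        # x = x.copy()  # for profiling
        z = []  # inference output
        self.training |= self.export  # when export is True, this forces training mode back on
        ...

Setting model.model[-1].export = False before exporting (as discussed below) avoids that.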

@JiaoPaner
Author

JiaoPaner commented Aug 15, 2020

@yongjingli You can look at #343; that issue solved my problem. I re-implemented the non_max_suppression from yolov5/utils/general.py in C++ for yolov5s.onnx (in export.py, I set model.model[-1].export = False). The main output-parsing code is as follows:

    float* output = output_tensor[0].GetTensorMutableData<float>(); // output of onnx runtime ->>> 1,25200,85
    size_t size = output_tensor[0].GetTensorTypeAndShapeInfo().GetElementCount(); // 1x25200x85=2142000
    int dimensions = 85; // 0-3 -> box (cx, cy, w, h), 4 -> objectness confidence, 5-84 -> 80 COCO class scores
    int rows = size / dimensions; //25200
    int confidenceIndex = 4;
    int labelStartIndex = 5;
    float modelWidth = 640.0;
    float modelHeight = 640.0;
    float xGain = modelWidth / image.width;
    float yGain = modelHeight / image.height;
    
    std::vector<cv::Vec4f> locations;
    std::vector<int> labels;
    std::vector<float> confidences;

    std::vector<cv::Rect> src_rects;
    std::vector<cv::Rect> res_rects;
    std::vector<int> res_indexs;

    cv::Rect rect;
    cv::Vec4f location;
    for (int i = 0; i < rows; ++i) {
        int index = i * dimensions;
        if(output[index+confidenceIndex] <= 0.4f) continue;

        for (int j = labelStartIndex; j < dimensions; ++j) {
            output[index+j] = output[index+j] * output[index+confidenceIndex];
        }

        for (int k = labelStartIndex; k < dimensions; ++k) {
            if(output[index+k] <= 0.5f) continue;

            location[0] = (output[index] - output[index+2] / 2) / xGain;//top left x
            location[1] = (output[index + 1] - output[index+3] / 2) / yGain;//top left y
            location[2] = (output[index] + output[index+2] / 2) / xGain;//bottom right x
            location[3] = (output[index + 1] + output[index+3] / 2) / yGain;//bottom right y

            locations.emplace_back(location);

            rect = cv::Rect(location[0], location[1],
                            location[2] - location[0], location[3] - location[1]);
            src_rects.push_back(rect);
            labels.emplace_back(k-labelStartIndex);


            confidences.emplace_back(output[index+k]);
        }

    }
    utils::nms(src_rects,res_rects,res_indexs);

    cJSON  *result = cJSON_CreateObject(), *items = cJSON_CreateArray();
    for (int i = 0; i < res_indexs.size(); ++i) {
        cJSON  *item = cJSON_CreateObject();
        int index = res_indexs[i];
        cJSON_AddStringToObject(item, "label", classes[labels[index]].c_str());
        cJSON_AddNumberToObject(item,"score",confidences[index]);
        cJSON  *location = cJSON_CreateObject();
        cJSON_AddNumberToObject(location,"x",locations[index][0]);
        cJSON_AddNumberToObject(location,"y",locations[index][1]);
        cJSON_AddNumberToObject(location,"width",locations[index][2] - locations[index][0]);
        cJSON_AddNumberToObject(location,"height",locations[index][3] - locations[index][1]);
        cJSON_AddItemToObject(item,"location",location);
        cJSON_AddItemToArray(items,item);
    }
    cJSON_AddNumberToObject(result, "code", 0);
    cJSON_AddStringToObject(result, "msg", "success");
    cJSON_AddItemToObject(result, "data", items);
    char *resultJson = cJSON_PrintUnformatted(result);
    return resultJson;

// NMS helper. thresh is the IoU threshold; it presumably has a default value
// (e.g. 0.5f) in the header declaration, since the call above omits it.
void utils::nms(const std::vector<cv::Rect> &srcRects, std::vector<cv::Rect> &resRects,
                std::vector<int> &resIndexs, float thresh) {
    resRects.clear();
    const size_t size = srcRects.size();
    if (!size) return;
    // Sort the bounding boxes by the bottom-right y-coordinate of the bounding box
    std::multimap<int, size_t> idxs;
    for (size_t i = 0; i < size; ++i){
        idxs.insert(std::pair<int, size_t>(srcRects[i].br().y, i));
    }
    // keep looping while some indexes still remain in the indexes list
    while (idxs.size() > 0){
        // grab the last rectangle
        auto lastElem = --std::end(idxs);
        const cv::Rect& last = srcRects[lastElem->second];
        resIndexs.push_back(lastElem->second);
        resRects.push_back(last);
        idxs.erase(lastElem);
        for (auto pos = std::begin(idxs); pos != std::end(idxs); ){
            // grab the current rectangle
            const cv::Rect& current = srcRects[pos->second];
            float intArea = (last & current).area();
            float unionArea = last.area() + current.area() - intArea;
            float overlap = intArea / unionArea;
            // if there is sufficient overlap, suppress the current bounding box
            if (overlap > thresh) pos = idxs.erase(pos);
            else ++pos;
        }
    }
}

@ricklina90

@JiaoPaner Is there a C# version available? Thanks a lot.

@JiaoPaner
Author

@ricklina90 You can just port the C++ code above to C#.

@ricklina90

@JiaoPaner After porting the C++ code to C#, it works fine. Thank you.

@spicybeef003

spicybeef003 commented Sep 4, 2020

Is there a way to reshape this to [255, 20, 20], etc.?

@JonathanLehner

My ONNX session outputs (1, 25200, 11), but non_max_suppression outputs torch.Size([300, 6]) (6 classes). Why does it have 300 rows? How do I convert this to x, y coordinates?

@Edwardmark

@JonathanLehner 300 is the number of boxes kept after NMS (it is capped at 300 per image by default), and the 6 columns per box are (x1, y1, x2, y2, conf, cls_id).
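For reference, a minimal sketch of unpacking one of those [N, 6] tensors (assuming the post-NMS layout (x1, y1, x2, y2, conf, cls) described earlier in this thread):

    for x1, y1, x2, y2, conf, cls_id in det.tolist():  # det: per-image output of non_max_suppression
        cx, cy = (x1 + x2) / 2, (y1 + y2) / 2  # box center
        w, h = x2 - x1, y2 - y1                # box width and height
        print(int(cls_id), round(conf, 3), cx, cy, w, h)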

@kafan1986

@JiaoPaner I have a couple of queries regarding your C++ implementation.

a) You are considering only one output layer when there are three in total. Is it sufficient to consider only the bounding boxes from the output layer with the smallest stride?

b) In your box calculation you haven't used any sigmoid function, anchors, or stride. How are you getting the box dimensions right?

@JiaoPaner
Author

JiaoPaner commented Oct 15, 2020

@kafan1986
In export.py, I set model.model[-1].export = False.
Using Netron to inspect this ONNX model:

name: output
type: float32[1,25200,85]

name: 404
type: float32[1,3,80,80,85]

name: 687
type: float32[1,3,40,40,85]

name: 970
type: float32[1,3,20,20,85]

There are 4 outputs, but we only need the first one, named "output". You don't need to apply any sigmoid function anymore; that output is already decoded.
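To make that concrete, here is a minimal Python decoding sketch (assumptions: the model was exported with the merged [1, 25200, 85] output as described above, the input is already letterboxed to 640x640, and the 0.4/0.5 thresholds are placeholders). Each of the 25200 rows is (cx, cy, w, h, objectness, 80 class scores), already sigmoid-activated and in input-image pixels:

    import numpy as np
    import onnxruntime as ort

    sess = ort.InferenceSession("yolov5s.onnx")
    img = np.zeros((1, 3, 640, 640), dtype=np.float32)  # placeholder preprocessed input
    pred = sess.run(["output"], {sess.get_inputs()[0].name: img})[0][0]  # (25200, 85)

    boxes, scores, labels = [], [], []
    for row in pred:
        obj = row[4]                # objectness confidence
        if obj <= 0.4:
            continue
        cls_scores = row[5:] * obj  # class confidence * objectness
        cls_id = int(cls_scores.argmax())
        if cls_scores[cls_id] <= 0.5:
            continue
        cx, cy, w, h = row[:4]      # box center and size in input-image pixels
        boxes.append([cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2])  # x1, y1, x2, y2
        scores.append(float(cls_scores[cls_id]))
        labels.append(cls_id)
    # boxes/scores/labels still need NMS and rescaling back to the original image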

@Jiang15

Jiang15 commented Oct 16, 2020


Hi, have you managed to export the ONNX model? I tried torch.onnx.export(model, img, "yolos.onnx") but I got the error "Exporting the operator hardswish to ONNX opset version 9 is not supported. Please open a bug to request ONNX export support for the missing operator." I have been stuck on this problem for a while.

@JiaoPaner
Author

JiaoPaner commented Oct 17, 2020

@Jiang15 Set opset_version=12:
torch.onnx.export(model, img, "yolov5s.onnx", verbose=False, opset_version=12, input_names=['image'], output_names=['output'])
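For a bit more context, a minimal export sketch (a guess at the surrounding code; attempt_load is the yolov5 repo helper, and the weight path and image size are placeholders):

    import torch
    from models.experimental import attempt_load  # yolov5 helper

    model = attempt_load("yolov5s.pt", map_location="cpu")  # load trained weights
    model.eval()
    img = torch.zeros(1, 3, 640, 640)  # dummy input matching the export size
    torch.onnx.export(model, img, "yolov5s.onnx", verbose=False, opset_version=12,
                      input_names=["image"], output_names=["output"])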

@pangchao-git

pangchao-git commented Jan 27, 2021

@JiaoPaner Hello, following your approach I modified export.py in the yolov5 project and set model.model[-1].export = False. Using the C++ code above, I found that the confidence of the detected results is very low; on the classic dog.jpg the detections all come out with confidences between 0.4 and 0.8. What could cause this?

@JiaoPaner
Author

JiaoPaner commented Jan 28, 2021

@pangchao-git You can look at my repo https://github.com/JiaoPaner/detector-onnx-linux.git (based on yolov5 3.0); it works. BTW, before you convert your trained .pt model to ONNX, you must modify two files in yolov5:

export.py: model.model[-1].export = False

yolo.py

class Detect(nn.Module):
    stride = None  # strides computed during build
    export = True  # onnx export

    def forward(self, x):
        # x = x.copy()  # for profiling
        z = []  # inference output
        # self.training |= self.export
        if (self.training is True) & (self.export is True):
            self.training = False
...

@pangchao-git

@JiaoPaner,

I re-exported the ONNX model following your instructions and tried to run your project, but the post-processing that computes the confidence seems to have a logic problem, so the confidences in the output are not right. Could you explain what your post-processing is based on?

@JiaoPaner
Author

@pangchao-git My project is based on yolov5 3.0, and I fixed some bugs yesterday. The post-processing that computes the confidence follows the non_max_suppression method in yolov5/utils/general.py. My C++ skills are only average, so if you find any errors, please tell me.

@Erfun76

Erfun76 commented Apr 1, 2021

@JiaoPaner,

You have a bug in the preprocessing. If you look at detect.py in the Python code, the image passed to the model is not simply resized the way you do it in utils::createInputImage; you should pad the image with a black border instead. For example, I used the following code for a (480, 640) input:

cv::copyMakeBorder(image, dst, 0, 160, 0, 0, cv::BORDER_CONSTANT);

And then you should change some variables, like xGain and yGain in detector.cpp, to 1.
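A minimal Python sketch of that padding step (the helper name is just for illustration; like the copyMakeBorder call above it pads only on the right/bottom instead of centering the padding the way detect.py's letterbox does, and the 640 target size is a placeholder):

    import cv2

    def pad_to_square(image, size=640, pad_value=0):
        # Resize keeping aspect ratio, then pad right/bottom to a size x size canvas.
        h, w = image.shape[:2]
        scale = size / max(h, w)
        resized = cv2.resize(image, (int(round(w * scale)), int(round(h * scale))))
        pad_bottom = size - resized.shape[0]
        pad_right = size - resized.shape[1]
        padded = cv2.copyMakeBorder(resized, 0, pad_bottom, 0, pad_right,
                                    cv2.BORDER_CONSTANT, value=(pad_value, pad_value, pad_value))
        return padded, scale  # keep scale to map detections back to the original image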

@abdulw976

@JiaoPaner How do I interpret the following outputs?
name: classes
type: float32[1,3,80,80,85]

name: boxes
type: float32[1,3,40,40,85]

name: 444
type: float32[1,3,20,20,85]

@glenn-jocher
Member

@abdulw976 ONNX inference is very easy:

python detect.py --weights yolov5s.onnx

@sazzadhrz

sazzadhrz commented Jun 8, 2022

My ONNX session outputs (1, 25200, 11), but non_max_suppression outputs torch.Size([300, 6]) (6 classes). Why does it have 300 rows? How do I convert this to x, y coordinates?

@JonathanLehner were you able to solve the issue?
