
SINGLE-CLASS TRAINING EXAMPLE #102

Closed · glenn-jocher opened this issue Feb 20, 2019 · 74 comments
Labels: Stale, tutorial (Tutorial or example)

@glenn-jocher (Member) commented Feb 20, 2019

This guide explains how to train your own single-class dataset with YOLOv3.

Before You Start

  1. Update (Python >= 3.7, PyTorch >= 1.3, etc.) and install requirements.txt dependencies.
  2. Clone repo: git clone https://github.com/ultralytics/yolov3
  3. Download COCO: bash yolov3/data/get_coco2017.sh

Train On Custom Data

1. Label your data in Darknet format. After using a tool like Labelbox to label your images, you'll need to export your data to Darknet format. Your data should follow the example created by get_coco2017.sh, with images and labels in separate parallel folders, and one label file per image (if an image has no objects, no label file is required). The label file specifications are:

  • One row per object
  • Each row is class x_center y_center width height format.
  • Box coordinates must be in normalized xywh format (from 0 - 1). If your boxes are in pixels, divide x_center and width by image width, and y_center and height by image height (a conversion sketch follows this list).
  • Class numbers are zero-indexed (start from 0).
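For corner-format pixel boxes, the conversion can be sketched in a few lines of Python (function and argument names are illustrative, not part of the repo):

def to_darknet(x1, y1, x2, y2, img_w, img_h):
    # Convert a pixel-space corner box (x1, y1)-(x2, y2) into normalized
    # Darknet xywh: box center and size, each divided by the image dimensions.
    x_center = (x1 + x2) / 2 / img_w
    y_center = (y1 + y2) / 2 / img_h
    width = (x2 - x1) / img_w
    height = (y2 - y1) / img_h
    return x_center, y_center, width, height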

Each image's label file must be locatable by simply replacing /images/*.jpg with /labels/*.txt in its pathname. An example image and label pair would be:

../coco/images/train2017/000000109622.jpg  # image
../coco/labels/train2017/000000109622.txt  # label
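In code, that pathname rule is a simple substitution (a sketch, assuming .jpg images):

img_path = '../coco/images/train2017/000000109622.jpg'
label_path = img_path.replace('/images/', '/labels/').replace('.jpg', '.txt')
# -> '../coco/labels/train2017/000000109622.txt'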

An example label file with 4 persons (all class 0):
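A sketch of what such a file could contain (one row per person; coordinate values are illustrative, not taken from the actual file):

0 0.512500 0.400463 0.175000 0.592593
0 0.282813 0.445370 0.118750 0.486111
0 0.730469 0.426852 0.148438 0.537037
0 0.902344 0.412037 0.092188 0.453704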

2. Create train and test *.txt files. Here we create data/coco_1cls.txt, which contains 5 images with only persons from the coco 2014 trainval dataset. We will use this small dataset for both training and testing. Each row contains a path to an image, and remember one label must also exist in a corresponding /labels folder for each image that has targets.
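For illustration, coco_1cls.txt might look along these lines (paths are illustrative, not the actual five files):

../coco/images/train2014/COCO_train2014_000000000036.jpg
../coco/images/train2014/COCO_train2014_000000000049.jpg
../coco/images/train2014/COCO_train2014_000000000077.jpg
../coco/images/train2014/COCO_train2014_000000000078.jpg
../coco/images/train2014/COCO_train2014_000000000109.jpg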

3. Create new *.names file listing all of the names for the classes in our dataset. Here we use the existing data/coco.names file. Classes are zero indexed, so person is class 0.
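data/coco.names lists all 80 COCO class names, one per line, starting with person. A custom single-class *.names file would therefore contain just one line:

person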

4. Update data/coco.data lines 2 and 3 to point to our new text file for training and validation (in your own data you would likely want to use separate train and test sets). Also update line 1 to our new class count, if not 80, and lastly update line 4 to point to our new *.names file, if you created one. Save the modified file as data/coco_1cls.data.
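The resulting data/coco_1cls.data would look along these lines (a sketch; field values follow the *.data format shown elsewhere in this thread):

classes=1
train=data/coco_1cls.txt
valid=data/coco_1cls.txt
names=data/coco.names
backup=backup/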

5. Update *.cfg file (optional). Each YOLO layer has 255 outputs: 85 outputs per anchor [4 box coordinates + 1 object confidence + 80 class confidences], times 3 anchors. If you use fewer classes, reduce filters to filters=[4 + 1 + n] * 3, where n is your class count. This modification should be made to the layer preceding each of the 3 YOLO layers. Also modify classes=80 to classes=n in each YOLO layer, where n is your class count (for single class training, n=1).
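For n=1 the arithmetic gives filters=(4 + 1 + 1) * 3 = 18, so each of the 3 affected cfg blocks would change along these lines (excerpt only; surrounding fields omitted):

[convolutional]
# filters was 255 for 80 classes; (4 + 1 + 1) * 3 = 18 for one class
filters=18

[yolo]
# classes was 80
classes=1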

6. (OPTIONAL) Update hyperparameters such as LR, LR scheduler, optimizer, augmentation settings, multi_scale settings, etc. in train.py for your particular task. We recommend you start with all-default settings before updating anything.

7. Train. Run python3 train.py --data data/coco_1cls.data to train using your custom data. If you created a custom *.cfg file as well, specify it using --cfg cfg/my_new_file.cfg.

Visualize Results

Run from utils import utils; utils.plot_results() to see your training losses and performance metrics vs epoch. If you don't see acceptable performance, try hyperparameter tuning and re-training. Multiple results.txt files are overlaid automatically to compare performance.

Here we see results from training on coco_1cls.data using the default yolov3-spp.cfg and also a single-class yolov3-spp-1cls.cfg, available in the data/ and cfg/ folders.


Evaluate your trained model: copy COCO_val2014_000000001464.jpg to the data/samples folder and run python3 detect.py --weights weights/last.pt

Reproduce Our Results

To reproduce this tutorial, simply run the following code. This trains each of the tutorial configurations, saves each results*.txt file separately, and plots them together as results.png. The whole run takes less than 30 minutes on a 2080 Ti.

git clone https://github.com/ultralytics/yolov3
python3 -c "from yolov3.utils.google_utils import gdrive_download; gdrive_download('1h0Id-7GUyuAmyc9Pwo2c3IZ17uExPvOA','coco2017demos.zip')"  # datasets (20 Mb)
cd yolov3
python3 train.py --data coco64.data --batch 16 --accum 1 --epochs 300 --nosave --cache --weights '' --name from_scratch
python3 train.py --data coco64.data --batch 16 --accum 1 --epochs 300 --nosave --cache --weights yolov3-spp-ultralytics.pt --name from_yolov3-spp-ultralytics
python3 train.py --data coco64.data --batch 16 --accum 1 --epochs 300 --nosave --cache --weights darknet53.conv.74 --name from_darknet53.conv.74
python3 train.py --data coco1.data --batch 1 --accum 1 --epochs 300 --nosave --cache --weights darknet53.conv.74 --name 1img
python3 train.py --data coco1cls.data --batch 16 --accum 1 --epochs 300 --nosave --cache --weights darknet53.conv.74 --cfg yolov3-spp-1cls.cfg --name 1cls

Reproduce Our Environment

To access an up-to-date working environment (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled), consider one of the options linked from the repository, such as a Google Colab notebook, a GCP Deep Learning VM, or the Ultralytics Docker image.

@glenn-jocher self-assigned this Feb 20, 2019
@tamasbalassa commented:

This is a great thing!

Scrolling through the code I was able to figure these things out on my own, but I would suggest also explaining or showing what modifications are required in the yolov3.cfg file (the number of classes, and the filter counts on the last layers before the YOLO layers).

@glenn-jocher (Member Author) commented:

@tamasbalassa yes good idea. All done!

@tamasbalassa commented:

To be complete, don't forget to add the modification of the class numbers (classes from 80 to 1). I know it's very obvious, but it is still missing from the guide. (: 💯

@graftedlife commented Feb 22, 2019

@glenn-jocher I have a question regarding your normalized labels. I have no problem with w and h, but how did you get the normalized x and y?

Take as an example an object from one image (image_id: 574200; COCO_val2014_000000574200.txt; category_id: 13) in the COCO validation set:

Relevant information from instances_val2014.json:

{"license": 6, "file_name": "COCO_val2014_000000574200.jpg", "coco_url": "http://mscoco.org/images/574200", "height": 427, "width": 640, "date_captured": "2013-11-16 18:56:50", "flickr_url": "http://farm5.staticflickr.com/4018/4527879508_8f69659291_z.jpg", "id": 574200}

{"segmentation": [[239.27, 139.42, 236.98, 140.04, 236.35, 159.03, 237.19, 171.97, 241.15, 186.58, 243.24, 186.37, 244.91, 165.5, 242.61, 149.02, 240.11, 140.04]], "area": 277.9082500000005, "iscrowd": 0, "image_id": 574200, "bbox": [236.35, 139.42, 8.56, 47.16], "category_id": 13, "id": 1388638}

I assume that the width/height of this image are respectively 640 and 427, and the raw COCO xywh is [236.35, 139.42, 8.56, 47.16].

The corresponding normalized coordinates in your labels, as I see, are:

11 0.375984 0.381733 0.013375 0.110445

While 0.013375=8.56/640 and 0.110445=47.16/427, I don't know how the normalized x and y, i.e., 0.375984 and 0.381733 were obtained. Could you please elaborate on this point?

Many thanks!

@glenn-jocher (Member Author) commented Feb 22, 2019

@graftedlife according to http://cocodataset.org/#format-data, the COCO "bbox" is already in xywh, so to transform this to darknet format we should just need to divide by the image width and height.

If we divide x and w by the image width 640, and y and h by the image height 427 we get:

[0.3692, 0.3265, 0.0133, 0.1104] = 
[236.35 / 640, 139.42 / 427, 8.56 / 640, 47.16 / 427]

This does not match our darknet labels, so the COCO xy coordinates must represent a corner point of the bounding box rather than its center. If we correct for this offset the results match, and in the COCO link above you can see it says "box coordinates are measured from the top left image corner", i.e. x and y are the top-left corner of the box.

[0.3759, 0.3817, 0.0133, 0.1104] = 
[(236.35 + 8.56/2) / 640, (139.42 + 47.16/2) / 427, 8.56 / 640, 47.16 / 427]
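Expressed as Python, the correction described above is (using the numbers from this example):

img_w, img_h = 640, 427
x, y, w, h = 236.35, 139.42, 8.56, 47.16  # COCO bbox: corner point + size

# shift the corner to the box center, then normalize by image size
x_center = (x + w / 2) / img_w  # 0.3760
y_center = (y + h / 2) / img_h  # 0.3817
w_norm = w / img_w              # 0.0134
h_norm = h / img_h              # 0.1104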

@cy0616 commented Feb 22, 2019

@glenn-jocher
I have trained on my own dataset, which contains 1000 images of 2 classes, using a 1080 Ti with batch = 16. Training one epoch takes approximately 30 s, but at the end of each epoch the validation code runs very slowly. I found that NMS takes 46 s per batch (16 images), and computing the average precision costs 0.8 s per sample. During eval the GPU utilization is 16%, and only one CPU core is used.

@glenn-jocher (Member Author) commented Feb 22, 2019

@cy0616 can you run a line profiler to find the slow areas? Yes, NMS is very slow, it is not run during training, only at inference time.

COCO validation takes about 2-3 minutes for 5000 images on a P100. If you see similar time ranges on COCO training then the problem is specific to your dataset.

If you find ways to speed up the code, PRs are welcome!

@tamasbalassa commented:

@graftedlife I've just read your question this morning, so I quickly implemented some code to test both COCO- and KITTI-style data. It is not for converting, only for checking whether the bounding box coordinates are correct.

import os
import numpy as np
import cv2
import matplotlib.pyplot as plt

###############################################################################################################
# CONST
img_ext = ".jpg"
test_kitti = True
test_coco = True
###############################################################################################################
# PATHS
kitti_dirpath = "path/to/your/directory/"
kitti_labels_dirname = "name_of_your_label_dir_in_kitti_dirpath"
kitti_images_dirname = "name_of_your_image_dir_in_kitti_dirpath"

coco_dirpath = "path/to/your/directory/"
coco_labels_dirname = "name_of_your_label_dir_in_coco_dirpath"
coco_images_dirname = "name_of_your_image_dir_in_coco_dirpath"
###############################################################################################################

dlist = os.listdir(os.path.join(kitti_dirpath, kitti_labels_dirname))
for dl in dlist:
    img_name = dl.split('.')

    if test_kitti:
    ###########################################################################################################
    # KITTI PART
    ###########################################################################################################

        kitti_label_path = os.path.join(kitti_dirpath, kitti_labels_dirname, dl)

        with open(kitti_label_path) as f:
            kitti_content = f.readlines()
        kitti_content = [x.strip('\n') for x in kitti_content]

        kitti_image_path = os.path.join(kitti_dirpath, kitti_images_dirname, img_name[0] + img_ext)
        kitti_img = cv2.imread(kitti_image_path)

        for word in kitti_content:
            params = word.split()
            x1 = int(params[4])
            x2 = int(params[6])
            y1 = int(params[5])
            y2 = int(params[7])

            cv2.rectangle(kitti_img, (x1, y1), (x2, y2), (0, 255, 0), 3)

        # plt.imshow(kitti_img)
        # plt.show()

    if test_coco:
    ###########################################################################################################
    # coco PART
    ###########################################################################################################

        coco_label_path = os.path.join(coco_dirpath, coco_labels_dirname, dl)

        with open(coco_label_path) as f:
            coco_content = f.readlines()
        coco_content = [x.strip('\n') for x in coco_content]

        coco_image_path = os.path.join(coco_dirpath, coco_images_dirname, img_name[0] + img_ext)
        coco_img = cv2.imread(coco_image_path)
        coco_img_shape = coco_img.shape

        for word in coco_content:
            params = word.split()

            w = float(params[3]) * coco_img_shape[1]
            h = float(params[4]) * coco_img_shape[0]
            x_center = float(params[1]) * coco_img_shape[1]
            y_center = float(params[2]) * coco_img_shape[0]
            x1 = int(x_center - w/2)
            if x1 < 0: x1 = 0
            x2 = int(x_center + w/2)
            y1 = int(y_center - h/2)  # use h here, not w
            if y1 < 0: y1 = 0
            y2 = int(y_center + h/2)  # use h here, not w

            cv2.rectangle(coco_img, (x1, y1), (x2, y2), (0, 255, 0), 3)

        # plt.imshow(coco_img)
        # plt.show()

PS: it's just a quick solution, not extensively tested. The aim was for it to be easy for everyone to understand.

@Jason-cs18 commented:

Hi, I have a question about the validation set. In my opinion we should use a validation set to judge the performance of the model during training, but as far as I can see only the training set is used. I suggest adding validation-set evaluation to train.py.

@glenn-jocher (Member Author) commented Feb 26, 2019

@JacksonLY the validation set is already used during training to compute mAP after each epoch. This is done by calling test.py to evaluate latest.pt on the validation set pointed to by coco.data: valid=../coco/5k.txt

yolov3/train.py

Lines 169 to 171 in eb6a4b5

# Calculate mAP
with torch.no_grad():
    mAP, R, P = test.test(cfg, data_cfg, weights=latest, batch_size=batch_size, img_size=img_size)

@Jason-cs18 commented Feb 27, 2019


Thanks for your reply, I understand this flow now. I found that many people need to train a customized model starting from the COCO pretrained model, so I added some code to train.py (lines ~70-90) as follows:

    elif resume and customized:
        # load the pretrained model (yolov3 trained on COCO)
        # (requires: torch, copy, numpy as np, and Darknet from models)
        model_pretrain = Darknet('cfg/yolov3_raw.cfg', 416)
        checkpoint = torch.load(latest, map_location='cpu')
        model_pretrain.load_state_dict(checkpoint['model'])
        # load customized model (from yolov3.cfg)
        new_model = Darknet('cfg/yolov3.cfg', 416)

        params1 = new_model.state_dict()
        params2 = model_pretrain.state_dict()

        # deep-copy both state dicts so the originals stay untouched
        dict_params1 = copy.deepcopy(dict(params1))
        dict_params2 = copy.deepcopy(dict(params2))

        for name, param in dict_params2.items():
            # copy every pretrained tensor except the detection layers (255 filters = 80-class heads)
            if name in dict_params1 and len(np.array(param.size())) != 0:
                if param.shape[0] != 255:
                    dict_params1[name] = dict_params2[name]
        # load the pretrained parameters into the customized model
        new_model.load_state_dict(dict_params1)

I have tested this code by retraining on a customized dataset and hope it can be helpful.

@cuixing158 commented Mar 12, 2019

When training this single-class model, my question is: is a single-class model faster than the original author's multi-class yolov3-tiny model?
I found my trained model is a little faster (~13 fps) than the original yolov3-tiny model.
Environment: Win10 + OpenCV 4.0 DNN + Visual Studio 2015

@yang-jin-hai (Contributor) commented Mar 20, 2019

@glenn-jocher Hi bro, thanks for your awesome work.
I am training on my dataset with your repo according to this guide.
While training, the output value 'cls' is always 0. Is that normal?

@glenn-jocher (Member Author) commented Mar 20, 2019

@WannaSeaU yes this is expected. If there is only a single class, how can the network guess the wrong class? Imagine if I asked you to guess a random number between zero and zero... you'd probably be correct every time as well, no?

@yang-jin-hai (Contributor) commented:

@glenn-jocher Thank you for replying to such a basic question.

@glenn-jocher added the tutorial label Mar 29, 2019
@XiaoJiNu commented:

@glenn-jocher Hi, if I train one class and a picture doesn't have any object in it, do I not need to create a label file for this picture, or should I create a label file which contains nothing?

@glenn-jocher (Member Author) commented:

@XiaoJiNu if there are no objects in your training image you don't need to supply a label file. Empty may work as well, try it out.

@XiaoJiNu commented:

Thank you @glenn-jocher, I will try it.

@Jriandono commented Apr 30, 2019

@glenn-jocher Hi, I was following your tutorial and it works great. Thanks for the tutorial!

I have several questions regarding training with our own dataset :

  1. Create train and test *.txt files. Here we create data/coco_1cls.txt, which contains 5 images with only persons from the coco 2014 trainval dataset. We will use this small dataset for both training and testing. Each row contains a path to an image, and remember one label must also exist in a corresponding /labels folder for each image that has targets

  2. Does this mean I can use any random picture, as long as it is related to the class?

And if I would like to train 5 classes, does this mean I'm going to have
data/coco_1cls.txt
data/coco_2cls.txt
data/coco_3cls.txt
data/coco_4cls.txt
data/coco_5cls.txt

where each of them has pics for training purposes?

@glenn-jocher (Member Author) commented Apr 30, 2019

@Jriandono you're welcome! Regarding your question, no, this is not correct. You only use one *.txt file for your training set and one *.txt file for your test set (they can be the same, as in the demo). This is the same no matter how many classes you have. For multiple-class training on custom data please see https://docs.ultralytics.com/yolov5/tutorials/train_custom_data

@sanazss commented Jul 13, 2019

Got another question. You mentioned that you created a txt file from the first 10 images of COCO and used it for both training and testing. I only have one txt file for my images; should I duplicate it as the test file as well? I didn't understand what you meant by that. Did you split the data first and then make txt files for each split, or do you use one txt file for both testing and training?

@glenn-jocher (Member Author) commented Jul 13, 2019

@sanazss your question about how to split your data between train and test sets is a basic machine learning question, not specific to this repo; I suggest you simply google it.

For single channel images you can modify the *.cfg file from channels=3 to channels=1.

channels=3

@sanazss commented Jul 30, 2019

Hi Glenn, regarding normalizing the x and y center coordinates: should the method you explained to @graftedlife be applied to custom data as well? Should the center coordinates first be computed from the corner using the width and height, and then be divided by the image size? Or is it different for custom data, and this only applies to the COCO dataset?

@glenn-jocher (Member Author) commented:

@sanazss all data is handled identically by the repo. All data must be in the Darknet format. The COCO data you use for the tutorials is provided in Darknet format.

@sip-ops commented Sep 21, 2019

Hi guys,

I just need some help. I am training on a single class with my custom data. I followed the example exactly, but am experiencing the following:

Traceback (most recent call last):
  File "train.py", line 420, in <module>
    train()  # train normally
  File "train.py", line 269, in train
    loss, loss_items = compute_loss(pred, targets, model)
  File "/Users/user/Desktop/user/yolov3/utils/utils.py", line 320, in compute_loss
    tcls, tbox, indices, anchor_vec = build_targets(model, targets)
  File "/Users/user/Desktop/user/yolov3/utils/utils.py", line 438, in build_targets
    assert c.max() <= model.nc, 'Target classes exceed model classes'
AssertionError: Target classes exceed model classes

I have changed classes to 1 in the *.data file and in yolov3.cfg.

@glenn-jocher (Member Author) commented:

@sip-ops your data has class numbers that exceed 0. If you are training a single class model then all the classes must be 0.
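A quick way to audit your labels for this (a sketch; assumes your label files sit in a labels/ directory):

import glob

# flag any label file containing a class index other than 0
for path in glob.glob('labels/*.txt'):
    with open(path) as f:
        classes = {line.split()[0] for line in f if line.strip()}
    extra = classes - {'0'}
    if extra:
        print(path, 'has non-zero classes:', extra)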

@sip-ops commented Sep 21, 2019

@glenn-jocher you are absolutely correct, I had to change it in all the labels. Thanks, it works now.

@glenn-jocher (Member Author) commented:

@yangxu351 I don't recall exactly, but the COCO breakdown is 120k training images and 5k validation images, so I'd use similar proportions in xview (i.e. 95%-5%).

@Andy7775 commented Mar 5, 2020

Thank You for this great guide! I seem to have a problem, I think with paths. What am I doing wrong? I work in Jupyter with PyTorch on Windows 10. Thnx a lot in advance for taking a look!

RuntimeError                              Traceback (most recent call last)
<ipython-input> in <module>
      6 optimizer.zero_grad()
      7
----> 8 loss = model(imgs, targets)
      9
     10 loss.backward()
...
RuntimeError: cannot perform reduction function max on tensor with no elements because the operation does not have an identity

coco.data:
classes=1
train=data/artifacts/train.txt
valid=data/artifacts/val.txt
names=config/coco.names
backup=backup/

coco.names:
NUMMER

yolov3.cfg: (changes)
batch=2
3 x classes=1 / filters=18

data has the structure data/artifacts/images (dir), labels (dir), train.txt, val.txt
any label file is a single line like "0 0.347166 0.936526 0.338057 0.113586"
and an example for a line in train.txt or test.txt is
"C:/Users/andre/Documents/Program/CV_Train/custom/data/artifacts/images/pic1.JPG"

the params are:
epochs=20;image_folder="data/artifacts/images";batch_size=2;
model_config_path="config/yolov3.cfg";data_config_path="config/coco.data";
weights_path="config/yolov3.weights";class_path="config/coco.names";
conf_thres=0.8;nms_thres=0.4;n_cpu=0;img_size=416;checkpoint_interval=1;
checkpoint_dir="checkpoints";use_cuda=True;

The dataloader delivers with (if tested just before the training loop)
a,b,c=next(iter(dataloader))
print(a[0],b[0],c[0]):
C:/Users/andre/Documents/Program/CV_Train/custom/data/artifacts/images/pic1.JPG tensor([[[0.5020, 0.5020, 0.5020, ..., 0.5020, 0.5020, 0.5020],
[0.5020, 0.5020, 0.5020, ..., 0.5020, 0.5020, 0.5020],
[0.5020, 0.5020, 0.5020, ..., 0.5020, 0.5020, 0.5020],
...,
[0.5020, 0.5020, 0.5020, ..., 0.5020, 0.5020, 0.5020],
[0.5020, 0.5020, 0.5020, ..., 0.5020, 0.5020, 0.5020],
[0.5020, 0.5020, 0.5020, ..., 0.5020, 0.5020, 0.5020]]]) tensor([[0., 0., 0., 0., 0.]], dtype=torch.float64)

Thnx a lot for any suggestion!

@glenn-jocher (Member Author) commented Mar 5, 2020

@Andy7775 you might want to start from a working environment, like a Colab instance, and inspect the paths there for the tutorial *.data files. If you are doing single class, you can use yolov3-spp-1cls.cfg.
Google Colab Notebook

Note I've updated the tutorial to reflect the latest code for reproducing our results. Run those lines and start from there.

@Andy7775 commented Mar 7, 2020

Thnx, I got it.
My problem was a local class not loaded, so the dataloader stayed empty. Additionally the labels had a wrong path.
The tutorial helped a lot with its different examples for bug hunting, thank You!
And it's great to get an answer so fast, thnx again!
BR Andy

@glenn-jocher (Member Author) commented:

@Andy7775 great!

@claire-0702 commented:

Hi, I have some problems with my training results. When I set epochs=1000, I found the mAP, F1, and precision were all zero. It confuses me. Could you give me some advice? Thank you very much!

@glenn-jocher (Member Author) commented:

@claire-0702 you likely have no labels in your test set. Start from the tutorial.

@claire-0702 commented:

@glenn-jocher Hi, I have changed my dataset's train.txt and test.txt. The images and the labels are set up the same as in your tutorial (screenshots of my dataset, the train.txt and test.txt files, and the training result omitted). When I started to train on my dataset, the results were still all zero. What can I do to make it better? Thank you very much!

@glenn-jocher (Member Author) commented:

@claire-0702 upload your train_batch0.png and test_batch0.png images here please.

@claire-0702 commented:

@glenn-jocher Here are my train_batch0.png and test_batch0.png images (screenshots omitted).

@s4365g commented Apr 16, 2020

@glenn-jocher I have the same problem as @claire-0702: the values of P, R, GIoU loss, and cls_loss are always zero (training snapshot and train_batch0.png / test_batch0.png screenshots omitted).

I followed your tutorial https://github.com/ultralytics/yolov3/wiki/Example:-Train-Single-Class and ran the command below to start the training process. I then used the weights at epoch 100 to test my data, but there are no detections in the output images. How can I fix it?
python3.7 train.py --data data/track_1cls.data --cfg cfg/yolov3-spp-1cls.cfg --batch-size 12

@glenn-jocher (Member Author) commented Apr 16, 2020

@AndyTaiwan if GIoU loss is zero then no anchors are above the iou threshold. You can reduce the iou threshold here:

'iou_t': 0.225, # iou training threshold

You should also replace the default anchors in the cfg with kmeans anchors for your specific dataset. @claire-0702 this may help you as well:

yolov3/utils/utils.py

Lines 691 to 696 in 9ea8562

def kmean_anchors(path='../coco/train2017.txt', n=12, img_size=(320, 1024), thr=0.10, gen=1000):
    # Creates kmeans anchors for use in *.cfg files: from utils.utils import *; _ = kmean_anchors()
    # n: number of anchors
    # img_size: (min, max) image size used for multi-scale training (can be same values)
    # thr: IoU threshold hyperparameter used for training (0.0 - 1.0)
    # gen: generations to evolve anchors using genetic algorithm

Also, since the repo is updated often, git pull to get the latest updates.

@wjtan99 commented Jul 16, 2020

I have the same problem. After adding a lot more data the GIoU loss is now non-zero, but the cls loss is still all 0. What can cause this issue?

@wjtan99 commented Jul 16, 2020

From the visual results you posted above, I just realized that the cls loss, P, R, and mAP are all 0 whenever a single-class cfg file is used. Is that what you intended?

@glenn-jocher (Member Author) commented Jul 16, 2020

Ultralytics has open-sourced YOLOv5 at https://github.com/ultralytics/yolov5, featuring faster, lighter and more accurate object detection. YOLOv5 is recommended for all new projects.



** GPU Speed measures end-to-end time per image averaged over 5000 COCO val2017 images using a V100 GPU with batch size 32, and includes image preprocessing, PyTorch FP16 inference, postprocessing and NMS. EfficientDet data from [google/automl](https://github.com/google/automl) at batch size 8.
  • August 13, 2020: v3.0 release: nn.Hardswish() activations, data autodownload, native AMP.
  • July 23, 2020: v2.0 release: improved model definition, training and mAP.
  • June 22, 2020: PANet updates: new heads, reduced parameters, improved speed and mAP 364fcfd.
  • June 19, 2020: FP16 as new default for smaller checkpoints and faster inference d4c6674.
  • June 9, 2020: CSP updates: improved speed, size, and accuracy (credit to @WongKinYiu for CSP).
  • May 27, 2020: Public release. YOLOv5 models are SOTA among all known YOLO implementations.
  • April 1, 2020: Start development of future compound-scaled YOLOv3/YOLOv4-based PyTorch models.

Pretrained Checkpoints

Model          APval  APtest  AP50  SpeedGPU  FPSGPU  params  FLOPS
YOLOv5s        37.0   37.0    56.2  2.4ms     416     7.5M    13.2B
YOLOv5m        44.3   44.3    63.2  3.4ms     294     21.8M   39.4B
YOLOv5l        47.7   47.7    66.5  4.4ms     227     47.8M   88.1B
YOLOv5x        49.2   49.2    67.7  6.9ms     145     89.0M   166.4B
YOLOv5x + TTA  50.8   50.8    68.9  25.5ms    39      89.0M   354.3B
YOLOv3-SPP     45.6   45.5    65.2  4.5ms     222     63.0M   118.0B

** APtest denotes COCO test-dev2017 server results, all other AP results in the table denote val2017 accuracy.
** All AP numbers are for single-model single-scale without ensemble or test-time augmentation. Reproduce by python test.py --data coco.yaml --img 640 --conf 0.001
** SpeedGPU measures end-to-end time per image averaged over 5000 COCO val2017 images using a GCP n1-standard-16 instance with one V100 GPU, and includes image preprocessing, PyTorch FP16 image inference at --batch-size 32 --img-size 640, postprocessing and NMS. Average NMS time included in this chart is 1-2ms/img. Reproduce by python test.py --data coco.yaml --img 640 --conf 0.1
** All checkpoints are trained to 300 epochs with default settings and hyperparameters (no autoaugmentation).
** Test Time Augmentation (TTA) runs at 3 image sizes. Reproduce by python test.py --data coco.yaml --img 832 --augment

For more information and to get started with YOLOv5 please visit https://github.com/ultralytics/yolov5. Thank you!

@bedada2 commented Aug 12, 2020

Hi guys, I have a problem with single-class detection with YOLOv3. There is no problem while training, it works well, but while detecting objects from video I get a problem when class_number is 1. If I change class_number to 2 or another number and update obj.names with two or more object names, it works, but for only one object it gives me the error below. Any help, please?

Traceback (most recent call last):
  File "video.py", line 162, in <module>
    list(map(lambda x: write(x, frame), output))
  File "video.py", line 162, in <lambda>
    list(map(lambda x: write(x, frame), output))
  File "video.py", line 81, in write
    label = "{0}".format(classes[cls])
IndexError: list index out of range

@goldwater668 commented:

@glenn-jocher

  1. In your tutorial's coco.data, how is the training path the same as the test path? Is that wrong?

  2. The P, R, mAP, and F1 measured during training are inconsistent with the P, R, mAP, and F1 from testing, with the conf_thres parameter at 0.001.

@bedada2 commented Aug 17, 2020

@glenn-jocher

  1. In coco.data the train and validation paths look like this: train = data/train.txt, valid = data/valid.txt
  2. I used the same yolov3.cfg for both training and testing, except for commenting out the batch and subdivisions settings between training and testing.
     Again, it works for any class_number except class_number=1. I trained the weights with AlexeyAB/darknet on Colab.

@Mayur2992 commented Aug 26, 2020


Hey @glenn-jocher,
Currently I am working on person detection! Right now I am using the pretrained YOLOv3 model. Would I get better accuracy if I trained a custom model on the person class separated out of the MS COCO dataset, which contains 65k person-only images?
Can you suggest any hyperparameters to change before training? I am getting mAP@0.5 of 0.339 and GIoU of 2.91!

@bedada2 commented Sep 22, 2020

Thank you guys!

@zunairaR commented Oct 6, 2020

Why do I need to download the COCO dataset when I am training on my own custom dataset?
And how can I get the .shapes file for my dataset?
I can't work out how to modify get_coco2017.sh for my own dataset.

@github-actions bot commented Nov 6, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@soheilmajidi commented:

I have a question: I have only one class. How should I change the configuration? Is a sigmoid or another loss function needed?

@glenn-jocher (Member Author) commented Apr 8, 2022

@soheilmajidi 👋 Hello! Thanks for asking about YOLOv3 🚀 dataset formatting. No changes are required to train single class.

To train correctly your data must be in YOLOv5 format. Please see our Train Custom Data tutorial for full documentation on dataset setup and all steps required to start training your first model. A few excerpts from the tutorial:

1.1 Create dataset.yaml

COCO128 is an example small tutorial dataset composed of the first 128 images in COCO train2017. These same 128 images are used for both training and validation to verify our training pipeline is capable of overfitting. data/coco128.yaml, shown below, is the dataset config file that defines 1) the dataset root directory path and relative paths to train / val / test image directories (or *.txt files with image paths), 2) the number of classes nc and 3) a list of class names:

# Train/val/test sets as 1) dir: path/to/imgs, 2) file: path/to/imgs.txt, or 3) list: [path/to/imgs1, path/to/imgs2, ..]
path: ../datasets/coco128  # dataset root dir
train: images/train2017  # train images (relative to 'path') 128 images
val: images/train2017  # val images (relative to 'path') 128 images
test:  # test images (optional)

# Classes
nc: 80  # number of classes
names: [ 'person', 'bicycle', 'car', 'motorcycle', 'airplane', 'bus', 'train', 'truck', 'boat', 'traffic light',
         'fire hydrant', 'stop sign', 'parking meter', 'bench', 'bird', 'cat', 'dog', 'horse', 'sheep', 'cow',
         'elephant', 'bear', 'zebra', 'giraffe', 'backpack', 'umbrella', 'handbag', 'tie', 'suitcase', 'frisbee',
         'skis', 'snowboard', 'sports ball', 'kite', 'baseball bat', 'baseball glove', 'skateboard', 'surfboard',
         'tennis racket', 'bottle', 'wine glass', 'cup', 'fork', 'knife', 'spoon', 'bowl', 'banana', 'apple',
         'sandwich', 'orange', 'broccoli', 'carrot', 'hot dog', 'pizza', 'donut', 'cake', 'chair', 'couch',
         'potted plant', 'bed', 'dining table', 'toilet', 'tv', 'laptop', 'mouse', 'remote', 'keyboard', 'cell phone',
         'microwave', 'oven', 'toaster', 'sink', 'refrigerator', 'book', 'clock', 'vase', 'scissors', 'teddy bear',
         'hair drier', 'toothbrush' ]  # class names

1.2 Create Labels

After using a tool like Roboflow Annotate to label your images, export your labels to YOLO format, with one *.txt file per image (if no objects in image, no *.txt file is required). The *.txt file specifications are:

  • One row per object
  • Each row is class x_center y_center width height format.
  • Box coordinates must be in normalized xywh format (from 0 - 1). If your boxes are in pixels, divide x_center and width by image width, and y_center and height by image height.
  • Class numbers are zero-indexed (start from 0).


The label file corresponding to the above image contains 2 persons (class 0) and a tie (class 27):
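For instance, rows along these lines (coordinate values illustrative):

0 0.481719 0.634028 0.690625 0.713278
0 0.736797 0.522465 0.315234 0.925324
27 0.364844 0.795833 0.140625 0.408333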

1.3 Organize Directories

Organize your train and val images and labels according to the example below. YOLOv5 assumes /coco128 is inside a /datasets directory next to the /yolov5 directory. YOLOv5 locates labels automatically for each image by replacing the last instance of /images/ in each image path with /labels/. For example:

../datasets/coco128/images/im0.jpg  # image
../datasets/coco128/labels/im0.txt  # label
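In code, that last-instance substitution could be sketched as:

img_path = '../datasets/coco128/images/im0.jpg'
# replace the final '/images/' with '/labels/' and swap the extension for .txt
label_path = '/labels/'.join(img_path.rsplit('/images/', 1)).rsplit('.', 1)[0] + '.txt'
# -> '../datasets/coco128/labels/im0.txt'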

Good luck 🍀 and let us know if you have any other questions!

@koreanmarine commented:

  1. Can photos with no objects be included in the training set?

  2. If it's possible to include photos with no objects in the training, what should be done about the bounding box files for these photos?

@glenn-jocher (Member Author) commented:

@koreanmarine hello! Great questions regarding training with YOLOv3:

  1. Can photos with no objects be included in the training set?
    Yes, photos with no objects can be included in the training set. They are important as they teach the model what background looks like without any objects of interest. This can help reduce false positives.

  2. If it's possible to include photos with no objects in the training, what should be done about the bounding box files for these photos?
    For images with no objects, you simply do not need to create a corresponding bounding box .txt file. The training script should be able to handle images without associated label files, assuming there are no objects to detect in those images.

Remember to maintain the same directory structure for images and labels, where the training script expects to find a .txt file for each image in the /labels directory. If an image has no objects, the absence of a .txt file in the corresponding location is treated as an image with no labels.

Good luck with your training! 🚀 If you have any more questions, feel free to ask.
