the speed of training my custom data #1196

alicera · 2020-10-23T02:25:23Z

❔Question

I try to follow the #475 and https://github.com/ultralytics/yolov5/wiki/Train-Custom-Data
I use the command
python -m torch.distributed.launch --nproc_per_node 4 train.py --batch-size 128 --data coco.yaml --cfg yolov5s.yaml --weights ''

Is it normal speed? or some problem
The log is
Epoch gpu_mem box obj cls total targets img_size
6/299 7.2G 0.08226 0.2537 0 0.3359 2253 640: 100%|█████████████████████| 389/389 [58:10<00:00, 4.97s/it]
Class Images Targets P R mAP@.5 mAP@.5:.95: 100%|███████| 389/389 [1:01:00<00:00, 3.66s/it]
all 4.97e+04 5.03e+06 0.339 0.169 0.106 0.0322

 Epoch   gpu_mem       box       obj       cls     total   targets  img_size
 7/299      7.2G   0.08159    0.2491         0    0.3307       884       640: 100%|█████████████████████| 389/389 [56:26<00:00,  7.13s/it]
           Class      Images     Targets           P           R      mAP@.5  mAP@.5:.95: 100%|█████████| 389/389 [47:50<00:00,  2.80s/it]
             all    4.97e+04    5.03e+06       0.296       0.179       0.112      0.0349

 Epoch   gpu_mem       box       obj       cls     total   targets  img_size
 8/299      7.2G   0.08105    0.2429         0     0.324      2672       640: 100%|█████████████████████| 389/389 [56:50<00:00,  6.04s/it]
           Class      Images     Targets           P           R      mAP@.5  mAP@.5:.95: 100%|███████| 389/389 [1:38:33<00:00,  3.02s/it]
             all    4.97e+04    5.03e+06       0.348       0.173       0.119      0.0372

 Epoch   gpu_mem       box       obj       cls     total   targets  img_size
 9/299      7.2G   0.08098    0.2434         0    0.3244       405       640: 100%|█████████████████████| 389/389 [57:47<00:00,  6.28s/it]
           Class      Images     Targets           P           R      mAP@.5  mAP@.5:.95: 100%|█████████| 389/389 [27:41<00:00,  2.88s/it]
             all    4.97e+04    5.03e+06       0.328       0.159      0.0994      0.0317

 Epoch   gpu_mem       box       obj       cls     total   targets  img_size
10/299      7.2G   0.08078    0.2418         0    0.3226      3567       640: 100%|█████████████████████| 389/389 [56:52<00:00,  5.90s/it]
           Class      Images     Targets           P           R      mAP@.5  mAP@.5:.95: 100%|███████| 389/389 [1:02:21<00:00,  3.00s/it]
             all    4.97e+04    5.03e+06       0.358       0.165       0.108       0.035

 Epoch   gpu_mem       box       obj       cls     total   targets  img_size
11/299      7.2G   0.08084    0.2403         0    0.3212      1459       640: 100%|█████████████████████| 389/389 [52:53<00:00,  5.02s/it]
           Class      Images     Targets           P           R      mAP@.5  mAP@.5:.95: 100%|█████████| 389/389 [26:24<00:00,  3.12s/it]

The text was updated successfully, but these errors were encountered:

glenn-jocher · 2020-10-24T00:21:02Z

@alicera with no details on your hardware there can be no answer to your question.

glenn-jocher · 2020-10-24T00:23:07Z

@alicera also I see you are using a custom dataset despite your yaml being called coco.yaml. So you have an unknown dataset with unknown hardware asking people if your training time is correct.

alicera · 2020-10-26T02:27:54Z

It is a problem about dataset.
Because I use the 30000 images that I prepare to train,validation and the speed is ok.
But I use the 50000 images that I prepare to train,validation and the speed is very slow than COCO 60000up images

alicera · 2020-10-26T11:12:42Z

python test.py --weights yolov5x.pt --data coco.yaml --img 640

Output:

           Class      Images     Targets           P           R      mAP@.5  mAP@.5:.95: 
             all    2.24e+04    4.81e+06       0.373       0.207       0.148      0.0486

Speed: 9.2/8.6/17.8 ms inference/NMS/total per 640x640 image at batch-size 32

Do you know the reason?
https://docs.ultralytics.com/yolov5/tutorials/test_time_augmentation

glenn-jocher · 2020-10-26T13:53:52Z

@alicera pycocotools mAP only runs on the COCO dataset.

dongjuns · 2020-10-31T00:37:54Z

Hi, @alicera
Here is an example in my case.

in road.yaml

# train and val data as 1) directory: path/images/, 2) file: path/images.txt, or 3) list: [path1/images/, path2/images/]
train: ../road/CZ_train,txt
val: ../road/CZ_validation.txt

# number of classes
nc: 4

# class names
names: ['D00', 'D10', 'D20', 'D40']

and just changed only 'nc' in yolov5s.model

# parameters
nc: 4  # number of classes

and training code would be,

python train.py --data data/road.yaml --cfg models/yolov5s.yaml --weights yolov5s.pt --batch-size 16

glenn-jocher · 2020-10-31T11:35:09Z

@dongjuns yes this is good advice! We've updated the training commands to make them even simpler. Now you only need to specify your --data and your pretrained --weights.

python train.py --data road.yaml --weights yolov5s.pt --batch-size 16

github-actions · 2020-12-01T00:39:58Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

alicera added the question Further information is requested label Oct 23, 2020

github-actions bot added the Stale label Dec 1, 2020

github-actions bot closed this as completed Dec 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the speed of training my custom data #1196

the speed of training my custom data #1196

alicera commented Oct 23, 2020

glenn-jocher commented Oct 24, 2020

glenn-jocher commented Oct 24, 2020

alicera commented Oct 26, 2020

alicera commented Oct 26, 2020 •

edited by glenn-jocher

Loading

glenn-jocher commented Oct 26, 2020

dongjuns commented Oct 31, 2020 •

edited

Loading

glenn-jocher commented Oct 31, 2020

github-actions bot commented Dec 1, 2020

the speed of training my custom data #1196

the speed of training my custom data #1196

Comments

alicera commented Oct 23, 2020

❔Question

glenn-jocher commented Oct 24, 2020

glenn-jocher commented Oct 24, 2020

alicera commented Oct 26, 2020

alicera commented Oct 26, 2020 • edited by glenn-jocher Loading

glenn-jocher commented Oct 26, 2020

dongjuns commented Oct 31, 2020 • edited Loading

glenn-jocher commented Oct 31, 2020

github-actions bot commented Dec 1, 2020

alicera commented Oct 26, 2020 •

edited by glenn-jocher

Loading

dongjuns commented Oct 31, 2020 •

edited

Loading