Test-Time Augmentation (TTA) Tutorial #303

glenn-jocher · 2020-07-05T18:43:52Z

📚 This guide explains how to use Test Time Augmentation (TTA) during testing and inference for improved mAP and Recall with YOLOv5 🚀. UPDATED 25 September 2022.

Before You Start

Clone repo and install requirements.txt in a Python>=3.7.0 environment, including PyTorch>=1.7. Models and datasets download automatically from the latest YOLOv5 release.

git clone https://github.com/ultralytics/yolov5  # clone
cd yolov5
pip install -r requirements.txt  # install

Test Normally

Before trying TTA we want to establish a baseline performance to compare to. This command tests YOLOv5x on COCO val2017 at image size 640 pixels. yolov5x.pt is the largest and most accurate model available. Other options are yolov5s.pt, yolov5m.pt and yolov5l.pt, or you own checkpoint from training a custom dataset ./weights/best.pt. For details on all available models please see our README table.

$ python val.py --weights yolov5x.pt --data coco.yaml --img 640 --half

Output:

val: data=./data/coco.yaml, weights=['yolov5x.pt'], batch_size=32, imgsz=640, conf_thres=0.001, iou_thres=0.65, task=val, device=, single_cls=False, augment=False, verbose=False, save_txt=False, save_hybrid=False, save_conf=False, save_json=True, project=runs/val, name=exp, exist_ok=False, half=True
YOLOv5 🚀 v5.0-267-g6a3ee7c torch 1.9.0+cu102 CUDA:0 (Tesla P100-PCIE-16GB, 16280.875MB)

Fusing layers... 
Model Summary: 476 layers, 87730285 parameters, 0 gradients

val: Scanning '../datasets/coco/val2017' images and labels...4952 found, 48 missing, 0 empty, 0 corrupted: 100% 5000/5000 [00:01<00:00, 2846.03it/s]
val: New cache created: ../datasets/coco/val2017.cache
               Class     Images     Labels          P          R     mAP@.5 mAP@.5:.95: 100% 157/157 [02:30<00:00,  1.05it/s]
                 all       5000      36335      0.746      0.626       0.68       0.49
Speed: 0.1ms pre-process, 22.4ms inference, 1.4ms NMS per image at shape (32, 3, 640, 640)  # <--- baseline speed

Evaluating pycocotools mAP... saving runs/val/exp/yolov5x_predictions.json...
...
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.504  # <--- baseline mAP
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.688
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.546
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.351
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.551
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.644
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.382
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.628
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.681  # <--- baseline mAR
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.524
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.735
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.826

Test with TTA

Append --augment to any existing val.py command to enable TTA, and increase the image size by about 30% for improved results. Note that inference with TTA enabled will typically take about 2-3X the time of normal inference as the images are being left-right flipped and processed at 3 different resolutions, with the outputs merged before NMS. Part of the speed decrease is simply due to larger image sizes (832 vs 640), while part is due to the actual TTA operations.

$ python val.py --weights yolov5x.pt --data coco.yaml --img 832 --augment --half

Output:

val: data=./data/coco.yaml, weights=['yolov5x.pt'], batch_size=32, imgsz=832, conf_thres=0.001, iou_thres=0.6, task=val, device=, single_cls=False, augment=True, verbose=False, save_txt=False, save_hybrid=False, save_conf=False, save_json=True, project=runs/val, name=exp, exist_ok=False, half=True
YOLOv5 🚀 v5.0-267-g6a3ee7c torch 1.9.0+cu102 CUDA:0 (Tesla P100-PCIE-16GB, 16280.875MB)

Fusing layers... 
/usr/local/lib/python3.7/dist-packages/torch/nn/functional.py:718: UserWarning: Named tensors and all their associated APIs are an experimental feature and subject to change. Please do not use them for anything important until they are released as stable. (Triggered internally at  /pytorch/c10/core/TensorImpl.h:1156.)
  return torch.max_pool2d(input, kernel_size, stride, padding, dilation, ceil_mode)
Model Summary: 476 layers, 87730285 parameters, 0 gradients
val: Scanning '../datasets/coco/val2017' images and labels...4952 found, 48 missing, 0 empty, 0 corrupted: 100% 5000/5000 [00:01<00:00, 2885.61it/s]
val: New cache created: ../datasets/coco/val2017.cache
               Class     Images     Labels          P          R     mAP@.5 mAP@.5:.95: 100% 157/157 [07:29<00:00,  2.86s/it]
                 all       5000      36335      0.718      0.656      0.695      0.503
Speed: 0.2ms pre-process, 80.6ms inference, 2.7ms NMS per image at shape (32, 3, 832, 832)  # <--- TTA speed

Evaluating pycocotools mAP... saving runs/val/exp2/yolov5x_predictions.json...
...
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.516  # <--- TTA mAP
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.701
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.562
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.361
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.564
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.656
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.388
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.640
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.696  # <--- TTA mAR
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.553
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.744
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.833

Inference with TTA

detect.py TTA inference operates identically to val.py TTA: simply append --augment to any existing detect.py command:

$ python detect.py --weights yolov5s.pt --img 832 --source data/images --augment

Output:

detect: weights=['yolov5s.pt'], source=data/images, imgsz=832, conf_thres=0.25, iou_thres=0.45, max_det=1000, device=, view_img=False, save_txt=False, save_conf=False, save_crop=False, nosave=False, classes=None, agnostic_nms=False, augment=True, update=False, project=runs/detect, name=exp, exist_ok=False, line_thickness=3, hide_labels=False, hide_conf=False, half=False
YOLOv5 🚀 v5.0-267-g6a3ee7c torch 1.9.0+cu102 CUDA:0 (Tesla P100-PCIE-16GB, 16280.875MB)

Downloading https://github.com/ultralytics/yolov5/releases/download/v5.0/yolov5s.pt to yolov5s.pt...
100% 14.1M/14.1M [00:00<00:00, 81.9MB/s]

Fusing layers... 
Model Summary: 224 layers, 7266973 parameters, 0 gradients
image 1/2 /content/yolov5/data/images/bus.jpg: 832x640 4 persons, 1 bus, 1 fire hydrant, Done. (0.029s)
image 2/2 /content/yolov5/data/images/zidane.jpg: 480x832 3 persons, 3 ties, Done. (0.024s)
Results saved to runs/detect/exp
Done. (0.156s)

PyTorch Hub TTA

TTA is automatically integrated into all YOLOv5 PyTorch Hub models, and can be accessed by passing augment=True at inference time.

import torch

# Model
model = torch.hub.load('ultralytics/yolov5', 'yolov5s')  # or yolov5m, yolov5x, custom

# Images
img = 'https://ultralytics.com/images/zidane.jpg'  # or file, PIL, OpenCV, numpy, multiple

# Inference
results = model(img, augment=True)  # <--- TTA inference

# Results
results.print()  # or .show(), .save(), .crop(), .pandas(), etc.

Customize

You can customize the TTA ops applied in the YOLOv5 forward_augment() method here:

yolov5/models/yolo.py

Lines 125 to 137 in 8c6f9e1

    
           def forward_augment(self, x): 
        
               img_size = x.shape[-2:]  # height, width 
        
               s = [1, 0.83, 0.67]  # scales 
        
               f = [None, 3, None]  # flips (2-ud, 3-lr) 
        
               y = []  # outputs 
        
               for si, fi in zip(s, f): 
        
                   xi = scale_img(x.flip(fi) if fi else x, si, gs=int(self.stride.max())) 
        
                   yi = self.forward_once(xi)[0]  # forward 
        
                   # cv2.imwrite(f'img_{si}.jpg', 255 * xi[0].cpu().numpy().transpose((1, 2, 0))[:, :, ::-1])  # save 
        
                   yi = self._descale_pred(yi, fi, si, img_size) 
        
                   y.append(yi) 
        
               return torch.cat(y, 1), None  # augmented inference, train

Environments

YOLOv5 may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):

Notebooks with free GPU:
Google Cloud Deep Learning VM. See GCP Quickstart Guide
Amazon Deep Learning AMI. See AWS Quickstart Guide
Docker Image. See Docker Quickstart Guide

Status

If this badge is green, all YOLOv5 GitHub Actions Continuous Integration (CI) tests are currently passing. CI tests verify correct operation of YOLOv5 training, validation, inference, export and benchmarks on MacOS, Windows, and Ubuntu every 24 hours and on every commit.

The text was updated successfully, but these errors were encountered:

swapnil-saha · 2020-08-09T10:47:41Z

I have a question about the value under column P. Is it map@.60 ? (the default IOU threshold value is .60 at test.py)

Aktcob · 2020-08-10T06:55:24Z

I have a question about the value under column P. Is it map@.60 ? (the default IOU threshold value is .60 at test.py)

The default IOU threshold value is NMS threshold, not the map

github-actions · 2020-09-14T00:39:11Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

github-actions · 2020-11-16T00:33:35Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

gizemtanriver · 2020-11-18T21:08:46Z

Hi, what are those 3 different resolutions that TTA uses? Are they selected randomly? Many thanks

vaskers5 · 2022-11-27T15:35:12Z

#10312 pls check my pr - I have fixed this problem

Now it can works like this:

Robotatron · 2023-01-23T21:42:27Z

does it work with segmentation?

glenn-jocher added documentation Improvements or additions to documentation enhancement New feature or request labels Jul 5, 2020

glenn-jocher self-assigned this Jul 5, 2020

This was referenced Jul 5, 2020

[Help needed] Test-time augmentation (TTA) google/automl#503

Closed

YOLOv5 License Issues with Kaggle Wheat Competition: GPL vs MIT #317

Closed

github-actions bot mentioned this issue Aug 17, 2020

CUDA out of memeory,I set batch_size 1,it does not work ultralytics/yolov3#1381

Closed

This was referenced Aug 17, 2020

When multi-GPU training, my validation map value is very low. ultralytics/yolov3#1436

Closed

Yolo v3 take a lot of time to train on custom data ultralytics/yolov3#1458

Closed

build_targets function ultralytics/yolov3#8

Closed

github-actions bot added the Stale label Sep 14, 2020

github-actions bot closed this as completed Sep 20, 2020

RainbowSun11Q2H mentioned this issue Sep 21, 2020

The reproducing results is differnt from your table #1002

Closed

glenn-jocher removed the Stale label Sep 21, 2020

glenn-jocher reopened this Sep 21, 2020

alicera mentioned this issue Oct 26, 2020

the speed of training my custom data #1196

Closed

This was referenced Oct 26, 2020

How to initial weight without pretrain? ultralytics/yolov3#1535

Closed

WARNING: non-finite loss, ending training tensor([nan, nan, 0., nan], device='cuda:0') ultralytics/yolov3#1539

Closed

glenn-jocher mentioned this issue Nov 10, 2020

What is augmented inference ？ #1340

Closed

github-actions bot added the Stale label Nov 16, 2020

This was referenced Jul 8, 2022

Custom trained object detection model not working #8528

Closed

Input node with name onnx: not found #8532

Closed

How did you determine the values of all hyperparameters in hyp.scratch-low.yaml? #8538

Closed

glenn-jocher mentioned this issue Aug 1, 2022

Few-shot for YOLOv5 #8818

Closed

2 tasks

glenn-jocher mentioned this issue Aug 14, 2022

Augmentation on Val Dataset #8949

Closed

1 task

This was referenced Aug 30, 2022

Zeroes in Hyperparameter Evolution? #9214

Closed

What hyperparams do I need to tune when I want to continue a previous training? #9257

Closed

glenn-jocher mentioned this issue Sep 6, 2022

Performance better on smaller image sizes (compared to training size) on some images #9294

Closed

1 task

glenn-jocher mentioned this issue Sep 15, 2022

TypeError: tuple indices must be integers or slices, not tuple #9390

Closed

2 tasks

This was referenced Sep 25, 2022

I have a .pt how can I load it with model.hub.load() and run validation ultralytics/yolov3#1974

Closed

Documentation of methods, parameters, allowed values, term definitions, etc, etc #9584

Closed

This was referenced Oct 3, 2022

Gradual unfreezing the layers during training. #9677

Closed

Procedure of training the model offline. #9700

Closed

glenn-jocher mentioned this issue Oct 10, 2022

confusion matrix - backgroud part #9754

Closed

This was referenced Oct 24, 2022

Use Yolo for anomaly detection #9906

Closed

I want to pass the image read by opencv to the model I/F #9913

Closed

ingin97 mentioned this issue Oct 27, 2022

Problem with val.py. missing argument in val.py sebastianvitterso/master-sau#4

Closed

kartikeyporwal mentioned this issue Nov 3, 2022

yolov7-w6 and yolov7-w6+TTA weight refer to the same URL derronqi/yolov7-face#14

Closed

This was referenced Nov 6, 2022

Number of Classes #10054

Closed

Multigpu training becomes slower in Kaggle #10078

Closed

Yolov5 cannot detection a video (tfjs) #7416

Closed

vaskers5 mentioned this issue Nov 27, 2022

add support for custom augmentations #10312

Closed

glenn-jocher mentioned this issue Dec 6, 2022

How to freeze backbone and unfreeze it after a specific epoch? #10416

Closed

1 task

emre888 mentioned this issue Apr 15, 2023

TTA YOLOV8 ultralytics/ultralytics#1469

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test-Time Augmentation (TTA) Tutorial #303

Test-Time Augmentation (TTA) Tutorial #303

glenn-jocher commented Jul 5, 2020 •

edited

Loading

swapnil-saha commented Aug 9, 2020

Aktcob commented Aug 10, 2020

github-actions bot commented Sep 14, 2020

github-actions bot commented Nov 16, 2020

gizemtanriver commented Nov 18, 2020

vaskers5 commented Nov 27, 2022 •

edited

Loading

Robotatron commented Jan 23, 2023

Test-Time Augmentation (TTA) Tutorial #303

Test-Time Augmentation (TTA) Tutorial #303

Comments

glenn-jocher commented Jul 5, 2020 • edited Loading

Before You Start

Test Normally

Test with TTA

Inference with TTA

PyTorch Hub TTA

Customize

Environments

Status

swapnil-saha commented Aug 9, 2020

Aktcob commented Aug 10, 2020

github-actions bot commented Sep 14, 2020

github-actions bot commented Nov 16, 2020

gizemtanriver commented Nov 18, 2020

vaskers5 commented Nov 27, 2022 • edited Loading

Robotatron commented Jan 23, 2023

glenn-jocher commented Jul 5, 2020 •

edited

Loading

vaskers5 commented Nov 27, 2022 •

edited

Loading