Is the Focus layer equivalent to a simple Conv layer? #4825

Closed · thomasbi1 opened this issue Sep 16, 2021 · 22 comments
Labels: question (Further information is requested), Stale

@thomasbi1
Hi

I had a look at the Focus layer, and it seems to me that it is equivalent to a simple 2D convolutional layer, without the need for the space-to-depth operation. For example, a Focus layer with kernel size 3 can be expressed as a Conv layer with kernel size 6 and stride 2. I wrote some code to verify this:

import torch
from models.common import Focus, Conv
from utils.torch_utils import profile


focus = Focus(3, 64, k=3).eval()
conv = Conv(3, 64, k=6, s=2, p=2).eval()

# Express focus layer as conv layer
conv.bn = focus.conv.bn
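# Focus concatenates the [even-row/even-col, odd/even, even/odd, odd/odd]
# pixel grids along the channel dim, so each 3-channel group of the Focus
# weights scatters into the matching even/odd positions of the 6x6 kernel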
conv.conv.weight.data[:, :, ::2, ::2] = focus.conv.conv.weight.data[:, :3]
conv.conv.weight.data[:, :, 1::2, ::2] = focus.conv.conv.weight.data[:, 3:6]
conv.conv.weight.data[:, :, ::2, 1::2] = focus.conv.conv.weight.data[:, 6:9]
conv.conv.weight.data[:, :, 1::2, 1::2] = focus.conv.conv.weight.data[:, 9:12]

# Compare
x = torch.randn(16, 3, 640, 640)
with torch.no_grad():
    # Results are not perfectly identical, errors up to about 1e-7 occur (probably numerical)
    assert torch.allclose(focus(x), conv(x), atol=1e-6)

# Profile
results = profile(input=torch.randn(16, 3, 640, 640), ops=[focus, conv, focus, conv], n=10, device=0)

And the output is as follows:

YOLOv5 🚀 v5.0-434-g0dc725e torch 1.9.0+cu111 CUDA:0 (A100-SXM4-40GB, 40536.1875MB)
      Params      GFLOPs  GPU_mem (GB)  forward (ms) backward (ms)                   input                  output
        7040       23.07         2.682         4.055         13.78       (16, 3, 640, 640)      (16, 64, 320, 320)
        7040       23.07         2.368         3.474         9.989       (16, 3, 640, 640)      (16, 64, 320, 320)
        7040       23.07         2.343         3.556         11.57       (16, 3, 640, 640)      (16, 64, 320, 320)
        7040       23.07         2.368         3.456         9.961       (16, 3, 640, 640)      (16, 64, 320, 320)

I did have to slightly tweak the tolerance in torch.allclose for the assertion to succeed, but looking at the errors, they seem to be purely numerical.

So am I missing something, or could the Focus layer simply be replaced by a Conv layer, which would lead to a slight increase in speed?
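For reference, the same equivalence can be checked without the YOLOv5 repo, using plain nn.Conv2d modules (an illustrative sketch; the Focus slicing is reimplemented inline and the names here are my own):

import torch
import torch.nn as nn

def space_to_depth(x):
    # Focus slicing: stack the four pixel parities along the channel dim
    return torch.cat([x[..., ::2, ::2], x[..., 1::2, ::2],
                      x[..., ::2, 1::2], x[..., 1::2, 1::2]], 1)

focus_conv = nn.Conv2d(12, 64, 3, stride=1, padding=1, bias=False)
big_conv = nn.Conv2d(3, 64, 6, stride=2, padding=2, bias=False)

with torch.no_grad():
    w = focus_conv.weight  # shape (64, 12, 3, 3)
    big_conv.weight[:, :, ::2, ::2] = w[:, 0:3]     # even-row/even-col pixels
    big_conv.weight[:, :, 1::2, ::2] = w[:, 3:6]    # odd-row/even-col
    big_conv.weight[:, :, ::2, 1::2] = w[:, 6:9]    # even-row/odd-col
    big_conv.weight[:, :, 1::2, 1::2] = w[:, 9:12]  # odd-row/odd-col

x = torch.randn(2, 3, 64, 64)
with torch.no_grad():
    assert torch.allclose(focus_conv(space_to_depth(x)), big_conv(x), atol=1e-6)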

@thomasbi1 added the question (Further information is requested) label on Sep 16, 2021
@github-actions (bot) commented Sep 16, 2021

👋 Hello @thomasbi1, thank you for your interest in YOLOv5 🚀! Please visit our ⭐️ Tutorials to get started, where you can find quickstart guides for simple tasks like Custom Data Training all the way to advanced concepts like Hyperparameter Evolution.

If this is a 🐛 Bug Report, please provide screenshots and minimum viable code to reproduce your issue, otherwise we cannot help you.

If this is a custom training ❓ Question, please provide as much information as possible, including dataset images, training logs, screenshots, and a public link to online W&B logging if available.

For business inquiries or professional support requests please visit https://ultralytics.com or email Glenn Jocher at glenn.jocher@ultralytics.com.

Requirements

Python>=3.6.0 with all requirements.txt dependencies installed, including PyTorch>=1.7. To get started:

$ git clone https://github.com/ultralytics/yolov5
$ cd yolov5
$ pip install -r requirements.txt

Environments

YOLOv5 may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):

Status

CI CPU testing

If this badge is green, all YOLOv5 GitHub Actions Continuous Integration (CI) tests are currently passing. CI tests verify correct operation of YOLOv5 training (train.py), validation (val.py), inference (detect.py) and export (export.py) on macOS, Windows, and Ubuntu every 24 hours and on every commit.

@glenn-jocher (Member)

@thomasbi1 thanks for raising this issue! It would be exciting if we could simplify this layer; we are always looking for improvements. I will try to reproduce your results later today.

@glenn-jocher (Member) commented Sep 18, 2021

@thomasbi1 I was able to run your comparison in Colab. The allclose check seems OK, but I got significantly different profile results when testing in Colab (with a T4). I'm not sure what the cause could be; I'll try different hardware later.

[screenshot: Colab T4 profiling results]

@glenn-jocher (Member)

I profiled using the more traditional %timeit method as well and saw similar forward times (ms) to the YOLOv5 profiler:

[screenshot: %timeit results]
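For anyone reproducing the %timeit comparison, a cell along these lines (a hypothetical reconstruction, since only a screenshot was posted; it assumes the focus and conv modules defined earlier) synchronizes around the call so CUDA kernel time is actually measured:

import torch

focus, conv = focus.cuda(), conv.cuda()
x = torch.randn(16, 3, 640, 640, device='cuda')

def timed(m):
    # synchronize so %timeit measures kernel execution, not just kernel launch
    torch.cuda.synchronize()
    y = m(x)
    torch.cuda.synchronize()
    return y

%timeit timed(focus)
%timeit timed(conv)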

@thomasbi1 (Author)

Wow, that is quite a big discrepancy!

I also tried Colab with a K80, which again gives vastly different results (the original Focus has a much faster forward time but a slightly slower backward pass).

[screenshot: K80 profiling results]

@thomasbi1 (Author)

I also tried a GTX 1080 Ti, which again shows different results. I'm not sure if this is simply due to different hardware, different CUDA versions, or something else.

[screenshot: GTX 1080 Ti profiling results]

@glenn-jocher (Member)

V100 results here show an improvement forward and backward at batch sizes 16 and 1.

YOLOv5 🚀 v5.0-449-g9ef9494 torch 1.9.0 CUDA:0 (Tesla V100-SXM2-16GB, 16160.5MB)
      Params      GFLOPs  GPU_mem (GB)  forward (ms) backward (ms)                   input                  output
        7040       23.07         2.259         4.497         16.88       (16, 3, 640, 640)      (16, 64, 320, 320)
        7040       23.07         1.839         4.107            12       (16, 3, 640, 640)      (16, 64, 320, 320)
        7040       23.07         1.919         4.444         16.63       (16, 3, 640, 640)      (16, 64, 320, 320)
        7040       23.07         1.839         4.113         11.98       (16, 3, 640, 640)      (16, 64, 320, 320)

YOLOv5 🚀 v5.0-449-g9ef9494 torch 1.9.0 CUDA:0 (Tesla V100-SXM2-16GB, 16160.5MB)
      Params      GFLOPs  GPU_mem (GB)  forward (ms) backward (ms)                   input                  output
        7040       1.442         0.581        0.4828         1.377        (1, 3, 640, 640)       (1, 64, 320, 320)
        7040       1.442         0.161        0.4387        0.9845        (1, 3, 640, 640)       (1, 64, 320, 320)
        7040       1.442         0.161        0.4806         1.407        (1, 3, 640, 640)       (1, 64, 320, 320)
        7040       1.442         0.161        0.4376        0.9555        (1, 3, 640, 640)       (1, 64, 320, 320)

On balance it seems to help most setups, though the large K80 and T4 slowdowns are unfortunate, as those are Colab mainstays. I'll take a look at exportability next, though I don't anticipate any issues there.

@glenn-jocher (Member) commented Sep 18, 2021

@thomasbi1 export tests are good! OK, this seems like a good candidate for our upcoming v6.0 release, which will arrive with a few other minor architecture updates in October. I will add this issue to the release notes and make sure credit is assigned to you for uncovering this. Thank you for your contributions, and let us know if you spot any other items for improvement!

TODO: apply to v6.0 release backbone updates:

backbone:
  # [from, number, module, args]
  [[-1, 1, Conv, [64, 6, 2, 2]],  # 0-P1/2 <--- update
   [-1, 1, Conv, [128, 3, 2]],  # 1-P2/4
   [-1, 3, C3, [128]],
   [-1, 1, Conv, [256, 3, 2]],  # 3-P3/8
   [-1, 9, C3, [256]],
   [-1, 1, Conv, [512, 3, 2]],  # 5-P4/16
   [-1, 9, C3, [512]],
   [-1, 1, Conv, [1024, 3, 2]],  # 7-P5/32
   [-1, 1, SPP, [1024, [5, 9, 13]]],
   [-1, 3, C3, [1024, False]],  # 9
  ]
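For clarity, that first backbone entry expands to roughly the following stem (an illustrative sketch; YOLOv5's Conv block wraps Conv2d, BatchNorm2d and a SiLU activation):

import torch.nn as nn

# [-1, 1, Conv, [64, 6, 2, 2]] -> Conv(c_in=3, c_out=64, k=6, s=2, p=2)
stem = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=6, stride=2, padding=2, bias=False),
    nn.BatchNorm2d(64),
    nn.SiLU(),
)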

@glenn-jocher (Member)

Removed TODO, update included in upcoming v6.0 release.

@ptklx commented Oct 15, 2021

@glenn-jocher Hi, when I converted v6.0 to TensorRT it reported the error (Unnamed Layer* 0) [Convolution]: group count must divide input channel count,
and I found this:

[screenshot: model YAML]

[[-1, 1, Conv, [64, 6, 2, 2]], # 0-P1/2 <--- update

The input channels (3) and 64 cannot be evenly divided by 2.

@glenn-jocher (Member)

@ptklx I'm not sure I follow; groups=1 everywhere, and there are no grouped convolutions in any YOLOv5 models. If you believe you have a reproducible bug, I recommend submitting a bug report with clear steps to reproduce.

@ptklx commented Oct 18, 2021

@glenn-jocher Sorry, I misread that part.

@yao-zheng-yi

@glenn-jocher Hi, I would like to ask why Conv can replace Focus. I ran the above code, and the result shows that Focus is significantly faster than Conv.

[screenshot: profiling results]

@thomasbi1 (Author)

@yao-zheng-yi As far as I can tell, the Conv layer is more easily exported to other formats (ONNX, TensorFlow, TFLite, etc.), as it is a more standard operation than the space-to-depth operation in the Focus layer. @glenn-jocher can probably elaborate.

Regarding speed, I seem to remember that the CUDA version plays an important role and that the Conv layer is faster on version 11. You could try updating your CUDA version to 11 and see if that helps.
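A quick way to check which versions you are running (standard PyTorch calls):

import torch

print(torch.__version__)              # e.g. 1.9.0+cu111
print(torch.version.cuda)             # CUDA version PyTorch was built against
print(torch.cuda.get_device_name(0))  # GPU model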

@glenn-jocher (Member) commented Oct 18, 2021

@yao-zheng-yi yes, from your profiling results Focus() is much faster on your machine. We did a pretty thorough analysis across various GPUs and found that Focus() is faster on many consumer cards and on T4 GPUs, while Conv() tends to be faster on enterprise cards and newer hardware.

This change (like most every other change) is an exercise in compromise: we implement changes when we feel the benefits outweigh the drawbacks for most users. This is one place where some users will experience slowdowns while others see improvements, and hopefully that ratio will improve on future hardware.

One of the main benefits is also improved exportability and a simpler architecture.
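As a concrete illustration of the exportability point (a minimal sketch, not the project's actual export.py): a plain strided conv exports to a single ONNX Conv node, whereas the Focus slicing adds Slice/Concat nodes in front of it.

import torch

conv = torch.nn.Conv2d(3, 64, kernel_size=6, stride=2, padding=2, bias=False)
torch.onnx.export(conv, torch.randn(1, 3, 640, 640), 'conv.onnx', opset_version=12)
# Inspecting conv.onnx (e.g. with Netron) shows a single Conv node; exporting
# a Focus-style module instead yields several Slice nodes and a Concat first.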

@yao-zheng-yi

@thomasbi1 @glenn-jocher Many thanks!

@duanzhiihao

I don't know if anyone still cares, but I also tried the script on an NVIDIA 1080 Ti and a 3090; Conv is better in both cases.

[screenshot: 1080 Ti profiling results]
[screenshot: 3090 profiling results]

@glenn-jocher (Member)

@duanzhiihao great, yes thanks for the feedback!

@SSHtoyourheart

My GPU memory is only 2 GB 😭

@tothedistance

This is how I converted the YOLO Focus layer to Caffe. What a coincidence!

@seeyouagain111

nice, thanks for your guys's job

@github-actions (bot) commented Dec 28, 2021

👋 Hello, this issue has been automatically marked as stale because it has not had recent activity. Please note it will be closed if no further activity occurs.


Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐!
