
Rerange the blocks of Focus Layer into row major to be compatible with tensorflow SpaceToDepth #413

Closed
ausk opened this issue Jul 15, 2020 · 9 comments
Labels
enhancement New feature or request Stale

Comments


ausk commented Jul 15, 2020

🚀 Feature

Modify the Focus layer to use row-major block order, making it compatible with tf.space_to_depth.

Just change the block order:
from: torch.cat([x[..., ::2, ::2], x[..., 1::2, ::2], x[..., ::2, 1::2], x[..., 1::2, 1::2]], 1)
to:   torch.cat([x[..., ::2, ::2], x[..., ::2, 1::2], x[..., 1::2, ::2], x[..., 1::2, 1::2]], 1)

Motivation

In models/common.py, the Focus layer is defined in PyTorch as follows:

class Focus(nn.Module):
    # Focus wh information into c-space
    def __init__(self, c1, c2, k=1, s=1, p=None, g=1, act=True):  # ch_in, ch_out, kernel, stride, padding, groups
        super(Focus, self).__init__()
        self.conv = Conv(c1 * 4, c2, k, s, p, g, act)

    def forward(self, x):  # x(b,c,w,h) -> y(b,4c,w/2,h/2)
        # original 
        return self.conv(torch.cat([x[..., ::2, ::2], x[..., 1::2, ::2], x[..., ::2, 1::2], x[..., 1::2, 1::2]], 1))
        # suggestion (row-major, matches tf.space_to_depth):
        # return self.conv(torch.cat([x[..., ::2, ::2], x[..., ::2, 1::2], x[..., 1::2, ::2], x[..., 1::2, 1::2]], 1))
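The effect of the two orderings can be checked without torch. Below is a minimal pure-Python sketch (nested lists standing in for the single channel of a 1x1x2x2 tensor; `slice2d` is a hypothetical helper, not from the repo) of the strided slicing and the two concatenation orders:

```python
# Pure-Python sketch of the Focus slicing: no torch/tf needed.
# slice2d(x, r0, c0) mimics x[r0::2, c0::2] on a 2-D nested list.

def slice2d(x, r0, c0):
    return [row[c0::2] for row in x[r0::2]]

x = [[0, 1],
     [2, 3]]  # single channel of the 1x1x2x2 example below

# Original Focus order (column-major walk of the 2x2 block):
original = [slice2d(x, 0, 0), slice2d(x, 1, 0), slice2d(x, 0, 1), slice2d(x, 1, 1)]
# Suggested row-major order, matching tf.space_to_depth:
suggested = [slice2d(x, 0, 0), slice2d(x, 0, 1), slice2d(x, 1, 0), slice2d(x, 1, 1)]

print(original)   # [[[0]], [[2]], [[1]], [[3]]]
print(suggested)  # [[[0]], [[1]], [[2]], [[3]]]
```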

And @bonlime posted a brief answer to What's the Focus layer? #207:

check TResNet paper. p2. They call it SpaceToDepth

In the TResNet paper, §2.1: "We wanted to create a fast, seamless stem layer, with as little information loss as possible, and let the simple well designed residual blocks do all the actual processing work. The stem sole functionality should be to downscale the input resolution to match the rest of the architecture, e.g., by a factor of 4. We met these goals by using a dedicated SpaceToDepth transformation layer [32], that rearranges blocks of spatial data into depth. The SpaceToDepth transformation layer is followed by simple 1x1 convolution to match the number of wanted channels."

That is to say, the Focus layer quickly downscales the input resolution by rearranging blocks of spatial data into depth, then adjusts the number of feature channels, typically with a 1x1 conv.
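As a shape-only sketch (pure Python; the 640x640 input and the output channel count are illustrative assumptions, not values from the repo), the stem halves the spatial resolution via space-to-depth and the 1x1 conv only remaps channels:

```python
def focus_output_shape(n, c, h, w, c_out):
    """Shape flow of a Focus/SpaceToDepth stem with block size 2:
    (n, c, h, w) -> space-to-depth -> (n, 4c, h//2, w//2)
                 -> 1x1 conv       -> (n, c_out, h//2, w//2)."""
    s2d = (n, 4 * c, h // 2, w // 2)
    conv = (n, c_out, h // 2, w // 2)
    return s2d, conv

print(focus_output_shape(1, 3, 640, 640, 64))
# ((1, 12, 320, 320), (1, 64, 320, 320))
```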


And there is an op, SpaceToDepth (tf.space_to_depth in TF1, tf.nn.space_to_depth in TF2), that rearranges blocks of spatial data into depth.

The Focus layer uses torch.cat([x[..., ::2, ::2], x[..., 1::2, ::2], x[..., ::2, 1::2], x[..., 1::2, 1::2]], 1).

Then we compare:

(0) input

[[[[0 1]
   [2 3]]]]

(1) by Focus torch.cat([x[..., ::2, ::2], x[..., 1::2, ::2], x[..., ::2, 1::2], x[..., 1::2, 1::2]], 1)

[[[[0]]
  [[2]]
  [[1]]
  [[3]]]]

(2) by tensorflow

[[[[0]]
  [[1]]
  [[2]]
  [[3]]]]

(3) by modified Focus torch.cat([x[..., ::2, ::2], x[..., ::2, 1::2], x[..., 1::2, ::2], x[..., 1::2, 1::2]], 1)

[[[[0]]
  [[1]]
  [[2]]
  [[3]]]]

So, by just modifying the order of the blocks, we can make it compatible with the TensorFlow SpaceToDepth op.
This will make the model easier to port to TensorFlow.

@ausk ausk added the enhancement New feature or request label Jul 15, 2020
@github-actions
Contributor

github-actions bot commented Jul 15, 2020

Hello @ausk, thank you for your interest in our work! Please visit our Custom Training Tutorial to get started, and see our Jupyter Notebook Open In Colab, Docker Image, and Google Cloud Quickstart Guide for example environments.

If this is a bug report, please provide screenshots and minimum viable code to reproduce your issue, otherwise we can not help you.

If this is a custom model or data training question, please note that Ultralytics does not provide free personal support. As a leader in vision ML and AI, we do offer professional consulting, from simple expert advice up to delivery of fully customized, end-to-end production solutions for our clients, such as:

  • Cloud-based AI systems operating on hundreds of HD video streams in realtime.
  • Edge AI integrated into custom iOS and Android apps for realtime 30 FPS video inference.
  • Custom data training, hyperparameter evolution, and model exportation to any destination.

For more information please visit https://www.ultralytics.com.

@ausk ausk changed the title Modify Focus Layer in row major to be compatible with tensorflow SpaceToDepth Rerange the blocks of Focus Layer into row major to be compatible with tensorflow SpaceToDepth Jul 15, 2020
@glenn-jocher
Member

@ausk modifying the Focus() module will invalidate all YOLOv5 pretrained models, so I would highly advise against it.

@ausk
Author

ausk commented Jul 16, 2020

@glenn-jocher Modifying the Focus() module would bring the benefit of improved portability, because many frameworks/libraries, such as TensorFlow, store data in row-major order. ONNX/TensorRT also support space-to-depth.

Yes, it will hurt the accuracy of current pretrained models. But when training from scratch, I still recommend the modification. It's a tradeoff.

@glenn-jocher
Member

Sure. I volunteer you to retrain all of the pretrained models to their current accuracy with your proposed architecture changes then. Once this is done please submit a PR and we are all set :)

@ausk
Author

ausk commented Jul 25, 2020

Thank you for your work, anyway.

I realized that the space2depth (slice and concat ops) of Focus is the 0th layer of the model, so at inference time we can remove it and keep just the conv; the input then becomes NCHW (nb, 12, nh, nw). Finally, I translated the small model (v2) into Keras (TensorFlow) with NHWC (1, nh, nw, nc) input, and inference succeeded.
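A sketch of that preprocessing idea, in pure Python with nested lists standing in for NHWC tensors (the helper name is mine, not from the repo): apply a row-major 2x2 space-to-depth to the input outside the graph, so the exported model can start directly at the conv:

```python
def space_to_depth_nhwc(x):
    """Row-major 2x2 space-to-depth on a nested-list NHWC tensor:
    (n, h, w, c) -> (n, h//2, w//2, 4c), same block order as
    tf.space_to_depth with block_size=2."""
    out = []
    for img in x:
        h, w = len(img), len(img[0])
        out.append([
            # each output pixel stacks the 4 block pixels' channels,
            # walking the 2x2 block in row-major order
            [img[i][j] + img[i][j + 1] + img[i + 1][j] + img[i + 1][j + 1]
             for j in range(0, w, 2)]
            for i in range(0, h, 2)
        ])
    return out

print(space_to_depth_nhwc([[[[0], [1]], [[2], [3]]]]))
# [[[[0, 1, 2, 3]]]]
```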

I'll just close this, as you rejected it.

@ausk ausk closed this as completed Jul 25, 2020
@glenn-jocher
Member

@ausk ok sounds good! But no I didn't reject the idea. If you can retrain the 4 models with your changes to >= performance and submit a PR then we are good to go.

@glenn-jocher
Member

glenn-jocher commented Jan 10, 2021

@ausk better late than never, I've reopened this issue and will examine this option more closely to better align PyTorch and TF YOLOv5 versions to possibly improve TFLite export (google-coral/edgetpu#272).

EDIT: I don't see a problem here, seems like a simple change that brings exportability benefits. I'll try my best to include this update in the next release that includes fully retrained models (i.e. 4.1 or 5.0 possibly).

@glenn-jocher glenn-jocher reopened this Jan 10, 2021
@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@glenn-jocher
Member

TODO removed following release v6.0 architecture updates.
