
About [sam] layer. #3708

Closed
ChenCong7375 opened this issue Aug 5, 2019 · 52 comments

ChenCong7375 commented Aug 5, 2019

I noticed that you added a [sam] layer in darknet. How can we use it?

cfg file with [sam]: yolov3-tiny-sam.cfg.txt


LukeAI commented Aug 6, 2019

I think it's for ThunderNet: #3702


WongKinYiu commented Aug 6, 2019

[image]

Notice that the number of filters should be equal to that of the from= layer.
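
For illustration, a minimal sketch of how a [sam] block is commonly wired in a cfg file. The 256-channel count below is hypothetical; the point is that the filters of the attention convolution must match the channels of the layer referenced by from=:

# ... previous layers produce a feature map with 256 channels ...

# 1x1 convolution that produces the spatial attention map
[convolutional]
filters=256
size=1
stride=1
pad=1
activation=logistic

# element-wise multiply the attention map with the earlier feature map
[sam]
from=-2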

ChenCong7375 (Author)

@WongKinYiu could you please share the cfg file?

WongKinYiu (Collaborator)

yolov3-tiny-sam.cfg.txt
here u r.


LukeAI commented Aug 8, 2019

@WongKinYiu Thanks for sharing another novel architecture! Would you be kind enough to explain a little about the design? I notice it contains only a single Yolo layer. What about rough COCO AP / inference time on an RTX?


WongKinYiu commented Aug 8, 2019

#3380 (comment)

u can compare it with efficientnet-b0
#3380 (comment)

by the way, ThunderNet is a 2-stage detector.
you may need to do some modifications to make it suitable for yolo.


LukeAI commented Aug 8, 2019

Oh, I see, this is the CEM + SAM + Yolov3 with 42.0% mAP@0.5 at 2.90 BFLOPs? Sounds great, I'll see how it goes and report back. Have you done any other experimental architectures that you would be happy to share? Do you think it might be improved by trying a PAN-like head?


AlexeyAB commented Aug 8, 2019

@LukeAI If you have time, try to train this model (CEM + SAM + Yolov3 with 42.0% mAP@0.5 at 2.90 BFLOPs) on this dataset: #3114 (comment)

This is for adding the result (Loss & mAP chart, BFLOPs) to this table.

ChenCong7375 (Author)

@AlexeyAB Is there a cfg file for CEM + SAM + Yolov3?
I will give it a try.


WongKinYiu commented Aug 8, 2019

enetb0-cemsam.cfg.txt

Because there is no parameter that lets the up-sampling layer restore the feature maps to the size they had before the global average pooling layer, I use a max-pooling layer instead of the global average pooling layer in CEM.
(#3380 (comment) uses SPP instead of the global average pooling layer.)

If you get an error while training the model, try setting random=0 in the yolo layer.


LukeAI commented Aug 9, 2019

> yolov3-tiny-sam.cfg.txt
> here u r.

I tried to train with:
./darknet detector train my_stuff/bdd100k.data my_stuff/yolov3-tiny-sam.cfg my_stuff/yolov3-tiny.conv.15 -dont_show -mjpeg_port 8090 -map -i 1

But it immediately aborts with:

...
[yolo] params: iou loss: mse, iou_norm: 0.75, cls_norm: 1.00, scale_x_y: 1.00
Total BFLOPS 4.887 
 Allocate additional workspace_size = 1245.71 MB 
Loading weights from my_stuff/yolov3-tiny.conv.15...
 seen 64 
Done!
Learning Rate: 0.001, Momentum: 0.9, Decay: 0.0005
Resizing
608 x 608 
Resizing type 15 
Cannot resize this type of layer: File exists
darknet: ./src/utils.c:293: error: Assertion `0' failed.
....

UPDATE: it works if I set random=0


LukeAI commented Aug 9, 2019

Training now, looking good so far.
What am I missing out on by setting random=0?
How could I add scale_x_y to this model?

WongKinYiu (Collaborator)

@LukeAI
For using scale_x_y, plz see #3114 (comment)


LukeAI commented Aug 11, 2019

@WongKinYiu I mean, I know that the scale models have "scale_x_y = 1.05" or something like that in the Yolo layers, I just don't really understand what an appropriate value would be. I could try with 1.05 and just see how that works? or 1.1?

WongKinYiu (Collaborator)

@LukeAI to set an appropriate value, plz see #3293 (comment)
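
Purely as an illustration, scale_x_y goes inside the [yolo] section. The fragment below uses default yolov3-tiny-style anchors and the 1.05 value LukeAI mentions, not a value recommended in this thread:

[yolo]
mask = 3,4,5
anchors = 10,14, 23,27, 37,58, 81,82, 135,169, 344,319
classes=80
num=6
jitter=.3
ignore_thresh=.7
truth_thresh=1
random=0
scale_x_y=1.05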


LukeAI commented Aug 30, 2019

Hi all,
Here are some experiments I ran a while back using the Berkeley DeepDrive dataset (with a slightly reduced number of classes).
Baseline:
CEM1.cfg.txt
[chart: CEM1]

With anchors generated from the dataset:
CEM_with_anchors.cfg.txt
[chart: CEM_with_anchors]

Using scale_x_y=1.05
CEM_with_scale.cfg.txt
[chart: CEM_with_scale]

Using swish activations:
CEM_with_swish.cfg.txt
[chart: CEM_with_swish]

For comparison, the same dataset trained with tiny_3l:
[chart: tiny_3l]

and with tiny_pan2:
[chart: tiny_pan2_swish_3]

AlexeyAB (Owner)

@LukeAI
So CEM, scale and swish don't give significant improvements?

Is tiny_pan2 the most accurate network?


LukeAI commented Aug 30, 2019

Yeah, tiny_pan2 is a good one; here's hoping for a full-sized pan2 network. I didn't measure the inference time. I guess the point of the CEM network is that it is very fast whilst still being reasonably accurate?

AlexeyAB (Owner)

@LukeAI Just add a comparison table with final accuracy, FLOPS, and inference time.

WongKinYiu (Collaborator)

I think the main improvement is from more anchors/yolo layers.
In my experiments, yolo-v3-tiny-3l gets 5.7% higher mAP@0.5 than yolo-v3-tiny (2l) on a pedestrian detection task.


WongKinYiu commented Sep 2, 2019

Here are some results of my backbone (evaluated on the COCO test-dev set):

  1. model A with 2l (6 anchors): 45.0% mAP@0.5, 4.04 BFLOPs.
  2. model A with 3l (9 anchors): 46.3% mAP@0.5, 5.03 BFLOPs.
  3. model B with 2l (6 anchors): 46.8% mAP@0.5, 4.76 BFLOPs.
  4. model B with cem (6 anchors): 45.2% mAP@0.5, 4.81 BFLOPs.
  5. model B with cem sam (6 anchors): 46.1% mAP@0.5, 4.90 BFLOPs.
  6. model B with modified cem sam (9 anchors): 48.0% mAP@0.5, 4.95 BFLOPs.


AlexeyAB commented Sep 2, 2019

@WongKinYiu

> 6. model B with modified cem sam (9 anchors): 48.0% mAP@0.5, 4.95 BFLOPs.

Thanks!
What modifications did you make in model 6?


WongKinYiu commented Sep 3, 2019

@AlexeyAB Hello, I am on a business trip; I will share the modified cem sam tonight.


WongKinYiu commented Sep 3, 2019

@AlexeyAB modified-cem-sam-head.txt

  1. I use SPP instead of global average pooling, because currently this repo cannot support multi-scale training when global average pooling is used as an intermediate layer (a sketch of a typical SPP block is shown after this list).
  2. Since YOLO is a one-stage object detector, I add a sam layer for each feature pyramid level.
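
As a reference for item 1, here is a minimal sketch of an SPP block as it typically appears in darknet cfg files (pool sizes 5/9/13 as in yolov3-spp; the sizes actually used in modified-cem-sam-head.txt may differ):

### SPP ###
[maxpool]
stride=1
size=5

[route]
layers=-2

[maxpool]
stride=1
size=9

[route]
layers=-4

[maxpool]
stride=1
size=13

# concatenate the three pooled maps with the original input
[route]
layers=-1,-3,-5,-6
### End SPP ###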


AlexeyAB commented Sep 3, 2019

@WongKinYiu Thanks!
Did you compare the inference time (sec) for "2. model A with 3l (9 anchors): 46.3% mAP@0.5, 5.03 BFLOPs" and "6. model B with modified cem sam (9 anchors): 48.0% mAP@0.5, 4.95 BFLOPs"?

WongKinYiu (Collaborator)

@AlexeyAB Hello,
the sam layer is similar to the scale_channels layer: although it adds less than 1% computation, it increases GPU inference time by 20%~30%. On CPU, they take a similar inference time.


AlexeyAB commented Sep 3, 2019

@WongKinYiu When you find the best cfg-file, please share it and I will add it to this repository.

WongKinYiu (Collaborator)

@AlexeyAB For best inference speed, I may share this model after discussing with my team.
[image]
It reduces the number of parameters by 45%, computation by 38%, CPU computation time by 37%, GPU computation time by 19%, and TX2 computation time by 25%, while maintaining the same mAP@0.5 as yolo-v3-tiny.
This model achieves 485 fps on a GTX 1080 Ti (batch size = 1).


AlexeyAB commented Sep 5, 2019

@WongKinYiu

> If I train the model using old repo, then valid the model using new repos. They get worse results, too.

Maybe only the new accuracy-checking function is different, and the training is just as good?
I fixed the mAP function a little.

> GIoU improves mAP@0.5:0.95, but drops mAP@0.5. For some cases, mAP@0.5 is more important.
> PAN2 reduces 13% computation than PAN and reduces 0.5% mAP@0.5 in my experiment.
> Mixup can not benefit lightweight model in my experiments.

Did you test it on MS COCO dataset?

I will add PAN3 block and new tiny model today there: #3114 (comment)


WongKinYiu commented Sep 5, 2019

@AlexeyAB
I upload the predicted bounding boxes to CodaLab.
And when I train the same model several times, the old repo always gets better results.

Yes, all of my experiment results are tested on MS COCO test-dev set.


AlexeyAB commented Sep 5, 2019

@WongKinYiu
I added another model: #3114 (comment)

cfg: https://github.com/AlexeyAB/darknet/files/3580764/yolo_v3_tiny_pan3_aa_ae_mixup_scale_giou.cfg.txt

It seems to be the best cfg-file for this small dataset: #3114 (comment)

You can try to train it on MS COCO and check the mAP if you have time.


AlexeyAB commented Sep 5, 2019

@WongKinYiu Also, can you attach the entire best SAM_CEM model of yours (not only the head)?
I will attach it here and close the Issue: #3702

> @AlexeyAB modified-cem-sam-head.txt
>
> 1. I use SPP instead of global average pooling, because currently this repo cannot support multi-scale training when global average pooling is used as an intermediate layer.
> 2. Since YOLO is a one-stage object detector, I add a sam layer for each feature pyramid level.

WongKinYiu (Collaborator)

@AlexeyAB

Thank you for sharing a good model (yolo_v3_tiny_pan3_aa_ae_mixup_scale_giou).

After discussing with my team, I cannot share the backbone of #3708 (comment) currently.
I will add the modified-cem-sam head to yolo-v3-tiny and share the cfg later.

WongKinYiu (Collaborator)

@AlexeyAB
I am now training yolo_v3_tiny_pan3_aa_ae_mixup_scale_giou (no sgdr) on the COCO dataset.
I will report the result after training finishes.


AlexeyAB commented Sep 7, 2019

@WongKinYiu Try to increase assisted_excitation=4000 to assisted_excitation=20000 or 50000


WongKinYiu commented Sep 17, 2019

COCO test-dev

| Model | Size | AP@.5:.95 | AP@.5 | AP@.75 |
| --- | --- | --- | --- | --- |
| yolo_v3_tiny_pan3_aa_ae_mixup_scale_giou (no sgdr).txt | 416x416 | 18.8% | 36.8% | 17.5% |

AlexeyAB (Owner)

COCO test-dev

| Model | Size | BFLOPS | Inference time, ms | AP@.5:.95 | AP@.5 | AP@.75 |
| --- | --- | --- | --- | --- | --- | --- |
| yolo_v3_tiny_pan3_aa_ae_mixup_scale_giou (no sgdr).txt | 416x416 | 8.4 | 6.4 | 18.8% | 36.8% | 17.5% |
| yolov3-tiny-prn.cfg.txt | 416x416 | 3.5 | 3.8 | - | 33.1% | - |
| enet-coco.cfg.txt | 416x416 | 3.7 | 22.7 | - | 45.5% | - |

@nyj-ocean

@AlexeyAB
I downloaded the latest repo and set the following in the Makefile:

GPU=1
CUDNN=1
CUDNN_HALF=1
OPENCV=1
AVX=0
OPENMP=0
LIBSO=0
ZED_CAMERA=0

Then I used yolov3-tiny-sam.cfg.txt from #3708 (comment) to train on my own dataset,
but got an error like the following:

Total BFLOPS 4.883
Allocate additional workspace_size = 52.43 MB
Loading weights from /home/gc/4-images/9.18/darknet/yolov3-tiny.conv.15...
seen 64
Done! Loaded 23 layers from weights-file
Learning Rate: 0.001, Momentum: 0.9, Decay: 0.0005
If error occurs - run training with flag: -dont_show
Resizing
608 x 608
Resizing type 16
Cannot resize this type of layer:
darknet: ./src/utils.c:297:error:

WongKinYiu (Collaborator)

@nyj-ocean

Yes, you should modify the resize function of the sam layer; otherwise you can only train it with random=0.


nyj-ocean commented Jan 9, 2020

@WongKinYiu

  • sam layers cannot be trained with multi-scale (random=1), is that right?

  • How do I modify the resize function of the sam layer so it can train with random=1?

WongKinYiu (Collaborator)

@nyj-ocean

Yes.

Just add a case for the sam layer in the resize function in network.c.
The resize function is already defined in sam_layer.c,
so you can simply call it; only a small modification is needed.
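
A minimal sketch of what that extra case might look like inside resize_network() in network.c, assuming the function exported by sam_layer.c follows the usual naming pattern (resize_sam_layer) and the layer type enum value is SAM; check the actual names in your checkout:

/* in resize_network(), alongside the existing per-layer-type cases */
}else if(l.type == SAM){
    resize_sam_layer(&l, w, h);   /* defined in sam_layer.c */
}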

@nyj-ocean

@WongKinYiu
Thanks a lot


nyj-ocean commented Jan 9, 2020

@AlexeyAB
I noticed another module: CBAM (Convolutional Block Attention Module).
[images: cbam1, cbam2]

Is there a need to add the CBAM module to this repo?

CBAM: Convolutional Block Attention Module.pdf
code: https://github.com/Jongchan/attention-module


WongKinYiu commented Jan 9, 2020

The kernel functions of the CAM module and the SAM module are SE (squeeze-and-excitation) and SAM, respectively, which are already supported by this repo.


AlexeyAB commented Jan 9, 2020

I added resizing (random=1) for [sam] layers.

@924175302

@nyj-ocean
Hello, have you tried the CBAM module in YOLOV4?

@924175302

@WongKinYiu
You mentioned that this repo already supports the SE module, but I can't find the relevant code or how to use it. Could you help me with this? Thank you.

WongKinYiu (Collaborator)

Squeeze-and-Excitation blocks (layers: [avgpool]->[conv]->[conv]->[scale_channels])

AlexeyAB (Owner)

@924175302 Example of SE
https://github.com/raw/AlexeyAB/darknet/master/cfg/enet-coco.cfg

#squeeze-n-excitation
[avgpool]

# squeeze ratio r=16 (recommended r=16)
[convolutional]
filters=24
size=1
stride=1
activation=swish

# excitation
[convolutional]
filters=384
size=1
stride=1
activation=logistic

# multiply channels
[scale_channels]
from=-4

@cenit cenit closed this as completed Jan 23, 2021
@tony71200

> @924175302 Example of SE https://github.com/raw/AlexeyAB/darknet/master/cfg/enet-coco.cfg
> [SE cfg example quoted above]

[chart: chart_modified_yolo_se_20211017]

Hi everyone, I have a problem when I train a model that uses SE in the architecture: the program cannot calculate mAP. I don't know why. Please help me.
