Does CSP model need to increase training times #26

zsgj-Xxx · 2020-05-14T13:17:23Z

Due to the limitation of GPU devices, I only tested the model with epoch = 1, and found that compared with the traditional resnext model, the result of cspresnext model for an epoch is not satisfactory. Is it because of the residual link used that the model needs more time to learn

WongKinYiu · 2020-05-14T16:52:39Z

Hello,

I have not checked converge speed of models with and without CSP.
However, all of my experiments follow the same setting as https://pjreddie.com/darknet/imagenet/.
So the training epochs are totally same.

zsgj-Xxx · 2020-05-14T19:51:45Z

Thank you very much for your reply,

I want to do some small tests with CSP
I tried to copy it on the pytorch, but the parameters were worse, I haven't found any problems yet
How to modify the CSP method based on resnext?

WongKinYiu · 2020-05-15T00:55:18Z

the topology of resnet, resnext, and darknet are almost same.
#24 (comment) is for your reference.

zsgj-Xxx · 2020-05-15T03:06:03Z

Thank you for your work,

I just need to replace darknet_layer with resne(x)t_layer to get the result I need?:heart_eyes:

zsgj-Xxx · 2020-05-15T03:50:23Z

In addition, in this figure, after maxpooling, is ① CSP? But I think the parameter displayed is not split, but copy

WongKinYiu · 2020-05-15T04:26:37Z

yes.

i think there will be a convolutional layer behind ①. more details: #18

zsgj-Xxx · 2020-05-18T09:27:49Z

I'm sorry that I've read the paper and the cfg file over and over again, but I still don't understand it

14 * 14 * 1024 - > whether two 7 * 7 * 1024 branches have also been trained

It looks like

WongKinYiu · 2020-05-18T13:38:56Z

14x14 is belong to partial transition layer in previous stage.

zsgj-Xxx closed this as completed May 26, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does CSP model need to increase training times #26

Does CSP model need to increase training times #26

zsgj-Xxx commented May 14, 2020

WongKinYiu commented May 14, 2020

zsgj-Xxx commented May 14, 2020

WongKinYiu commented May 15, 2020

zsgj-Xxx commented May 15, 2020

zsgj-Xxx commented May 15, 2020

WongKinYiu commented May 15, 2020

zsgj-Xxx commented May 18, 2020

WongKinYiu commented May 18, 2020

Does CSP model need to increase training times #26

Does CSP model need to increase training times #26

Comments

zsgj-Xxx commented May 14, 2020

WongKinYiu commented May 14, 2020

zsgj-Xxx commented May 14, 2020

WongKinYiu commented May 15, 2020

zsgj-Xxx commented May 15, 2020

zsgj-Xxx commented May 15, 2020

WongKinYiu commented May 15, 2020

zsgj-Xxx commented May 18, 2020

WongKinYiu commented May 18, 2020