The loss on the COCO dataset in stage 1 training doesn't decrease #5

Open
BruceW91 opened this issue May 15, 2018 · 6 comments

Comments

@BruceW91

Hi, I ran train_coco_word2_1_pool.m, but after more than 10 epochs the training result is still bad (see below). I didn't change any hyperparameters, and I don't know why it doesn't work.
[Screenshot: qq 20180515115715]
Can you tell me what causes this result? Also, I found that the learning rate in your code is 0.1, but the paper reports 0.001. Which learning rate is correct and works better for this task?

@layumi
Owner

layumi commented May 15, 2018

The effective learning rate = global learning rate * layer learning rate.
On COCO, training may converge slowly. It may take about 20 epochs to start converging.
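
As a rough illustration of the formula above, here is a minimal Python sketch. The function name and the per-layer multiplier value are hypothetical and chosen only for illustration; they are not taken from the repository. Only the global value 0.1 and the paper's 0.001 come from this thread.

```python
def effective_learning_rate(global_lr: float, layer_lr: float) -> float:
    """Return the step size actually applied to one layer's parameters."""
    return global_lr * layer_lr


# Global learning rate reported in this issue as the value in the training script.
global_lr = 0.1

# ASSUMPTION: a per-layer multiplier picked for illustration only.
# The real per-layer values must be read from the network definition.
layer_lr = 0.01

# With these numbers the product is approximately the 0.001 quoted from the paper.
print(effective_learning_rate(global_lr, layer_lr))
```

Under this assumption, a global value of 0.1 in the script and a value of 0.001 in the paper need not contradict each other, since a layer's multiplier scales the global rate.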

@layumi
Owner

layumi commented May 15, 2018

I also notice there is an error in your code? @habbakuk1103
Did you prepare the dataset in the right way?

@BruceW91
Author

Thank you for your response.
I think I prepared the COCO dataset as you described, but my 'coco_dictionary.mat' has a dimension of 29141, whereas it is 29972 in your code. I don't know whether this is because I prepared the dataset in MATLAB R2016b on my Windows computer, since I can't install that version on our Linux server. If possible, could you give me a link to download the preprocessed data shown below?
[Screenshot: qq 20180515202323]
By the way, I am not familiar with MatConvNet. Could you tell me what the 'error' in the result means? I checked the code earlier, and I think the data and labels should be aligned.

@layumi
Owner

layumi commented May 16, 2018

[Image]

I wonder whether you provided the right file path.

@BruceW91
Author

The path of 'prepare_imdb.m' in MATLAB on my server is shown below, but that installation is MATLAB R2014b, which doesn't have the 'jsondecode' function.
[Screenshot 1]
So I ran the program on my own computer instead, where the path changed as shown below.
[Screenshot 2]
I wonder whether this difference in paths causes the problem.

@BruceW91
Author

I have solved the problem. Thank you for your suggestion.

2 participants