-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Doesn't work with the current master branch of caffe #1
Comments
When you say the network blows up do you mean that the gradients explode? I can't access my computer with a GPU right now so I can only do CPU training. But running it a few times using the current master branch and your suggested commit, I don't see anything obvious that makes me think it's not training properly. The error plot slowly trends up. |
Hi Trevor, Ya I'm seeing the gradients explode. Maybe it's something with my setup. Are you using the current head of caffe/master? On Sat, Dec 19, 2015 at 8:46 AM, Trevor Barron [email protected]
|
Yeah, I'm fully up to date with caffe/master. If it helps I'm on Ubuntu 14.04 using openblas. I will check the training again later just to make sure. |
Cool! This is very helpful. Maybe it's the nvidia stuff. That would be On Sun, Dec 20, 2015 at 12:01 PM, Trevor Barron [email protected]
|
I'll have to try this on the GPU when I get home later this week. I tried interfacing with a minecraft game and I do get exploding gradients on the head of caffe/master but not with the commit from September. It's possible I introduced a bug in the interface but it's likely something else is still off. I'm not sure why it would give me problems with this setup but not before. |
Hi Trevor, Ya your hitting the problem. I see the same thing. Something seems to On Wed, Dec 23, 2015 at 10:13 AM, Trevor Barron [email protected]
|
Hello watts4, great work. Is the issue resolved? |
Hi Author,
I haven't tried things for a year now. There's no reason things shouldn't
be working with the latest caffe. My guess is the issue is something to do
with the learning rates.
W
…On Sat, Dec 31, 2016 at 12:39 PM, joyousrabbit ***@***.***> wrote:
Hello watts4, great work. Is the issue resolved?
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADg-pRUZmlF9tRmrL-iQwFdMyZmd_79pks5rNr2QgaJpZM4G1TOR>
.
|
@watts4speed You are right, it's learning rate. After changing base_lr from 0.1 to 0.01 without clip_gradient, it works fine. For game breakout, it gives score 45 in training, however, in evaluation, it reachs only 3 (it stays at left or right, almost never move) why? |
Thanks for checking this out!
…On Sat, Dec 31, 2016 at 9:19 PM, joyousrabbit ***@***.***> wrote:
@watts4speed <https://github.com/watts4speed> You are right, it's
learning rate. After changing base_lr from 0.1 to 0.01 without
clip_gradient, it works fine.
The reason is, I guess, some commit in caffe fixed or optimized the
performance (the newer version should have a higher default learning rate).
So the old base_lr 0.1 is too big for the new caffe release.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADg-pR4QJCeCyZVSczTKqSfYO-7J6JJlks5rNzd1gaJpZM4G1TOR>
.
|
Can this be due to using solver prototxt parameter 'solver_type' which is deprecated in newer versions of Caffe? It seems setting solver_type to ADADELTA has no effect and default SGD is used as the solver. |
I'm not sure what the issue was. See joyousrabbit suggestion about
changing the LR. It's been a while since I worked on this so I'm not up to
date on things well enough to say.
…On Fri, May 26, 2017 at 8:35 PM, chshong ***@***.***> wrote:
Can this be due to using solver prototxt parameter 'solver_type' which is
deprecated in newer versions of Caffe? It seems setting solver_type to
ADADELTA has no effect and default SGD is used as the solver.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADg-pVim-KTW0fv4LIq102BtwLaceiKSks5r95nxgaJpZM4G1TOR>
.
|
The network blows up and doesn't train when using the current head of caffe/master. Any help or insight into what's going on is appreciated.
The text was updated successfully, but these errors were encountered: