"Forgetting Learning" using SAC in a Drone environment on PyRep #93

kaelgabriel · 2020-01-17T12:35:50Z

Hi guys, I'm using a drone environment that a friend of mine made using V-REP (and know we are using PyRep).

We got it to converge using Tensorforce PPO and also RL-ADVENTURE2 SAC. I could not make it work on "softlearning" because of the way PyRep uses process/threads.

So it seems to me that the env is legit.

Anyways, when I use RLKIT, even tweaking hyperparameters, things like this happen:

Anyone ever saw this happening?

Thanks for your time.

vitchyr · 2020-01-21T16:12:33Z

Hmm it's hard to say. What hyperparameters are you tuning? And are you using the same hyperparameters as in RL-ADVENTURE2?

kaelgabriel · 2020-01-30T15:48:26Z

@vitchyr, thanks for your answer.

I've figured it out now that is a GPU problem. My tensors in GPU and CPU are different, even casting all to float64 (changing some parts of the library).

So I will have to use CPU for this problem, since I don't have time to continue debugging.

Thanks

kaelgabriel changed the title ~~"Forgetting Learning" using SAC in a Drone's environment on PyRep~~ "Forgetting Learning" using SAC in a Drone environment on PyRep Jan 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"Forgetting Learning" using SAC in a Drone environment on PyRep #93

"Forgetting Learning" using SAC in a Drone environment on PyRep #93

kaelgabriel commented Jan 17, 2020

vitchyr commented Jan 21, 2020

kaelgabriel commented Jan 30, 2020

"Forgetting Learning" using SAC in a Drone environment on PyRep #93

"Forgetting Learning" using SAC in a Drone environment on PyRep #93

Comments

kaelgabriel commented Jan 17, 2020

vitchyr commented Jan 21, 2020

kaelgabriel commented Jan 30, 2020