MPC GPS #81
Conversation
Conflicts: python/tests/test_box2d/test_box2d.py
1. This MPC's mission is to follow the offline trajectory. 2. Regularize Quu by a small eta to make it positive definite. 3. When using the raw cost, it must be adjusted around the sample; that is the important thing to make the gradient of the cost work. 4. Still not sure about this MPC, because the original MPC runs online, meaning that x0 = current_x (the feedback state), and it then minimizes the same cost function as the offline trajectory over a shorter horizon. On the other hand, when using this MPC in the test phase, it seems we just need to find which MPC (at time t) current_x belongs to and then call MPC[m].act(...), so NO OPTIMIZATION OCCURS AT RUNTIME?
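As a sketch of the Quu regularization mentioned in point 2 — adding a small eta times the identity until the matrix passes a positive-definiteness check — here is a minimal NumPy helper; the function name, defaults, and retry loop are illustrative, not the PR's actual code:

```python
import numpy as np

def regularize_quu(Quu, eta=1e-6, max_tries=10):
    """Add eta * I to Quu until it is positive definite (hypothetical
    helper; names and defaults are illustrative, not from the PR)."""
    dim = Quu.shape[0]
    for _ in range(max_tries):
        try:
            # Cholesky succeeds only for positive-definite matrices.
            np.linalg.cholesky(Quu)
            return Quu
        except np.linalg.LinAlgError:
            Quu = Quu + eta * np.eye(dim)
            eta *= 10.0  # grow the regularizer if still not PD
    raise ValueError("Quu could not be made positive definite")

# Example: a slightly indefinite 2x2 matrix becomes PD after regularization.
Q = np.array([[1.0, 0.0], [0.0, -1e-7]])
Q_pd = regularize_quu(Q)
```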
Compute the gradient of the cost using the feedback state x0 for all t in the short horizon.
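The "adjust around the sample" point above can be sketched as a quadratic expansion of the cost taken around the sampled state, with the gradient then evaluated at the feedback state x0; the names below are illustrative, not the PR's API:

```python
import numpy as np

def cost_grad_at_feedback(lx, lxx, x_sample, x_feedback):
    """Gradient of a quadratically expanded cost, re-centered on the
    feedback state. The expansion is taken around the sample, so the
    gradient at the feedback state is lx + lxx @ (x_feedback - x_sample).
    (Illustrative helper, not the PR's actual code.)"""
    return lx + lxx.dot(x_feedback - x_sample)

# Example with l(x) = 0.5 * ||x||^2 expanded around x_sample:
x_s = np.array([1.0, 2.0])
lx = x_s.copy()          # gradient of this cost at the sample
lxx = np.eye(2)          # Hessian of the quadratic cost
x_fb = np.array([1.5, 2.5])
g = cost_grad_at_feedback(lx, lxx, x_s, x_fb)
```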
TODO: Answer why the policy drifts too far from the trajectory at iterations > 10. It happens in all point-mass worlds (and a little in the arm world).
This inherits from and modifies robotplugin.cpp in gps_agent_pkg. It uses min_distance_to_obstacle from turtlebot_mpepc to measure the distance to the nearest obstacle. RESULT: It can already move the robot at the desired velocity and orientation, but it cannot avoid obstacles yet. TODO: Move updateObstacleTree to a separate node, like costmap_server in Matt Derry's original package. Check the thresh in min_distance_to_obstacle and dsafe in the hyperparams.
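A minimal sketch of the kind of obstacle penalty this plugin could feed into the cost, assuming a hinge-squared term on the min_distance_to_obstacle reading with dsafe as the safety margin (the weight, default values, and function name are illustrative, not from the PR):

```python
def obstacle_cost(d_min, d_safe=0.5, weight=1.0):
    """Hinge-squared penalty on the distance to the nearest obstacle.
    d_min would come from min_distance_to_obstacle; d_safe plays the
    role of dsafe in the hyperparams. Zero cost outside the margin,
    quadratically increasing cost inside it. (Illustrative sketch.)"""
    violation = max(0.0, d_safe - d_min)
    return weight * violation ** 2

far = obstacle_cost(1.0)   # well beyond d_safe: no penalty
near = obstacle_cost(0.1)  # inside the safety margin: penalized
```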
…admm QUESTION: Why does pol_wt (nu) need to be scaled using the median, like tgt_mu in policy_opt_caffe?
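For context on the question, the median-based scaling can be sketched as follows: rescale the weights to average 1 and clip them at a multiple of a robust median, so a few huge weights cannot dominate the regression. This is an illustrative reconstruction of that style of normalization, not the exact policy_opt_caffe code:

```python
import numpy as np

def median_normalize(weights, cap=2.0):
    """Rescale weights to mean 1 and clip at cap times the robust
    median of the non-negligible weights (illustrative sketch of the
    style of target-weight scaling in policy_opt_caffe; the names and
    the cap factor are assumptions, not the library's API)."""
    w = np.asarray(weights, dtype=float)
    w = w * (w.size / np.sum(w))       # rescale so weights average to 1
    med = np.median(w[w > 1e-2])       # robust median over active weights
    return np.minimum(w, cap * med)    # clip the outliers

w_norm = median_normalize([1.0, 1.0, 1.0, 100.0])  # the outlier is clipped
```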
RESULT: Still working on it ... (Tried MPC_GPS weighted, then PLATO)
…) for the obstacle-avoidance task.
Conflicts: README.md
Conflicts: python/gps/algorithm/cost/config.py src/proto/gps.proto
Thanks for the PR thobotics! @TZ2016 will take the first pass at reviewing. Glancing over the PR, please make sure that you remove commented-out code and remove unneeded changes to parts of the code that MPCGPS does not use.
Hi: I think the reason is that libvtkWrappingTools-6.2.a is a static library, so I tried to add some code to CMakeLists.txt to fix the -fPIC problem like:
but I still get the same error above. I can compile the master branch of gps normally, so how can I fix this compile error?
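One common approach for this class of error, assuming it is the usual relocation complaint about linking a static archive into a shared object, is to build with position-independent code. The following CMake fragment is a hedged suggestion, not a verified fix for this setup; note that if the prebuilt libvtkWrappingTools-6.2.a itself was compiled without -fPIC, no flag in this project can repair it and VTK would need to be rebuilt with the same setting:

```cmake
# Build all targets in this project with -fPIC so they can be
# linked into shared libraries (CMake 2.8.9+).
set(CMAKE_POSITION_INDEPENDENT_CODE ON)
```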
mpc_gps information
This PR implements MPC-Guided Policy Search, as described in [1].
Main contributions:
- `gps_main.py` is modified so the agent can run the MPC trajectory optimizer during sampling.
- `mpc_traj_opt.py` implements the MPC algorithm that minimizes the surrogate cost as in [1]. Note that the policy cost term in the surrogate cost is replaced by the offline trajectory cost if you are using `AlgorithmTrajOpt`.

Experiments [1]:
- `mobilerobot_gps.launch` with 3 world files (set by parameter): `hallway.world`, `hallway_bend.world`, `one_obstacle.world`.
- `turtlebot_hallway_badmm_example`: tries to move the robot along a hallway.
- `turtlebot_badmm_example`: tries to move the robot at a desired velocity while avoiding an obstacle.

Note:
[1] Tianhao Zhang, Gregory Kahn, Sergey Levine, Pieter Abbeel. Learning Deep Control Policies
for Autonomous Aerial Vehicles with MPC-Guided Policy Search. ICRA 2016.
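As a footnote on the surrogate cost minimized by the MPC trajectory optimizer described above: in the spirit of [1], it combines the task cost with a penalty for deviating from the current policy's action. The sketch below assumes a simple quadratic deviation penalty; the function name and the weight `nu` are illustrative, not the PR's actual interface:

```python
import numpy as np

def surrogate_cost(l_xu, u, u_policy, nu=1.0):
    """Task cost l(x, u) plus a quadratic penalty on deviating from the
    policy action, weighted by nu (illustrative sketch of the MPC-GPS
    surrogate cost idea from [1]; under AlgorithmTrajOpt the policy term
    would instead be the offline trajectory cost)."""
    return l_xu + 0.5 * nu * np.sum((u - u_policy) ** 2)

# Example: task cost 1.0, action deviating by 0.2 in one dimension.
c = surrogate_cost(1.0, np.array([0.2, 0.0]), np.array([0.0, 0.0]), nu=2.0)
```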