Hi, I am training a policy for object grasping with a KUKA arm in Isaac Lab. After the same number of training iterations, the results from SB3 and rl_games are similar; however, SB3 takes much longer than rl_games to run the same number of iterations. Is this expected? If so, do you have any suggestions for improving SB3's training speed?

Replies: 1 comment, 1 reply
Hi @LyuJZ, yes, this is a known phenomenon. Please see Section V-A of the paper: https://arxiv.org/abs/2301.04195. TL;DR: SB3 expects its data buffers to be NumPy arrays, which adds overhead from converting torch tensors that live on the GPU to NumPy arrays and back on every step.
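For a rough sense of where that time goes, here is a minimal, self-contained sketch (not Isaac Lab or SB3 code; the tensor sizes are made up for illustration) that times the GPU-to-host round trip that NumPy-based buffers force on every step:

```python
import time

import torch

# Minimal sketch of the per-step conversion overhead described above:
# if rollout buffers live on the CPU as NumPy arrays, observations
# produced on the GPU must be copied to host memory, and actions copied
# back, every step. Sizes here are illustrative, not from Isaac Lab.
device = "cuda" if torch.cuda.is_available() else "cpu"
obs = torch.randn(4096, 64, device=device)  # e.g. 4096 parallel envs

start = time.perf_counter()
for _ in range(1000):
    obs_np = obs.cpu().numpy()                        # GPU -> host copy
    obs_gpu = torch.as_tensor(obs_np, device=device)  # host -> GPU copy
if device == "cuda":
    torch.cuda.synchronize()  # wait for pending GPU copies before timing
print(f"1000 round trips took {time.perf_counter() - start:.3f}s")
```

A library that keeps its buffers on the GPU, as rl_games does, avoids these copies entirely, which is consistent with it finishing the same number of iterations faster.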