Skip to content

Tags: rail-berkeley/rlkit

Tags

v0.2.1

Toggle v0.2.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Change sampling method from randint to choice in Replay and robustify…

… policy networks in SAC (#111)

* Introduced possibility to change alpha parameter

* Fix sum operation which causes trouble for more that two batch dimensions

* Replace randint with choice to avoid duplicates

* Added replace as an option to the replay buffer and a warning if desired behaviour is not possible

v0.2.0

Toggle v0.2.0's commit message
Initial v0.2.0 code

v0.1.2

Toggle v0.1.2's commit message
Saved version before v0.2 with RIG and HER

v0.1

Toggle v0.1's commit message
Initial version built off of pytorch v0.3

v0.1.1

Toggle v0.1.1's commit message
upgrade to MuJoCo 1.5

v0.1.0

Toggle v0.1.0's commit message
First tagged version