rlkit

Additions:

Reinforcement learning framework and algorithms implemented in PyTorch.

Some implemented algorithms:

Temporal Difference Models (TDMs)
Deep Deterministic Policy Gradient (DDPG)
- example script
- DDPG paper
(Double) Deep Q-Network (DQN)
Soft Actor Critic (SAC)
Twin Dueling Deep Determinstic Policy Gradient (TD3)
- example script
- TD3 paper

To get started, checkout the example scripts, linked above.

Install and use the included ananconda environment

$ conda env create -f rlkit-env.yml
$ source activate rlkit-env

A lot of the coding infrastructure is based on rllab. The serialization and logger code are basically a carbon copy of the rllab versions.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
examples		examples
rlkit		rlkit
scripts		scripts
README.md		README.md
rlkit-env.yml		rlkit-env.yml