Skip to content

Modularized Implementation of Deep RL Algorithms in PyTorch

License

Notifications You must be signed in to change notification settings

ShangtongZhang/DeepRL

 
 

Repository files navigation

This branch is the code for the paper

Generalized Off-Policy Actor-Critic
Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson (NeurIPS 2019)

.
├── Dockerfile                                      # Dependencies
├── requirements.txt                                # Dependencies
├── MDP.py                                          # Two-circle MDP 
├── job.py                                          # Entrance for the Mujoco experiments
|   ├── batch                                       # Start Geoff-PAC and baseline algorithms
|   ├── geoff_pac                                   # Entrance of Geoff-PAC / ACE / Off-PAC
├── deep_rl/agent/GeoffPAC_agent.py                 # Implementation of Geoff-PAC / ACE / Off-PAC
└── plot_paper.py                                   # Plotting

I can send the data for plotting via email upon request.

This branch is based on the DeepRL codebase and is left unchanged after I completed the paper. Algorithm implementations not used in the paper may be broken and should never be used. It may take extra effort if you want to rebase/merge the master branch.