Shahdsaf / Semi-Supervised-World-Models Public

Notifications You must be signed in to change notification settings
Fork 1
Star 6

A new version of world models using Echo-state networks and random weight-fixed CNNs

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
docs		docs
rcrc		rcrc
train_in_the_dream		train_in_the_dream
vanilla_ppo		vanilla_ppo
vm		vm
vrc		vrc
README.md		README.md
env.py		env.py

Repository files navigation

World Models with PyTorch

A new version of world models using Echo-state networks and random weight-fixed CNNs in Pytorch. Also, the controller leverages RL algorithms, e.g. PPO methods.

Requirement

To run the code, you need

pytorch
gym

Method

Every action will be repeated for 8 frames. To get velocity information, state is defined as adjacent 4 frames in shape (4, 96, 96). Use a two heads FCN to represent the actor and critic respectively. The actor outputs α, β for each actin as the parameters of Beta distribution.

Training

Start a Visdom server with python -m visdom.server, it will serve http://localhost:8097/ by default.

To train the agent, runpython train.py --render --vis or python train.py --render without visdom. To test, run python test.py --render.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

World Models with PyTorch

Requirement

Method

Training

Performance

About

Releases

Packages

Contributors 2

Languages

Shahdsaf/Semi-Supervised-World-Models

Folders and files

Latest commit

History

Repository files navigation

World Models with PyTorch

Requirement

Method

Training

Performance

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages