In this assignment, a modified version of deep Q-learning from DeepMind's paper is implemented. In the environment, the player controls a paddle that moves horizontally and earns rewards by bouncing a ball into bricks to break them. We use MinAtar ([7]), a miniaturized version of the original Atari games: instead of the original 210 × 160 RGB frames, MinAtar uses a 10 × 10 grid of boolean channels, which makes it possible to use a significantly smaller model while still achieving good performance.
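As a quick illustration of the environment interface, here is a minimal sketch of stepping through MinAtar Breakout. It assumes the `minatar` Python package; the random action is only for illustration and is not part of the assignment code.

```python
# Minimal MinAtar Breakout interaction sketch (assumes `pip install minatar`).
import numpy as np
from minatar import Environment

env = Environment("breakout")
env.reset()

state = env.state()          # boolean grid: 10 x 10 x n_channels
print(state.shape)           # (10, 10, 4) for Breakout

# Take one random action and observe the reward and terminal flag.
action = np.random.randint(env.num_actions())
reward, terminal = env.act(action)
print(reward, terminal)
```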
The network architecture is as follows:
- One convolution layer with 16 output channels, a kernel size of 3, stride 1, and no padding.
- A ReLU activation.
- A dense layer with 128 hidden units.
- Another ReLU activation.
- The final output layer, with one output per action.

The corresponding code is in q4_nature_torch.py and q5_nature_torch.py; a minimal sketch of this architecture follows.
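The sketch below is a hedged PyTorch illustration of the architecture above. The class name, the 4 input channels (MinAtar Breakout's object channels), and the 6 actions are assumptions for illustration, not the exact code in q4_nature_torch.py / q5_nature_torch.py.

```python
# Hedged sketch of the small nature-style Q-network described above.
import torch
import torch.nn as nn

class NatureQN(nn.Module):
    def __init__(self, in_channels: int = 4, num_actions: int = 6):
        super().__init__()
        # 10x10 input, 3x3 kernel, stride 1, no padding -> 8x8 feature maps
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=3, stride=1),
            nn.ReLU(),
            nn.Flatten(),
            nn.Linear(16 * 8 * 8, 128),   # dense layer with 128 hidden units
            nn.ReLU(),
            nn.Linear(128, num_actions),  # final output layer: one Q-value per action
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, 10, 10); MinAtar states are HWC booleans,
        # so they must be cast to float and permuted to CHW first.
        return self.net(x)

q_net = NatureQN()
print(q_net(torch.zeros(1, 4, 10, 10)).shape)  # torch.Size([1, 6])
```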
The result of the linear approximation (code: q6_train_atari_linear.py):
The result of the neural network approximation (code: q6_train_atari_nature.py):
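For reference, the linear baseline amounts to a single fully connected layer over the flattened state. Here is a minimal sketch under the same assumptions as above (class name, channel and action counts are illustrative, not the exact code in q6_train_atari_linear.py):

```python
# Hedged sketch of the linear Q-function baseline.
import torch
import torch.nn as nn

class LinearQN(nn.Module):
    def __init__(self, in_channels: int = 4, num_actions: int = 6):
        super().__init__()
        # A single dense layer over the flattened 10 x 10 x C state.
        self.fc = nn.Sequential(
            nn.Flatten(),
            nn.Linear(in_channels * 10 * 10, num_actions),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.fc(x)

print(LinearQN()(torch.zeros(1, 4, 10, 10)).shape)  # torch.Size([1, 6])
```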
As the results show, the average reward of the neural network approximation is higher than that of the linear approximation, but its standard deviation is also higher, which points to the less stable behavior of the neural network approximation.
- Test different hyperparameters for training.
- Implement different model structures for the neural network approximation.