hbonnavaud/Simple-E-greedy

Simple-E-greedy

A simple tabular Q-learning agent that uses epsilon-greedy exploration on the FrozenLake environment from OpenAI Gym.
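For readers who want to see the idea in code, here is a minimal sketch of tabular Q-learning with epsilon-greedy action selection. It is not the repository's implementation: to stay dependency-free it replaces the Gym environment with a small hand-written, deterministic (non-slippery) 4x4 lake, and the names `step`, `epsilon_greedy`, `train`, `greedy_rollout`, as well as every hyperparameter value, are assumptions chosen for illustration.

```python
import random

# Classic 4x4 lake layout: S=start, F=frozen, H=hole, G=goal.
LAKE = ["SFFF",
        "FHFH",
        "FFFH",
        "HFFG"]
SIZE = 4
# Actions as (d_row, d_col): 0=left, 1=down, 2=right, 3=up.
ACTIONS = [(0, -1), (1, 0), (0, 1), (-1, 0)]


def step(state, action):
    """Deterministic stand-in for the environment: (next_state, reward, done)."""
    row, col = divmod(state, SIZE)
    d_row, d_col = ACTIONS[action]
    row = min(max(row + d_row, 0), SIZE - 1)  # moving into a wall keeps the agent in place
    col = min(max(col + d_col, 0), SIZE - 1)
    tile = LAKE[row][col]
    return row * SIZE + col, float(tile == "G"), tile in "HG"


def epsilon_greedy(q_row, epsilon, rng):
    """Random action with probability epsilon, otherwise a best action (ties broken at random)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    return rng.choice([a for a, value in enumerate(q_row) if value == best])


def train(episodes=2000, alpha=0.1, gamma=0.95, eps_start=1.0,
          eps_min=0.05, eps_decay=0.995, max_steps=100, seed=0):
    """Run Q-learning; returns the Q-table and one success flag per episode."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in range(SIZE * SIZE)]
    epsilon = eps_start
    successes = []
    for _ in range(episodes):
        state, reward = 0, 0.0
        for _ in range(max_steps):
            action = epsilon_greedy(q[state], epsilon, rng)
            next_state, reward, done = step(state, action)
            # Q-learning target: bootstrap from the best next action unless terminal.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
        successes.append(reward == 1.0)
        epsilon = max(eps_min, epsilon * eps_decay)  # assumed multiplicative decay
    return q, successes


def greedy_rollout(q, max_steps=100):
    """Follow the greedy policy from the start; True if the goal is reached."""
    state = 0
    for _ in range(max_steps):
        action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        state, reward, done = step(state, action)
        if done:
            return reward == 1.0
    return False


if __name__ == "__main__":
    q_table, outcomes = train()
    print("successes in last 20 episodes:", sum(outcomes[-20:]))
    print("greedy policy reaches goal:", greedy_rollout(q_table))
```

With the real Gym environment, `step` would be replaced by the environment's own reset and step calls; the update rule and the action selection stay the same.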

Result:

The red line shows how the epsilon value evolves over time. The blue line shows the average success rate on the goal-reaching task over the last 20 episodes. The x-axis shows the episode id + 20.

[Image: plot of epsilon (red) and average success rate (blue) per episode]
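The blue curve is a moving average, which can be sketched as follows. This is an illustration under assumptions, not the repository's plotting code: `rolling_success_rate` is a hypothetical helper, and it emits a value only once a full window of 20 episodes is available, which may be why the x-axis is offset by 20.

```python
def rolling_success_rate(successes, window=20):
    """Mean of the last `window` outcomes, one value per complete window."""
    rates = []
    running = 0
    for i, outcome in enumerate(successes):
        running += int(outcome)
        if i >= window:
            # Drop the outcome that just left the window.
            running -= int(successes[i - window])
        if i >= window - 1:
            rates.append(running / window)
    return rates


# Toy data: 20 failed episodes followed by 20 successful ones.
outcomes = [False] * 20 + [True] * 20
rates = rolling_success_rate(outcomes)
print(len(rates), rates[0], rates[10], rates[-1])  # 21 0.0 0.5 1.0
```

Each entry `rates[k]` summarises episodes `k` through `k + 19`, so it can be plotted against the id of the episode that closes the window.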
