Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems #29

Closed
lorrp1 opened this issue Oct 16, 2020 · 10 comments
Closed

Problems #29

lorrp1 opened this issue Oct 16, 2020 · 10 comments

Comments

@lorrp1
Copy link

lorrp1 commented Oct 16, 2020

A

@AminHP
Copy link
Owner

AminHP commented Oct 18, 2020

It seems there is a problem with stable_baselines. I tried to implement a DQN from scratch, and it works just fine with the sin function.

@toksis
Copy link

toksis commented Oct 18, 2020

Can you change the learning rate?

model = DQN(MlpPolicy, env, verbose=1,learning_rate=0.00001)?

I tried to do a learning rate of 1. and it do this.

image

@toksis
Copy link

toksis commented Oct 18, 2020

Learning rate of .0001

image

@toksis
Copy link

toksis commented Oct 18, 2020

Im using the old stable baseline.

@toksis
Copy link

toksis commented Oct 18, 2020

Maybe you can share your sin function code? @lorrp1

@lorrp1
Copy link
Author

lorrp1 commented Oct 21, 2020

@AminHP im getting even worse results with ray

@AminHP
Copy link
Owner

AminHP commented Oct 23, 2020

I suggest you read this article and implement your own DQN and test it with the environment.

@toksis
Copy link

toksis commented Oct 24, 2020

try the zoo in stable baselines.

@lorrp1
Copy link
Author

lorrp1 commented Oct 24, 2020

@toksis are you able to get it working with the sin function there?

@lorrp1 lorrp1 closed this as completed Oct 25, 2020
@toksis
Copy link

toksis commented Oct 26, 2020

Sadly no... I want to try the zoo in stable baselines maybe it will help. But first I will try the article from AdminHP.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants