Evaluation for die reorientation #26
Comments
Just chiming in to say that I agree here. Based on my experiments, the current die reorientation task doesn't seem generally solvable within 50 time steps.
Thank you for the reply. I didn't consider how the variable horizon length affects the effort calculation. Thinking about it, 200 steps might even be slightly too long then. In my experiments, it's very difficult for the policy to stabilize an object within the tight thresholds during the right time window. But it's hard to say.
Closed with #29.
I feel that the evaluation criteria for the die reorientation task are a bit restrictive. Some of my policies are able to solve the task, but only before or after the specific time window that would count as a success.
Would it be possible to relax this a bit?
I suggest an episode limit of 200 and measuring a success if the goal is reached for 5 consecutive time steps at any point in the episode.
This preserves the spirit of the task, but is a bit easier.
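To make the suggestion concrete, here is a minimal sketch of the proposed success check. It assumes the environment can report, for each time step, whether the goal thresholds are currently met. The function name `is_success` and its parameters are hypothetical and are not taken from the project's evaluation code.

```python
from typing import Sequence


def is_success(
    goal_reached: Sequence[bool],
    required_consecutive: int = 5,  # proposed: 5 consecutive time steps
    episode_limit: int = 200,       # proposed: episode limit of 200
) -> bool:
    """Return True if the goal is held for `required_consecutive` steps in a row.

    `goal_reached[t]` is assumed to be True when the goal thresholds are met
    at time step t. Steps beyond `episode_limit` are ignored.
    """
    streak = 0
    for reached in goal_reached[:episode_limit]:
        # Extend the streak on a hit, reset it on a miss.
        streak = streak + 1 if reached else 0
        if streak >= required_consecutive:
            return True
    return False
```

With this check, a policy that reaches and holds the goal early or late in the episode still counts as a success, as long as the streak finishes within the episode limit.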