Skip to content
View taodav's full-sized avatar

Block or report taodav

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. brownirl/lambda_discrepancy brownirl/lambda_discrepancy Public

    Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

    Python 14

  2. aux-inputs aux-inputs Public

    reinforcement learning with auxiliary inputs

    Jupyter Notebook 1 1

  3. microsoft/TextWorld microsoft/TextWorld Public

    ​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

    Jupyter Notebook 1.2k 188

  4. nsrs nsrs Public

    Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.

    Jupyter Notebook 14 3