Skip to content
View jannerm's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report jannerm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. diffuser diffuser Public

    Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

    Python 770 118

  2. trajectory-transformer trajectory-transformer Public

    Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

    Python 445 61

  3. gamma-models gamma-models Public

    Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"

    Python 39 6

  4. mbpo mbpo Public

    Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

    Python 467 83

  5. ddpo ddpo Public

    Code for the paper "Training Diffusion Models with Reinforcement Learning"

    Python 299 25

  6. berkeleydeeprlcourse/homework_fall2020 berkeleydeeprlcourse/homework_fall2020 Public

    Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)

    Jupyter Notebook 249 246