Miffyli

Anssi Miffyli

Researcher at Meta Fundamental Artificial Intelligence Research (FAIR), working on reinforcement learning.

242 followers · 58 following

@facebookresearch
London, UK

Achievements

x3 x3 x2

Achievements

x3 x3 x2

Organizations

Stars

37 stars written in Python

Clear filter

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 8,817 1,674 Updated Sep 18, 2024

hill-a / stable-baselines

Forked from openai/baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 4,133 723 Updated Sep 4, 2022

DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,002 510 Updated Aug 6, 2024

hauntsaninja / pyp

Easily run Python at the shell! Magical, but never mysterious.

Python 1,412 39 Updated May 27, 2024

openai / Video-Pre-Training

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,281 142 Updated Jun 10, 2024

HumanCompatibleAI / imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Python 1,277 244 Updated Aug 6, 2024

pfnet / pfrl

PFRL: a PyTorch-based deep reinforcement learning library

Python 1,183 157 Updated Aug 4, 2024

alex-petrenko / sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 809 109 Updated Aug 30, 2024

TeaPearce / Conditional_Diffusion_MNIST

Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.

Python 614 68 Updated Jan 7, 2024

Stable-Baselines-Team / stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Python 463 173 Updated Aug 13, 2024

danijar / crafter

Benchmarking the Spectrum of Agent Capabilities

Python 375 63 Updated Jan 23, 2024

pytorch-labs / LeanRL

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 346 9 Updated Sep 27, 2024

TeaPearce / Counter-Strike_Behavioural_Cloning

IEEE CoG & NeurIPS workshop paper 'Counter-Strike Deathmatch with Large-Scale Behavioural Cloning'

Python 327 43 Updated Sep 6, 2024

Stable-Baselines-Team / stable-baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 282 60 Updated Apr 29, 2023

google-research / falken

Falken provides developers with a service that allows them to train AI that can play their games

Python 253 35 Updated Sep 13, 2024

utilForever / baba-is-auto

Baba Is You simulator using C++ with some reinforcement learning

Python 151 19 Updated Apr 23, 2023

Wesleyliao / QWOP-RL

Python 136 27 Updated Apr 1, 2021

AIcrowd / music-demixing-challenge-starter-kit

Starter kit for getting started in the Music Demixing Challenge.

Python 134 44 Updated Jul 30, 2021

kachayev / pyage2

"Age of Empires II" Learning Environment

Python 63 8 Updated Jul 12, 2021

vwxyzjn / PPO-Implementation-Deep-Dive

DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details

Python 44 3 Updated Apr 14, 2022

Bam4d / Neural-Game-Engine

Code to reproduce Neural Game Engine experiments and pre-trained models

Python 40 1 Updated Jun 22, 2022

entity-neural-network / entity-gym

Standard interface for entity based reinforcement learning environments.

Python 35 5 Updated Feb 28, 2024

AaltoVision / realant-rl

Reinforcement learning with RealAnt: an open-source low-cost quadruped

Python 30 8 Updated Jan 11, 2022

joonaspu / video-game-behavioural-cloning

Behavioural cloning experiments with video games

Python 30 5 Updated Apr 15, 2020

chscheller / sc2_imitation_learning

StarCraft 2 Imitation Learning

Python 29 3 Updated Jul 2, 2021

polarbart / SuperHexagonAI

An AI for the game Super Hexagon based on reinforcement learning

Python 27 4 Updated Feb 7, 2021

Miffyli / gan-aimbots

Code for the experiments done in the paper "GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters"

Python 23 7 Updated May 13, 2022

amiranas / minerl_imitation_learning

Python 21 6 Updated Jul 14, 2020

Miffyli / policy-supervectors

Creating fixed-length vectors to describe RL/GA policies

Python 20 Updated Oct 23, 2021

MichalOp / MineRL2020

Python 16 5 Updated Aug 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anssi Miffyli

Achievements

Achievements

Organizations

Block or report Miffyli

Stars

DLR-RM / stable-baselines3

hill-a / stable-baselines

DLR-RM / rl-baselines3-zoo

hauntsaninja / pyp

openai / Video-Pre-Training

HumanCompatibleAI / imitation

pfnet / pfrl

alex-petrenko / sample-factory

TeaPearce / Conditional_Diffusion_MNIST

Stable-Baselines-Team / stable-baselines3-contrib

danijar / crafter

pytorch-labs / LeanRL

TeaPearce / Counter-Strike_Behavioural_Cloning

Stable-Baselines-Team / stable-baselines

google-research / falken

utilForever / baba-is-auto

Wesleyliao / QWOP-RL

AIcrowd / music-demixing-challenge-starter-kit

kachayev / pyage2

vwxyzjn / PPO-Implementation-Deep-Dive

Bam4d / Neural-Game-Engine

entity-neural-network / entity-gym

AaltoVision / realant-rl

joonaspu / video-game-behavioural-cloning

chscheller / sc2_imitation_learning

polarbart / SuperHexagonAI

Miffyli / gan-aimbots

amiranas / minerl_imitation_learning

Miffyli / policy-supervectors

MichalOp / MineRL2020