Dopamine - Research framework for fast prototyping of reinforcement learning algorithms.
OpenAI Gym - Toolkit for developing and comparing reinforcement learning algorithms.
RLlib - Open-source library for reinforcement learning that offers both high scalability and a unified API for a variety of applications.
Stable Baselines - Set of improved implementations of reinforcement learning algorithms based on OpenAI Baselines.
pytorch-a3c - PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
PlaNet - Deep Planning Network: Control from pixels by latent planning with learned dynamics.
Learning to Paint - Painting AI that can reproduce paintings stroke by stroke using deep reinforcement learning.
RL Baselines Zoo - Collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
bsuite - Collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent.
OpenSpiel - Collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
KataGo - Research and experimentation with self-play training in Go.
Catalyst - Reproducible and fast DL & RL.
BCQ - PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration".
TorchBeast - PyTorch Platform for Distributed RL.
rlpyt - Reinforcement Learning in PyTorch.
RLax - Library built on top of JAX that exposes useful building blocks for implementing reinforcement learning agents.
DeepRLHacks - Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp.
prob_mbrl - Library of probabilistic model-based RL algorithms in PyTorch.
PhoenixGo - Go AI program which implements the AlphaGo Zero paper.
TensorTrade - Trade Efficiently with Reinforcement Learning.
AlphaZero.jl - Generic, simple and fast implementation of DeepMind's AlphaZero algorithm. (HN)
TensorSwarm - Framework for reinforcement learning of robot swarms.
mentalRL - A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry.
Coach - Python reinforcement learning framework containing implementation of many state-of-the-art algorithms.
dm_env - DeepMind RL Environment API.
SURREAL - Fully integrated framework that runs state-of-the-art distributed reinforcement learning (RL) algorithms.
Tonic - Deep reinforcement learning library.
TF-Agents - Reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Optax - Gradient processing and optimization library for JAX.
Chex - Library of utilities for helping to write reliable JAX code.
GenRL - PyTorch reinforcement learning library centered around reproducible and generalizable algorithm implementations. (HN) (Docs) (Tutorials) (Reddit)
Stable Baselines3 - PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.
Minigo - Minimalist Go engine modeled after AlphaGo Zero, built on MuGo.
Mathy - Platform for using computer algebra systems to solve math problems step-by-step with reinforcement learning. (Code)
RLCard - Toolkit for Reinforcement Learning in Card Games.
GridRoyale - Life simulation for exploring social dynamics. (HN)
TorchRL - PyTorch Implementation of Reinforcement Learning Algorithms.
AI safety gridworlds - Suite of reinforcement learning environments illustrating various safety properties of intelligent agents.
TensorLayer - Deep Learning and Reinforcement Learning Library for Scientists and Engineers. (Docs)
FitML - Collection of Python Machine Learning articles and examples.
PFRL - PyTorch-based deep reinforcement learning library.
ChainerRL - Deep reinforcement learning library built on top of Chainer.
EvoStrat - Library that makes Evolutionary Strategies (ES) simple to use.
Alpha Zero Boosted - "build to learn" implementation of the Alpha Zero algorithm written in Python that uses LightGBM (Gradient Boosted Decision Trees) in place of a Deep Neural Network for value/policy functions.
XingTian - Componentized library for the development and verification of reinforcement learning algorithms.
mazelab - Customizable framework to create maze and gridworld environments.
DeepMind Lab2D - Flexible and fast engine for rapidly creating 2D environments. Built for RL, and well suited for the needs of multi-agent research. (Paper) (HN)
PettingZoo - Python library for conducting research in multi-agent reinforcement learning. It's akin to a multi-agent version of OpenAI's Gym library.
DeepMind Hard Eight Tasks - Set of 8 diverse machine-learning tasks that require exploration in partially observable environments to solve.
TetrisRL - Tetris environment to train machine learning agents.
dm_env_rpc - Networking protocol for agent-environment communication.
PHYRE - Benchmark for physical reasoning. (Web)
SuperSuit - Easy-to-use micro-wrappers for Gym and PettingZoo based RL Environments.
ViZDoom - Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. (Web)
banditml - Lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
SUMO-RL - Provides a simple interface to instantiate Reinforcement Learning environments with SUMO for Traffic Signal Control.
PyGeneses - PyTorch-based DeepRL framework to train and study artificial species in bio-inspired environments. (Docs) (Article)
CompilerGym - Reinforcement learning toolkit for compiler optimizations. (Docs) (HN)
MuZero General - Commented and documented implementation of MuZero based on the Google DeepMind paper (Nov 2019) and the associated pseudocode.
ReBeL - Algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
NEAT Gym - Learn OpenAI Gym environments using NEAT.
RLStructures - Library to facilitate the implementation of new reinforcement learning algorithms.
FinRL - Deep Reinforcement Learning Library for Quantitative Finance.
ReAgent - Platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.). (Docs)
minimalRL PyTorch - Implementations of basic RL algorithms with minimal lines of code.
h-baselines - High-performing hierarchical reinforcement learning models and algorithms.
CleanRL - High-quality single-file implementation of Deep Reinforcement Learning algorithms with research-friendly features.
MTEnv - MultiTask Environments for Reinforcement Learning.
MADRL - Code for multi-agent deep reinforcement learning.
adeptRL - Reinforcement learning framework to accelerate research.
OpenAI Baselines - Set of high-quality implementations of reinforcement learning algorithms.
Jax (Flax) RL - Jax (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Awesome Offline RL - Collection of research and review papers for offline reinforcement learning.
RL Baselines3 Zoo - Training Framework for Stable Baselines3 Reinforcement Learning Agents.
Ecole - Extensible Combinatorial Optimization Learning Environments. (Web)
RoboDesk - Multi-Task Reinforcement Learning Benchmark.
Cherry - PyTorch Library for Reinforcement Learning Research.
lifelong_rl - PyTorch implementations of RL algorithms.
Meta-World - Open source robotics benchmark for meta- and multi-task reinforcement learning. (Web)
garage - Toolkit for reproducible reinforcement learning research.
Mava - Research framework for distributed multi-agent reinforcement learning. (Paper)
BRAX - Massively parallel rigid body physics simulation on accelerator hardware.
Python MARL - Python Multi-Agent Reinforcement Learning framework.
Sample Factory - High throughput asynchronous reinforcement learning.
Tianshou - Elegant PyTorch deep reinforcement learning library. (Docs)
AlphaGPU - AlphaZero on GPU thanks to CUDA.jl.
rliable - Open-source library for reliable evaluation on reinforcement learning and machine learning benchmarks.
d3rlpy - Offline deep reinforcement learning library. (Web)
Spice.ai - Open source, portable runtime for training and using deep learning on time series data. (HN)
PPO-PyTorch - Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch.
rlberry - Easy-to-use reinforcement learning library for research and education.
SEED RL - Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
MiniHack - Sandbox for Open-Ended Reinforcement Learning Research.
Falken - Provides developers with a service that allows them to train AI that can play their games.
irl-imitation - Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow: Deep MaxEnt, MaxEnt, LPIRL.
DrQ-v2 - Improved Data-Augmented Reinforcement Learning.
RLs - Reinforcement Learning Algorithms Based on PyTorch.
gym-hybrid - Collection of environments for reinforcement learning tasks with discrete-continuous hybrid action spaces.
RL Starter Files - RL starter files to immediately train, visualize and evaluate an agent without writing a line of code.
JORLDY - Open Source Reinforcement Learning Framework.
sinergym - Gym environment for building simulation and control using reinforcement learning.
MetaDrive - Composing Diverse Driving Scenarios for Generalizable RL.
Crafter - Benchmarking the Spectrum of Agent Capabilities.
RLDS - Reinforcement Learning Datasets.
TD3+BC - Minimalist Approach to Offline Reinforcement Learning.
EnvPool - C++-based high-performance parallel environment execution engine for general RL environments. (Docs)
WarpDrive - Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU.
Embodied - Fast reinforcement learning research.
MARL-Baselines3 - Multi-Agent Reinforcement Learning with Stable-Baselines3.
ALF - Reinforcement learning framework emphasizing flexibility and ease of implementing complex algorithms involving many different components.
rvs - Reinforcement Learning via Supervised Learning.
RLHive - Framework designed to facilitate research in reinforcement learning.
Gym-ANM - Design Reinforcement Learning environments that model Active Network Management (ANM) tasks in electricity distribution networks.
DeepRL - Modularized Implementation of Deep RL Algorithms in PyTorch.
RLMeta - Lightweight, flexible framework for Distributed Reinforcement Learning Research.
HandyRL - Handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.