ikostrikov / pytorch-a2c-ppo-acktr-gail

Star

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

reinforcement-learning deep-learning deep-reinforcement-learning pytorch atari hessian second-order continuous-control actor-critic ale mujoco proximal-policy-optimization ppo advantage-actor-critic a2c acktr natural-gradients roboschool kfac kronecker-factored-approximation

Updated Jan 17, 2021
Python

deepmind / dm_control

Star

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

machine-learning reinforcement-learning deep-learning artificial-intelligence neural-networks physics-simulation mujoco

Updated Jan 15, 2021
Python

IntelLabs / coach

Star

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

reinforcement-learning deep-learning mxnet tensorflow openai-gym rl starcraft imitation-learning hierarchical-reinforcement-learning coach mujoco starcraft2 onnx roboschool carla starcraft2-ai distributed-reinforcement-learning

Updated Dec 21, 2020
Python

MushroomRL / mushroom-rl

Star

Python library for Reinforcement Learning.

reinforcement-learning qlearning deep-learning deep-reinforcement-learning openai-gym pytorch dqn rl atari ddpg sac trpo mujoco pybullet

Updated Jan 20, 2021
Python

rlworkgroup / metaworld

Star

An open source robotics benchmark for meta- and multi-task reinforcement learning

multi-task mujoco meta-rl benchmark-environments

Updated Jan 23, 2021
Python

navneet-nmk / pytorch-rl

Star

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Updated Jul 14, 2019
Python

zuoxingdong / lagom

Star

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

python machine-learning research reinforcement-learning deep-learning deep-reinforcement-learning pytorch artificial-intelligence policy-gradient ddpg sac cem cmaes evolution-strategies mujoco deep-deterministic-policy-gradient proximal-policy-optimization ppo td3 soft-actor-critic

Updated Feb 26, 2020
Jupyter Notebook

pat-coady / trpo

Star

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

machine-learning reinforcement-learning tensorflow policy-gradient mujoco

Updated Jun 2, 2020
Jupyter Notebook

ikostrikov / pytorch-trpo

Star

PyTorch implementation of Trust Region Policy Optimization

reinforcement-learning deep-learning deep-reinforcement-learning pytorch continuous-control trpo mujoco trust-region-policy-optimization

Updated Sep 13, 2018
Python

denisyarats / drq

Star

DrQ: Data regularized Q

python control reinforcement-learning deep-learning pixel deep-reinforcement-learning pytorch gym rl data-augmentation sac actor-critic mujoco model-free off-policy dm-control drq soft-actor-crit

Updated Jan 16, 2021
Jupyter Notebook

chingyaoc / pytorch-REINFORCE

Star

PyTorch Implementation of REINFORCE for both discrete & continuous control

reinforcement-learning pytorch gym reinforce continuous-control mujoco

Updated Apr 16, 2017
Python

denisyarats / pytorch_sac

Star

PyTorch implementation of Soft Actor-Critic (SAC)

reinforcement-learning deep-learning deep-reinforcement-learning pytorch gym sac continuous-control actor-critic mujoco dm-control soft-actor-critic d4pg

Updated Sep 19, 2020
Python

aravindr93 / mjrl

Star

Reinforcement learning algorithms for MuJoCo tasks

reinforcement-learning robotics simulation mujoco

Updated Jan 12, 2021
Python

denisyarats / pytorch_sac_ae

Star

PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)

reinforcement-learning deep-learning pixels deep-reinforcement-learning pytorch gym autoencoder actor-critic mujoco image-based dm-control soft-actor-critic

Updated May 3, 2020
Python

RchalYang / torchrl

Star

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

algorithm reinforcement-learning pytorch dqn gym ddpg sac trpo mujoco ppo td3 rl-algorithms policy-agent

Updated Jan 1, 2021
Python

pokaxpoka / sunrise

Star

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

deep-neural-networks reinforcement-learning deep-learning deep-reinforcement-learning rainbow rl codebase deep-q-network sac deep-q-learning mujoco model-free off-policy dm-control soft-actor-critic

Updated Jul 12, 2020
Python

andrewliao11 / pytorch-a3c-mujoco

Star

Implement A3C for Mujoco gym envs

reinforcement-learning pytorch a3c continuous-control actor-critic mujoco

Updated Nov 2, 2017
Python

RITCHIEHuang / DeepRL_Algorithms

Star

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

deep-reinforcement-learning dqn policy-gradient reinforcement-learning-algorithms reinforcement trpo mujoco pytorch-rl ppo td3 pytorch-implementation soft-actor-critic tensorflow2 policygradient

Updated Jan 21, 2021
Python

aravindr93 / trajopt

Star

Trajectory optimization algorithms for robotic control.

robotics trajectory-optimization robotics-control mujoco

Updated Sep 19, 2020
Python

nicklashansen / policy-adaptation-during-deployment

Star

Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.

reinforcement-learning deep-learning robotics deep-reinforcement-learning pytorch gym mujoco self-supervised-learning dm-control

Updated Nov 24, 2020
Python

cxy1997 / Robotiq-UR5

Star

Simulator of UR5 robotic arm with Robotiq gripper, built with MuJoCo

ur5 simulation-modeling mujoco robotiq-gripper rl-environment

Updated Mar 4, 2018
C++

navneet-nmk / Pytorch-RL-CPP

Star

A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)

Updated Jul 18, 2019
C++

PaulDanielML / MuJoCo_RL_UR5

Star

A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.

reinforcement-learning computer-vision robotics mujoco gym-environment pick-and-place