Simple Reinforcement learning tutorials
machine-learning
tutorial
reinforcement-learning
q-learning
dqn
policy-gradient
sarsa
tensorflow-tutorials
a3c
deep-q-network
ddpg
actor-critic
asynchronous-advantage-actor-critic
double-dqn
prioritized-replay
sarsa-lambda
dueling-dqn
deep-deterministic-policy-gradient
proximal-policy-optimization
ppo
-
Updated
Aug 5, 2020 - Python


There seems to be some issues stemming from some Pytorch changes, seemingly to do with float/double. See: https://travis-ci.com/github/cpnota/autonomous-learning-library/builds/179286204