Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
machine-learning
tutorial
reinforcement-learning
deep-reinforcement-learning
q-learning
pomdps
policy-gradient
sarsa
a3c
dynamic-programming
imitation-learning
dyna
td-learning
actor-critic
meta-learning
-
Updated
Jan 22, 2019 - Jupyter Notebook

