The Wayback Machine - https://web.archive.org/web/20200530011349/https://github.com/zhoubolei/introRL
Skip to content
Intro to Reinforcement Learning (强化学习纲要)
Branch: master
Clone or download

Latest commit

Latest commit f82b80c May 28, 2020

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
asset smaller teaser Apr 2, 2020
README.md update readme May 28, 2020
lecture1.pdf lecture 1 slide Mar 16, 2020
lecture10.pdf finally the last lecture ! May 25, 2020
lecture2.pdf update a typo in bellman equation Mar 25, 2020
lecture3.pdf lecture 3 youtube videos are uploade. B zhan is waiting to be approve… Mar 29, 2020
lecture4.pdf lecture 4 slide Apr 4, 2020
lecture5.pdf lecture 5 slide Apr 16, 2020
lecture6.pdf update lecture9 May 17, 2020
lecture7.pdf lecture7 May 3, 2020
lecture8.pdf slide updated May 10, 2020
lecture9.pdf update lecture9 May 17, 2020

README.md

teaser

Overview

This short RL course introduces the basic knowledge of reinforcement learning. Slides are made in English and lectures are given by Bolei Zhou in Mandarin. The course is for personal educational use only. Please open an issue if you spot some typos or errors in the slides.

Course Schedule

The course is scheduled as follows. There are 10 lectures in total, where the first one was premiered on 16 March 2020 and the last one was finished on 25 May 2020. Thanks for watching and may ReinForce be with you!

Topic Resources
Lecture 1 Overview (课程概括与RL基础) slide, Youtube(part1, part2), B站(上集, 下集)
Lecture 2 Markov Decision Process (马尔科夫决策过程) slide, Youtube(part1, part2), B站(上集, 下集)
Lecture 3 Model-free Prediction and Control (无模型的预测和控制) slide, Youtube(part1, part2), B站(上集, 下集)
Lecture 4 Value Function Approximation (价值函数近似) slide, Youtube(part1, part2), B站(上集, 下集)
Lecture 5 Policy Optimization: Foundation (策略优化基础篇) slide, Youtube(part1, part2), B站(上集, 下集)
Lecture 6 Policy Optimization: State of the art (策略优化进阶篇) slide, Youtube(part1, part2), B站(上集, 下集)
Lecture 7 Model-based RL (基于环境模型的RL) slide, Youtube, B站
Lecture 8 Imitation Learning (模仿学习) slide, Youtube, B站
Lecture 9 Distributed systems for RL (分布式系统) slide, Youtube, B站
Lecture 10 RL in a nutshell (课程结局篇) slide, Youtube, B站
You can’t perform that action at this time.