强化学习理论(RL),重点是样本复杂性分析。
- Basics of MDPs and RL.
- Sample complexity analyses of tabular RL.
- Policy Gradient.
- Off-policy evaluation.
- State abstraction theory.
- Sample complexity analyses of approximate dynamic programming.
- PAC exploration theory (tabular).
- PAC exploration theory (function approximation).
- Partial observability and dynamical system modeling.
http://nanjiang.cs.illinois.edu/cs598/