Lectures will be places of discussion where the current topic is summarized by the lecturer and discussed among all present. The students are expected to prepare by reading given material in advance prior to each lecture.
Course lectures will be given during first and second periods on Tuesdays 14:15-16:00. All lectures will be given over Zoom at link https://aalto.zoom.us/j/63645678644. Please download and install Zoom before the first lecture to attend the course. Lecture recordings will likely be available afterwards, but this cannot be guaranteed due to potential technical issues. The lectures are interactive in nature so that participation is encouraged.
Schedule and Readings
For each lecture starting from the third one, there will be reading materials that the students should study before attending the lecture.
Course arrangements, Overview, Tue 8.9., no readings
Markov decision processes, Tue 15.9., Sutton & Barto, chapters 2-2.3, 2.5-2.6, 3-3.8
RL in discrete domains (value-based RL), Tue 22.9., Sutton&Barto Ch. 5-5.4, 5.6, 6-6.5
Function approximation, Tue 29.9., Sutton&Barto Ch. 9-9.3, 10-10.1
Policy gradient, Tue 6.10., Sutton&Barto, Ch. 13-13.3
Actor-critic, Tue 13.10., Sutton & Barto, Ch. 13.5, 13.7
Towards model-based reinforcement learning: optimal control, Tue 20.10. Platt: Introduction to Linear Quadratic Regulation
Model-based reinforcement learning, Tue 27.10., Sutton & Barto, Ch. 8-8.2
Guest lecture: Safety and constraints, Tue 3.11.