- Lecture slides
- Our videos: lecture seminar (russian)
- [main] lecture by David Silver - url
- Alternative lecture by Pieter Abbeel (english): part 1, part 2
- Alternative lecture by John Schulmann (english): video
- Definitive guide in policy/value iteration from Sutton: start from page 81 here.
The main assignment is seminar_vi.ipynb
notebook in this week's folder. It has no requirements besides the most basic data science libraries (e.g. numpy) so you should be able to run it locally.
Note: if you have any difficulty using graphviz, just set has_graphviz=False
.