Lecture slides - here
- Russian materials:
- Lecture - video
- Seminars
- Q-learning seminar - video (older track - assignment)
- SARSA & stuff - video
- English materials:
- Lecture by David Silver (english) - video part I, video part II
- Alternative lecture by Pieter Abbeel (english) - video
- Alternative lecture by John Schulmann (english) - video
- Blog post on q-learning Vs SARSA - url
- N-step temporal difference from Sutton's book - suttonbook chapter 7
- Eligibility traces from Sutton's book - suttonbook chapter 12
- Blog post on eligibility traces - url
Just as usual, start with homework.ipynb
For seminar, implement q-learning agent and test it on Taxi and CartPole with binarizer. And then, implement EV-SARSA agent, experience replay + bonus tasks for homework.