IN3050 - Spring 2021

Week 12: April 7-April 14

Interactive session, Wednesday April 7

Zoom link (needs UiO login)
Anonymous feedback and suggestions
No Recording

Weekly lecture:

Slides:

Reinforcement learning

Videos:

The reinforcement learning problem
Reward and action selection
Policy and value
The q-learning algorithm
Q-learning example
On-policy and off-policy learning

Readings:

Marsland Chapter 11

Optional Reading

The readings below go beyond the syllabus for this class, so do not worry if you don't understand everything. But, it can provide some interesting perspectives if you want to dig deeper into RL.

From Q-Learning to Deep Q-Learning: https://towardsdatascience.com/reinforcement-learning-tutorial-part-3-basic-deep-q-learning-186164c3bf4

The Paths Perspective on Value Learning: https://distill.pub/2019/paths-perspective-on-value-learning/

(Advanced) Article showing important limitations of state-of-the-art Reinforcement Learning: https://thegradient.pub/why-rl-is-flawed/

Weekly exercises

Exercises
Solution
All group sessions will be digital this week.

Published Apr. 6, 2021 8:50 AM - Last modified Jan. 17, 2022 8:58 AM