Schedule

Event

Date

Description

Course Material
Lecture

09/19/2025
Friday

Introduction
[Course policy] [Intro2RL]
Suggested Readings:
Lecture

09/26/2025
Friday

Basic Concepts in Reinforcement Learning
[slides]
Suggested Readings:
Lecture

10/10/2025
Friday

Multi-Armed Bandits
[slides] [KDD20 Tutorial]
Suggested Readings:
Due

10/16/2025 23:59
Thursday

Project Idea Due
1. Required content: A sales pitch for your course project idea, especially what excites you about it.
2. Purpose: Find teammates who share your passion and have complementary skills.
3. Where to submit: Post on our 网络学堂 (Xuetang) forum.
Assignment

10/18/2025
Saturday

MP #1 - Bandit Algorithms released!

[MP #1 - Bandit Algorithms] [Solutions]
Quiz

10/24/2025
Friday

Quiz #1 - RL Basics & Bandits
[solution]

This quiz is designed to cover the basic concepts of reinforcement learning and multi-armed bandits.
Lecture

10/24/2025
Friday

Markov Decision Process
[slides]
Suggested Readings:-
Due

10/25/2025 23:59
Saturday

MP #1 due
Due

10/27/2025 23:59
Monday

Project Proposal Due
1. Required template: Use the latest NeurIPS template for your project proposal.
2. Maximum length: 3 pages, excluding references and appendix.
3. Where to submit: A submission page will be created on Xuetang (网络学堂). Each group needs to submit only one proposal there; please name your submission “name[+name]*-proposal.pdf”, for example, “王宏宁-李旭浚-proposal.pdf”.
Lecture

10/31/2025
Friday

Dynamic Programming
[slides]
Suggested Readings:
Lecture

11/07/2025
Friday

Monte Carlo Methods
[slides]
Suggested Readings:-
Assignment

11/07/2025
Friday

MP #2 - Markov Decision Process released!

[MP #2 - Markov Decision Process] [Solutions]
Lecture

11/14/2025
Friday

Temporal-Difference Learning
[slides]
Suggested Readings:-
Due

11/15/2025 23:59
Saturday

MP #2 due
Quiz

11/21/2025
Friday

Quiz #2 - DP & MC
[solution]

This quiz is designed to cover essential concepts in dynamic programming and Monte Carlo methods.
Lecture

11/28/2025
Friday

Policy Gradient Methods
[slides]
Suggested Readings:-
Assignment

12/06/2025
Saturday

MP #3 - Temporal difference and policy gradient methods released!

[MP #3 - Temporal difference and policy gradient methods]
Quiz

12/12/2025
Friday

Quiz #3 - TD & PG
[solution]

This quiz is designed to cover essential concepts in temporal-difference and policy gradient methods.
Lecture

12/12/2025
Friday

Approximation Methods
[slides]
Suggested Readings:-
Due

12/13/2025 23:59
Saturday

MP #3 due
Lecture

12/19/2025
Friday

Deep Reinforcement Learning
[slides]
Suggested Readings:-
Assignment

12/27/2025
Saturday

MP #4 - Deep RL methods released!

[MP #4 - Deep RL methods]
Quiz

01/02/2026
Friday

Quiz #4 - Deep RL
[solution]

This quiz is designed to cover essential concepts in deep reinforcement learning methods.
Due

01/02/2026 15:35
Friday

Project Presentation
1. Presentation location: Room 4104, Teaching Building 4 (四教4104).
2. Presentation length: at most 12 minutes, including Q&A, given in person.
3. Presentation format: any format you prefer, such as PowerPoint slides or a live demonstration.
Due

01/03/2026 23:59
Saturday

MP #4 due
Due

01/14/2026 23:59
Wednesday

Project Report Due
1. Required template: You should use the same template that you have used for your project proposal.
2. Maximum length: 6 pages, excluding references and appendix.
3. Where to submit: A submission page will be created on Xuetang (网络学堂). Each group needs to submit only one report there; please name your submission “name[+name]*-report.pdf”, for example, “王宏宁-李旭浚-report.pdf”.